normal basis is usin ing novel concurrent seria ial
play

Normal Basis is Usin ing Novel Concurrent Seria ial Squarin ing - PowerPoint PPT Presentation

A New Mult ltipli licative In Inverse Archit itecture in in Normal Basis is Usin ing Novel Concurrent Seria ial Squarin ing and Mult ltipli lication Amin Monfared, Hayssam El-Razouk and Arash Reyhani-Masoleh Presented by: Arash


  1. A New Mult ltipli licative In Inverse Archit itecture in in Normal Basis is Usin ing Novel Concurrent Seria ial Squarin ing and Mult ltipli lication Amin Monfared, Hayssam El-Razouk and Arash Reyhani-Masoleh Presented by: Arash Reyhani-Masoleh Department of Electrical and Computer Engineering Western University, London, Ontario, Canada 24 th IEEE Symposium on Computer Arithmetic, 2017 1

  2. Outline β€’ Motivation β€’ Arithmetic operations over 𝐻𝐺(2 𝑛 ) using Gaussian Normal Basis (GNB) β€’ Proposed digit-level square-multiply architecture β€’ It computes 𝐡 Γ— 𝐢 2 𝑓 β€’ Both digits of inputs 𝐡 and 𝐢 are entered serially β€’ Denoted by Digit-Level Fully Serial-In Square-Multiply (DL-FSISM) β€’ Proposed inversion architecture β€’ It uses the DL-FSISM β€’ ASIC implementations and comparison β€’ Conclusions and future work 2

  3. Motivation: Fin inite Fields β€’ Many applications use arithmetic operations over 𝐻𝐺(2 𝑛 ) β€’ Cryptography: Elliptic Curve, AES β€’ Error control coding β€’ Reed-Solomon code β€’ There are different bases to represent a field element. β€’ Polynomial basis, normal basis (NB), dual basis, etc. β€’ In NB, squaring is free in hardware. 3

  4. Motivation: Gaussian Normal Basis is (GNB) β€’ GNB over 𝐻𝐺 2 𝑛 is a special class of NB and exists whenever 𝑛 is not divisible by 8. β€’ GNBs have been included in IEEE and NIST standards for ECDSA. β€’ Any field element 𝐡 can be represented as π‘›βˆ’1 𝑏 𝑗 𝛾 2 𝑗 , where 𝑏 𝑗 πœ—{0,1} and 𝐡 = ෍ 𝑗=0 {𝛾, … , 𝛾 2 π‘›βˆ’1 } is a GNB over 𝐻𝐺 2 𝑛 . β€’ In this paper, we consider GNB and propose new digit-level architectures for square-multiply and inversion. 4

  5. ic Operations over 𝐻𝐺(2 𝑛 ) using GNB Arit ithmetic GNB β€’ Addition β€’ Let 𝐡 and 𝐢 be two Field elements represented in GNB. β€’ The addition operation is bit-wise XOR operation of the coordinates of the two inputs: π‘›βˆ’1 (𝑏 𝑗 +𝑐 𝑗 )𝛾 2 𝑗 𝐡 + 𝐢 = ෍ 𝑗=0 β€’ Squaring β€’ Squaring operation is performed by right cyclic shift of the coordinates of 𝐡 : π‘›βˆ’1 𝐡 2 = ෍ 𝑏 𝑗 𝛾 2 𝑗+1 𝑗=0 β€’ It is free in hardware if all coordinates are available in parallel. 5

  6. Arit ithmetic ic Operations usin ing GNB: : Mult ltip ipli lication β€’ Finite field multiplication is more complex than addition and squaring. β€’ Multiplication can be implemented in digit-level architectures, in which the digit size can be chosen based on available resources. β€’ In this paper, we have used two different types of digit-level multiplier namely: β€’ Digit-Level Parallel-In Serial-Out (DL-PISO) β€’ Digit-Level Parallel-In Parallel-Out (DL-PIPO) β€’ Also, we have proposed a new multiplier/squarer architecture β€’ Digit-Level Fully Serial-In Square-Multiply (DL-FSISM). 6

  7. Arit ithmetic ic Operations usin ing GNB: : In Inversion β€’ Based on Fermat Little Theorem, an inversion can be calculated by β€’ 𝐡 βˆ’1 = 𝐡 2 𝑛 βˆ’2 ∈ 𝐻𝐺 2 𝑛 , 𝐡 β‰  0. β€’ In Itoh and Tsujii algorithm (ITA) [4], the number of multiplications is reduced based on decomposing 2 π‘›βˆ’1 βˆ’ 1 β€’ As an example for the NIST recommended field over 𝐻𝐺(2 233 ) : 2 232 βˆ’ 1 = (1 + 2)(1 + 2 2 )(1 + 2 4 )(1 + 2 8 (1 + 2 8 )(1 + 2 16 )(1 + 2 32 (1 + 2 32 )(1 + 2 64 (1 + 2 64 )))) β€’ The inversion using ITA takes a total of 10 iterations. β€’ Each iteration consists of one single digit-level parallel-in parallel- out (DL-PIPO) multiplication and one free squaring. -------------------------------------------------------------------------------------- 7 [4] T. Itoh and S. Tsujii , β€œA fast algorithm for computing multiplicative inverses in GF(2 m ) using normal bases,” Information and computation, vol. 78, no. 3, pp. 171 – 177, 1988.

  8. Arit rithmetic ic Operatio ions usi sing GNB: In Inversio ion ( cont’d) β€’ Our inversion flow diagram (based on ITA) uses an interleaved computations of digit-level parallel-in serial-out (DL-PISO) multiplier and our new DL-FSISM architecture. β€’ It only needs a total of 5 iterations. β€’ Each iteration consists of two single multiplications (and squarings) β€’ In this paper, we propose a new digit-level fully serial-in parallel-out square-multiply (DL-FSISM) architecture which performs concurrent squaring and multiplication without introducing any delay. 8

  9. Proposed Dig igit it-Level l Fully lly Se Seri rial-In Sq Square-Mult ltip iply ly (DL-FSISM) (D β€’ Let A and B be field elements and e be an integer. β€’ The proposed scheme reads the inputs of A and B digit-by- digit serially and concurrently computes 𝐺 = 𝐡 Γ— 𝐢 2 𝑓 . β€’ The composite operations of squaring and multiplication are concurrently performed without introducing any additional delay. 𝑛 β€’ For a digit size of 𝑒 bits, it would take ⌈ 𝑒 βŒ‰ clock cycles to generate the result 𝐺 = 𝐡 Γ— 𝐢 2 𝑓 . 9

  10. Proposed DL-FSISM: Key y Formulation Proposition 1: Let 𝐡 and 𝐢 be two 𝐻𝐺(2 𝑛 ) elements that are represented in GNB {𝛾, … , 𝛾 2 π‘›βˆ’1 } . One can compute 𝐺 = 𝐡𝐢 2 𝑓 , by proceeding from 𝑗 = 0 to 𝑙 βˆ’ 1 , the result 𝐺 = 𝐺 π‘™βˆ’1 = 𝐡 (π‘™βˆ’1) (𝐢 π‘™βˆ’1 ) 2 𝑓 is obtained using the following recurrence relation 𝐺 𝑗 = 𝐺 π‘—βˆ’1 2 𝑒 + Οƒ π‘˜=0 2 𝑓 π‘’βˆ’1 πœ€ π‘˜ 𝑏 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐢 𝑗 + 2 π‘’βˆ’π‘“ π‘’βˆ’1 πœ€ ) 2 𝑓 𝑐 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐡 π‘—βˆ’1 (Οƒ π‘˜=0 π‘˜ π‘›βˆ’1 𝑀 π‘š 𝛾 2 π‘š ∈ 𝐻𝐺 2 𝑛 . π‘˜ 𝑣, π‘Š = π‘£π‘Šπ›Ύ 2 π‘˜ , u πœ— 0,1 and π‘Š = Οƒ π‘š=0 where πœ€ 10

  11. Proposed DL-FSISM: Archit itecture π‘’βˆ’1 π‘’βˆ’1 π‘—βˆ’1 2 𝑒 + ෍ 2 𝑓 2 π‘’βˆ’π‘“ ) 2 𝑓 π‘˜ 𝑏 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐢 𝑗 𝑐 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐡 π‘—βˆ’1 𝐺 𝑗 = 𝐺 πœ€ + (෍ πœ€ π‘˜ π‘˜=0 π‘˜=0 β€’ Three registers X, a d(k-1-i)+d- 1 in1 1 d m-d d m B (i) Β»e n Y, and Z are d- 1 m e n in2 B B B m - - i - m m 0 k 1 k 1 B (i) + 0 m-d -1 <Y> initially cleared d n d 0 m -1 a d(k-1-i)+ 0 m <Z> in1 1 + d d 𝐡𝐢 2 𝑓 β€’ Digits of inputs 0 m in2 m m d are entered to X b d(k-1-i)+d- 1 in1 1 d and Y serially ((A (i-1) Β»d)Β«e n ) m-d m d- 1 m e n d in2 m + A (i- 1 ) from MSB A A A - - - e n 0 k 1 i k 1 n b d(k-1-i)+ 0 0 m-d -1 m m in1 <X> 1 d d β€’ After ⌈ 𝑛 n 0 m 𝑒 βŒ‰ clock in2 m cycles, Z contains 𝐡𝐢 2 𝑓 11

  12. Proposed DL-FSISM: Archit itecture (cont’d) π‘’βˆ’1 π‘’βˆ’1 π‘—βˆ’1 2 𝑒 + ෍ 2 𝑓 2 π‘’βˆ’π‘“ ) 2 𝑓 π‘˜ 𝑏 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐢 𝑗 π‘˜ Γ— 𝑐 𝑒 π‘™βˆ’1βˆ’π‘— +π‘˜ , 𝐡 π‘—βˆ’1 𝐺 𝑗 = 𝐺 πœ€ + (෍ πœ€ π‘˜=0 π‘˜=0 a d(k-1-i)+d- 1 in1 1 d m-d d m B (i) Β»e n d- 1 m e n in2 B B B - - i - m m m k 1 0 k 1 B (i) + 0 m-d -1 <Y> d n d 0 m -1 a d(k-1-i)+ 0 m <Z> in1 1 + d d 0 m in2 m m d b d(k-1-i)+d- 1 in1 1 d ((A (i-1) Β»d)Β«e n ) m-d m d- 1 m e n in2 d m + A (i- 1 ) A A A - - - e n 0 k 1 i k 1 n b d(k-1-i)+ 0 m πœ€ 0 m-d -1 m in1 <X> 1 π‘˜ d d n 0 m in2 m 1 1 in1 in2 1  m j j 1 m m m m e 1 e 1 m m 1 X 2 -e n X 2 e n e n e n 1 X X m m 1 m m m m e v m e v m n n 12

  13. Proposed In Inversion Archit itecture β€’ The inversion core is made by serially connecting of DL-PISO and DL-FSISM β€’ The register file only stores from the multipliers β€’ 𝑒 -bits register is Ξ΅ = {2,8,32,64} 32 added between two multipliers to shorten the critical path β€’ Each iteration selects one of inputs of multiplexers and takes ⌈ 𝑛 𝑒 βŒ‰ +1 clock cycles 13

  14. In Inversion Archit itecture Comparison (Number of It Iterations) Architecture Algorithm Multiplication Number of m = 163 m = 233 m = 283 m = 409 m = 571 type Iterations [4] ITA 1 Γ— Single N 1 9 10 11 11 13 [7, 6] TIT/MTIT 1 Γ— double N 2 5 9 8 7 8 [8] Optimal-3 1 Γ— double N 3 5 7 6 7 7 chain ⌈ N 1 Proposed ITA 2 Γ— Single 5 5 6 6 7 2 βŒ‰ Interleaved β€’ Our Proposed inversion architecture reduces the required number of iterations as compared with previous works. β€’ The best performance is achieved when 𝑛 = 233. [4] T. Itoh and S. Tsujii , β€œA fast algorithm for computing multiplicative inverses in GF(2m) using normal bases,” Information and computation, vol. 78, no. 3, pp. 171 – 177, 1988. [6] J. Hu, W. Guo , J. Wei, and R. Cheung, β€œFast and Generic Inversion Architectures Over GF(2m) Using Modified Itoh– Tsujii Algorithms,” IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 62, pp. 367– 371, April 2015. [7] R. Azarderakhsh, K. Jarvinen, and V. Dimitrov , β€œFast Inversion in GF(2m) with Normal Basis Using Hybrid - Double Multipliers,” IEEE Trans. Comput., vol. 63, pp. 1041 – 1047, April 2014. [8] K. Jarvinen, V. Dimitrov, and R. Azarderakhsh , β€œA Generalization of Addition Chains and Fast Inversions in Binary Fields,” IEEE Trans. Comput., vol. 64, pp. 2421 – 2432, Sept. 2015. 14

Recommend


More recommend