Block-Level Relaxation for Timing-Robust Asynchronous Circuits - PowerPoint PPT Presentation

Block-Level Relaxation for Timing-Robust Asynchronous Circuits Based on Eager Evaluation Cheolj oo Jeong* S teven M. Nowick Computer S cience Department Columbia University *[now at Cadence Design S ystems]

Outline 1. Introduction 2. Background: Asynchronous Threshold Networks 3. Gate-Level Relaxation 4. Block-Level Relaxation 5. Experimental Results 6. Conclusions and Future Work 2

Recent Challenges in Microelectronics Design • Reliability challenge – Variability issues in deep submicron technology • process, temperature, voltage • noise, crosstalk – Dynamic voltage scaling • Communication challenge – Increasing disparity between gate and wire delay • Productivity challenge – Increasing system complexity + heterogeneity – S hrinking time to market, timing closure issues – Even when IP blocks are used, interface timing verification is difficult 3

Benefits and Challenges of Asynchronous Circuits • Potential benefits: – Mitigates timing closure problem – Low power consumption – Low electromagnetic interference (EMI) – Modularity, “ plug-and-play” composition – Accommodates timing variability • Challenges: – Robust design is required: hazard-freedom – Area overhead (sometimes) – Lack of CAD tools – Lack of systematic optimization techniques 4

Asynchronous Threshold Networks • Asynchronous threshold networks – One of the most robust asynchronous circuit styles – Based on delay-insensit ive encoding • Communication: robust to arbitrary delays • Logic block design: imposes very weak timing constraints (1-sided) • S imple example: OR2 a 0 z 0 C a 1 a C z b z 1 C b 0 C b 1 Boolean OR2 gate Async dual-rail threshold network for OR2 5

Challenges and Overall Research Goals • Challenges in asynchronous threshold network synthesis – Large area and latency overheads – Few existing optimization techniques – Even less support for CAD tools • Overall Research Agenda: – Develop systematic optimization techniques and CAD tools for highly-robust asynchronous threshold networks – S upport design-space exploration: automated scripts, target different cost functions – Current optimization targets: area + delay + delay-area tradeoffs – Future extensions: power (straightforward) 6

Overall Research Goals Two automated optimization techniques proposed 1. Relaxation algorithms: multi-level optimization – Existing synthesis approaches are conservative = over-designed – Approach: selective use of eager-evaluation logic • without affecting overall circuit’ s timing robustness – Can apply at two granularities: • gate-level [Jeong/ Nowick AS PDAC-07, Zhou/ S okolov/ Yakovlev ICCAD-06] • block-level [NEW] 7

Overall Research Goals (cont.) 2. Technology mapping algorithms – First general and systematic technology mapping for robust asynchronous threshold networks [Jeong/ Nowick Async-06, IEEE Trans. On CAD (April 2008)] – Evaluated on substantial benchmarks: • > 10,000 gates, > 1000 inputs/ outputs • Industrial (Theseus Logic): DES , GCD • Academic: large MCNC circuits – Use fully-characterized industrial cell library (Theseus Logic): • slew rate, loading, distinct i-to-o paths/ rise vs. fall transitions – Advanced technique: area optimization under hard delay constraints – S ignificant average improvements: • Delay: 31.6% , Area: 9.5% (runtime: 6.2 sec) “ATN_OPT” CAD Package: downloadable (for Linux) http: / / www.cs.columbia.edu/ ~ nowick/ asynctools 8

Basic S ynthesis Flow (Theseus Logic/ Camgian Networks) S ingle-rail Boolean network Considered as abst ract mult i-valued circuit simple dual-rail expansion (delay-insensitive encoding) Inst ant iat ed Boolean circuit Dual-rail async threshold network (robust , unopt imized) 9

New Optimized S ynthesis Flow S ingle-rail Boolean network Relaxation (i.e. relaxed dual-rail expansion) “ Relaxed” dual-rail async threshold network opt imized Technology mapping Optimally-mapped dual-rail async threshold opt imized network 10

New Optimized S ynthesis Flow Focus of this paper S ingle-rail Boolean network Relaxation (i.e. relaxed dual-rail expansion) “ Relaxed” dual-rail async threshold network opt imized Technology mapping Optimally-mapped dual-rail async threshold opt imized network 11

S ingle-Rail Boolean Networks • Boolean Logic Network: S t art ing point for dual-rail circuit synt hesis – Modelled using three-valued logic with {0, 1, NULL} • 0/ 1 = data values, NULL = no data (invalid data) – Computation alternates between DATA and NULL phases N 1 1 N N a N N 1 1 N z N 0 N 0 N b 3-valued 3-valued inputs output Boolean OR gate – DATA (Evaluate) phase: • outputs have DATA values only after all inputs have DATA values – NULL (Reset) phase: • outputs have NULL values only after all inputs have NULL values 13

Delay-Insensitive Encoding • Approach: – S ingle Boolean signal is represented by two wires – Goal: map abstract Boolean netlist to robust dual-rail asynchronous circuit spacer a 1 a 0 a a 0 a 0 0 NULL a 1 dual-rail 0 1 0 valid expansion data 1 0 1 Not allowed 1 1 invalid Encoding table - Motivation: robust data communication 14

Dual-Rail Expansion S ingle Boolean gate: expanded into dual-rail network dual-rail complete set dual-rail output 3-valued of minterms 0-rail inputs inputs 3-valued output a 0 z 0 C a 1 a C z b z 1 C b 0 C b 1 1-rail Boolean OR gate “ DIMS ” -style dual-rail OR circuit 15

S ummary: Existing S ynthesis Approach • S tarting point: single-rail abstract Boolean network (3-valued) • Approach: performs dual-rail expansion of each gate – Use 'template-based' mapping • End point: unoptimized dual-rail asynchronous threshold network • Result: timing-robust asynchronous netlist a 0 b 0 C C x C C a 1 a z 0 b 1 C C z C a 0 z 1 b 0 C C b y C C a 1 b 1 C Boolean logic network Dual-rail asynchronous threshold network 16

Hazard Issues • Ideal Goal = Delay-Insensitivity (delay model) – Allows arbitrary gate and wire delay • circuit operates correctly under all conditions – Most robust design style • when circuit produces new output, all gates stable = “ timing robustness” • “ Orphans” = hazards to delay-insensitivity – “ unobservable ” signal t ransit ion sequences – Wire orphans : unobservable wires at fanout – Gat e orphans : unobservable paths at fanout 17

Hazard Issues • Wire orphan example: C primary outputs 0 C 0 0 wire orphan! = unobservable wire transition (at fanout point) Wire orphan example If unobservable wire too slow, will interfere with next data item (glitch) 18

Hazard Issues • Gate orphan example: gate orphan! = unobservable path through 1+ gates (at fanout point) a 0 z 0 0 C b 0 0 0 z 1 C a 1 0 b 1 Gate orphan example If unobservable path too slow, will interfere with next data item (glitch) 19

Hazard Issues: S ummary • Wire orphans: typically not a problem in practice – unobserved signal transition on wire (at fanout point) – S olution: handle during physical synthesis (e.g. Theseus Logic) • enforce simple 1-sided timing constraint • Gate orphans: difficult to handle – unobserved signal transition on path (at fanout point) – can result in unexpected glitches: if delays too long – harder to overcome with physical design tools invariant of the proposed optimization algorithms: ensure no gate orphans introduced 20

Overview of Relaxation • Relaxation: Multi-level optimization – Allows more efficient dual-rail expansion using eager-evaluating logic – Idea: select ively replace some gates by eager blocks • either at gat e-level or block-level – Advantage: if carefully performed, no loss of overall circuit robustness • Proposed flow S ingle-rail Boolean network Relaxation Relaxed dual-rail async threshold network opt imized 22

Input Completeness • A dual-rail implementation of a Boolean gate is input-complete w.r.t. its input signals if an output changes only aft er all the inputs arrive. a 0 C z 0 b 0 a C z b z 1 C a 1 C b 1 Boolean OR gate Input-complete dual-rail OR network (input complete w.r.t. input signals a and b) Enforcing input completeness for every gate is the traditional synthesis approach to avoid hazards (i.e. gate orphans). 23

Input Incompleteness • A dual-rail implementation of a Boolean gate is input-incomplete w.r.t. its input signals (“ eager-evaluating” ), if the output can change before all inputs arrive. a 0 z 0 b 0 a z b a 1 z 1 b 1 Boolean OR gate Input -incomplet e dual-rail OR network 24

Block-Level Relaxation for Timing-Robust Asynchronous Circuits - PowerPoint PPT Presentation

Block-Level Relaxation for Timing-Robust Asynchronous Circuits Based on Eager Evaluation Cheolj oo Jeong* S teven M. Nowick Computer S cience Department Columbia University *[now at Cadence Design S ystems] Outline 1. Introduction

How to Design Fast Asynchronous How to Design Fast Asynchronous Routers for Asynchronous Routers

AN ASYNCHRONOUS DIVIDER IMPLEMENTATION Navaneeth Jamadagni and Jo Ebergen 2 Asynchronous

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Asynchronous Replication

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification

Timing and Coordination Essential Knowledge 2.E.2 and 2.E.3 Timing and Coordination Timing

Problem 1 k zero bits n bits IV Block Block Block Block Cipher Cipher Cipher Cipher

Asynchronous sequence circuits An asynchronous sequence machine is a sequence circuit without

Planning and Optimization October 16, 2019 C2. Delete Relaxation: Properties of Relaxed

Debye Relaxation Model MSE/EE 590 Discussion 1 Pressure Broadening and Debyes Relaxation

Liberty Timing File (LIB) Advanced VLSI Design CMPE 641 Liberty Timing File The .lib file is an

Timing Library Format (TLF) Advanced VLSI Design CMPE 414 Timing Library Format (TLF) TLF is an

Timing Analysis Timing Path Groups and Types Timing paths are grouped into path groups

Digital Design Discussion: RTL Storage Components Shift Register Timing Register File Timing

Clocking & Timing Asynchronous Self Timed Design Self Timed Design Synchronous Circuit

Policy Exploration for JITDs (Java) Team Datum Recap Experimentation on current policies

Social Network Games Krzysztof R. Apt CWI & and University of Amsterdam Joint work with

WHOM TO BEFRIEND TO INFLUENCE PEOPLE Manuel Lafond (University of Montreal) Lata Narayanan

Blockly Lists & Iteration CT @ VT Things we are seeing Using lists to represent a data

Circular Coinduction A Proof Theoretical Foundation su 1 Dorel Lucanu 2 Grigore Ro 1

Codings of rotations are full A. Blondin Mass e S. Brlek S. Labb e L. Vuillon

Dendric words and dendric subshifts Paulina CECCHI B . (Joint work with Val e ) erie Berth

r rts rtrst

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Block-Level Relaxation for Timing-Robust Asynchronous Circuits - PowerPoint PPT Presentation

Block-Level Relaxation for Timing-Robust Asynchronous Circuits Based on Eager Evaluation Cheolj oo Jeong* S teven M. Nowick Computer S cience Department Columbia University *[now at Cadence Design S ystems] Outline 1. Introduction

How to Design Fast Asynchronous How to Design Fast Asynchronous Routers for Asynchronous Routers

AN ASYNCHRONOUS DIVIDER IMPLEMENTATION Navaneeth Jamadagni and Jo Ebergen 2 Asynchronous

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Asynchronous Replication

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

Outlier Outlier Outlier- Outlier - -robust - robust robust robust identification

Timing and Coordination Essential Knowledge 2.E.2 and 2.E.3 Timing and Coordination Timing

Problem 1 k zero bits n bits IV Block Block Block Block Cipher Cipher Cipher Cipher

Asynchronous sequence circuits An asynchronous sequence machine is a sequence circuit without

Planning and Optimization October 16, 2019 C2. Delete Relaxation: Properties of Relaxed

Debye Relaxation Model MSE/EE 590 Discussion 1 Pressure Broadening and Debyes Relaxation

Liberty Timing File (LIB) Advanced VLSI Design CMPE 641 Liberty Timing File The .lib file is an

Timing Library Format (TLF) Advanced VLSI Design CMPE 414 Timing Library Format (TLF) TLF is an

Timing Analysis Timing Path Groups and Types Timing paths are grouped into path groups

Digital Design Discussion: RTL Storage Components Shift Register Timing Register File Timing

Clocking &amp; Timing Asynchronous Self Timed Design Self Timed Design Synchronous Circuit

Policy Exploration for JITDs (Java) Team Datum Recap Experimentation on current policies

Social Network Games Krzysztof R. Apt CWI &amp; and University of Amsterdam Joint work with

WHOM TO BEFRIEND TO INFLUENCE PEOPLE Manuel Lafond (University of Montreal) Lata Narayanan

Blockly Lists &amp; Iteration CT @ VT Things we are seeing Using lists to represent a data

Circular Coinduction A Proof Theoretical Foundation su 1 Dorel Lucanu 2 Grigore Ro 1

Codings of rotations are full A. Blondin Mass e S. Brlek S. Labb e L. Vuillon

Dendric words and dendric subshifts Paulina CECCHI B . (Joint work with Val e ) erie Berth

r rts rtrst

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Clocking & Timing Asynchronous Self Timed Design Self Timed Design Synchronous Circuit

Social Network Games Krzysztof R. Apt CWI & and University of Amsterdam Joint work with

Blockly Lists & Iteration CT @ VT Things we are seeing Using lists to represent a data