high throughput
play

High-Throughput Linear Sorter System Jorge Ortiz David Andrews - PowerPoint PPT Presentation

A Configurable High-Throughput Linear Sorter System Jorge Ortiz David Andrews Information and Computer Science and Telecommunication Technology Computer Engineering Center The University of Arkansas 2335 Irving Hill Road 504 J.B.


  1. A Configurable High-Throughput Linear Sorter System  Jorge Ortiz  David Andrews Information and Computer Science and Telecommunication Technology Computer Engineering Center The University of Arkansas 2335 Irving Hill Road 504 J.B. Hunt Building, Lawrence, KS Fayetteville, AR jorgeo@ku.edu dandrews@uark.edu

  2. Introduction

  3. Introduction  Sorting an important system function Popular sorting algorithms not efficient or fast in hardware implementations  Linear sorters ideal for hardware, but sort at a rate of 1 value per cycle  Sorting networks better at throughput, but with high area and latency cost  Need a better solution for high throughput, low latency sorting

  4. Contributions  Expanding the linear sorter implementation and making it versatile, reconfigurable and better suited for streaming input and output  Parallelizing the linear sorter for increased throughput  Implementing the high-throughput linear sorter, and outmatching the performance of current linear sorter approaches

  5. Background

  6. Background  Software quicksort, mergesort and heapsort use divide-and-conquer techniques to achieve efficiency  Hardware sorting plagued with overhead from data movements, synchronization, bookkeeping and memory accesses  Need better use of concurrent data comparisons and swaps, rather than the extended execution of multiple assembly instructions like its software counterpart

  7. Sorting Networks  Swap comparators sort pairs of values  Sink lowest value, then operate on remaining S n-1 items Bubble Sort  Receive parallel data at inputs 3 3  High #PE and 2 2 latency, resort with 5 5 each new insertion 4 4 1 1

  8. Sorting Networks  Swap comparators sort pairs of values  Sink lowest value, then operate on remaining S n-1 items Bubble Sort  Receive parallel data at inputs 3 2  High #PE and 2 3 latency, resort with 5 5 each new insertion 4 4 1 1

  9. Sorting Networks  Swap comparators sort pairs of values  Sink lowest value, then operate on remaining S n-1 items Bubble Sort  Receive parallel data at inputs 3 2  High #PE and 2 3 latency, resort with 5 4 each new insertion 4 5 1 1

  10. Sorting Networks  Swap comparators sort pairs of values  Sink lowest value, then operate on remaining S n-1 items Bubble Sort  Receive parallel data at inputs 3 2  High #PE and 2 3 latency, resort with 5 4 each new insertion 4 1 1 5

  11. Sorting Networks  Swap comparators sort pairs of values  Sink lowest value, then operate on remaining S n-1 items Bubble Sort  Receive parallel data at inputs 3 2  High #PE and 2 3 latency, resort with 5 1 each new insertion 4 4 1 5

  12. Sorting Networks  Swap comparators sort pairs of values  Sink lowest value, then operate on remaining S n-1 items Bubble Sort  Receive parallel data at inputs 3 2  High #PE and 2 1 latency, resort with 5 3 each new insertion 4 4 1 5

  13. Sorting Networks  Swap comparators sort pairs of values  Sink lowest value, then operate on remaining S n-1 items Bubble Sort  Receive parallel data at inputs 3 1  High #PE and 2 2 latency, resort with 5 3 each new insertion 4 4 1 5

  14. Linear Sorters  Sorted insertions  Single clock latency, small logic & regular  Forwards incoming structure value to all nodes  Streaming input &  Each node shifts output autonomously depending on  Serial input, need neighbors’ values higher throughput Input: Output:

  15. Linear Sorters  Sorted insertions  Single clock latency, small logic & regular  Forwards incoming structure value to all nodes  Streaming input &  Each node shifts output autonomously depending on  Serial input, need neighbors’ values higher throughput Input: 3 Output:

  16. Linear Sorters  Sorted insertions  Single clock latency, small logic & regular  Forwards incoming structure value to all nodes  Streaming input &  Each node shifts output autonomously depending on  Serial input, need neighbors’ values higher throughput Input: 2 3 Output:

  17. Linear Sorters  Sorted insertions  Single clock latency, small logic & regular  Forwards incoming structure value to all nodes  Streaming input &  Each node shifts output autonomously depending on  Serial input, need neighbors’ values higher throughput Input: 5 2 3 Output:

  18. Linear Sorters  Sorted insertions  Single clock latency, small logic & regular  Forwards incoming structure value to all nodes  Streaming input &  Each node shifts output autonomously depending on  Serial input, need neighbors’ values higher throughput Input: 4 2 3 5 Output:

  19. Linear Sorters  Sorted insertions  Single clock latency, small logic & regular  Forwards incoming structure value to all nodes  Streaming input &  Each node shifts output autonomously depending on  Serial input, need neighbors’ values higher throughput Input: 1 2 3 4 5 Output:

  20. Linear Sorters  Sorted insertions  Single clock latency, small logic & regular  Forwards incoming structure value to all nodes  Streaming input &  Each node shifts output autonomously depending on  Serial input, need neighbors’ values higher throughput Input: 1 2 3 4 5 Output: 1 2 3 4 5

  21. Configurable Linear Sorter

  22. Configurable Linear Sorter  Increase versatility for linear sorters  Configurable: ◦ Linear sorter depth ◦ Sorting direction ◦ Sort on tags (for example, timestamps) rather than data ◦ User-defined data and tag size

  23. Configurable Linear Sorter Increase functionality for linear sorters 1. Detect full conditions 2. Buffer input while full 3. Retrieve output serially for streaming 4. Delete top value, freeing nodes 5. Augment with left shift functionality 6. Test tags before deleting them

  24. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5

  25. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5

  26. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7

  27. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7

  28. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7

  29. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7

  30. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9

  31. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9 7 8 2 3 5 6 7 9

  32. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9 7 8 2 3 5 6 7 9 8 4 3 5 6 7 8 9

  33. Interleaved Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9 7 8 2 3 5 6 7 9 8 4 3 5 6 7 8 9 9 4 5 6 7 8 9

  34. Interleaved Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9 7 8 2 3 5 6 7 9 8 4 3 5 6 7 8 9 9 4 5 6 7 8 9 10 4 5 6 7 8 9

  35. Interleaved Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9 7 8 2 3 5 6 7 9 8 4 3 5 6 7 8 9 9 4 5 6 7 8 9 10 4 5 6 7 8 9 11 5 6 7 8 9

  36. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9 7 8 2 3 5 6 7 9 8 4 3 5 6 7 8 9 9 4 5 6 7 8 9 10 4 5 6 7 8 9 11 5 6 7 8 9 12 6 7 8 9

  37. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9 7 8 2 3 5 6 7 9 8 4 3 5 6 7 8 9 9 4 5 6 7 8 9 10 4 5 6 7 8 9 11 5 6 7 8 9 12 6 7 8 9 13 7 8 9

  38. Extended Linear Sorter System Node 1 Node 2 Node 3 Node 4 Node 5 Node 6 0 5 1 7 5 2 6 5 7 3 2 5 6 7 4 1 2 5 6 7 5 9 1 2 5 6 7 6 3 1 2 5 6 7 9 7 8 2 3 5 6 7 9 8 4 3 5 6 7 8 9 9 4 5 6 7 8 9 10 4 5 6 7 8 9 11 5 6 7 8 9 12 6 7 8 9 13 7 8 9 14 8 9

Recommend


More recommend