2011 CANDE Workshop Program (PDF)
November 10, 2011, Double Tree Hotel, San Jose, CA, USA
8:00-8:25 
Breakfast
8:25-8:30 
Opening Remarks
8:30-9:30 
Session I: Sense and Sensibility
Chair: Gi-Joon Nam, IBM, USA
 Using Infrared Imaging 
to Improve IC Design 
Sherief Reda, Brown University
 Synthesizing Logical 
Computation on Stochastic Bit Streams for Sensing Applications 
Marc Riedel, University of Minnesota, Twin Cities
 9:30-10:00 
Special Session: Student Research Highlights
 10:00-10:30 
Coffee Break + Poster 
Session
 10:30-12:00
Session II: GPU for EDA Computing  
Chair: Kyle Rupnow, Advanced Digital Sciences Center, Singapore
EDA Computing with 
OpenCL for GPU Architectures and Beyond 
Rasit Onur Topaloglu, GLOBALFOUNDRIES, Inc.
 Leveraging GPU 
Computing for VLSI CAD and Beyond 
Peng Li, Texas A & M University
 GPU Evolution - 
Simplifying the Step to Heterogeneous Computing 
Stephen Jones, NVIDIA, Inc.
 12:00-1:15 
Lunch and Lunch Keynote 
EDA or the art of trading off apples, peaches 
and oranges (... and why many-core GPU will barely move the needle) 
Patrick 
Groeneveld, Magma [Slides]
 1:15-3:00
Session III: Panel Discussion -- EDA 
Roadmap: Why and How? 
Moderator: Farinaz Koushanfar, Rice University
Panelists: 
Andrew Kahng, University of California, San Diego, 
Juan-Antonio Carballo, NetLogic Microsystems, 
Dave Noice, Cadence, Leon Stok, IBM
 3:00-3:15
Coffee Break
 3:15-4:15
Keynote Session 
Many-core GPU Computing - Current Victories and 
Coming Battles 
Wen-Mei Hwu, University of Illinois, Urbana-Champaign 
[Slides]
 4:15-4:30
Coffee Break
 4:30-5:30
Session IV: New FPGA Advances 
Chair: 
Guy Lemieux, University of British Columbia, Canada 
System-level Trends in 
FPGA Devices 
Mike Hutton, Altera, Inc.
 Recent Advances in Xilinx 
FPGAs 
Tim Tuan, Xilinx, Inc.
 5:30-6:30
Evening Reception (sponsored by IEEE CEDA)
Technical Session Details
 8:30-9:30
Session I: Sense and Sensibility
 Title: Using Infrared Imaging to Improve IC Design 
Sherief Reda, Brown University
 Abstract: 
Approximate design models, computational limits, and runtime and process 
variabilities all contribute towards deviating the results of CAD tools from 
actual IC post-silicon characteristics.  Infrared imaging is a powerful 
technique to quantify post-silicon characteristics under realistic loading 
conditions. In this talk, an overview of basic concepts in infrared imaging and 
instrumentation will be first presented, and then a number of IC design 
applications will be discussed and demonstrated, including: hot spot detection, 
power mapping, reliability characterization, and Trojan detection. Results from 
test chips will be presented and analyzed.
Sherief 
Reda is an assistant professor of Electrical Sciences and Computer Engineering 
at the School of Engineering, Brown University, where he heads the SCALE 
laboratory (https://scale.engin.brown.edu). 
Professor Reda received his Ph.D. degree in computer engineering from University 
of California, San Diego, and B.Sc. degree from AinShams University, Cairo. His 
research interests include thermal / power modeling and management techniques, 
variability modeling and yield optimization, and physical design. He has 
received four best paper nominations and two best paper awards at DATE’02 and 
ISLPED’10.
Title: Synthesizing Logical Computation on Stochastic Bit 
Streams for Sensing Applications 
Marc Riedel, University of Minnesota, Twin Cities
 Abstract:
Most digital systems operate on a positional representation of data, such as 
binary radix. A positional representation is a compact way to encode signal 
values: in binary radix, 2^n distinct values can be represented with n bits. 
 However, operating on it requires complex logic: in each operation such as 
addition or multiplication, the signal must be "decoded," with the higher order 
bits weighted more than the lower order bits.  We advocate an alternative 
representation: random bit streams where the signal value is encoded by the 
probability of obtaining a one versus a zero. This representation is much less 
compact than binary radix. However, complex operations can be performed with 
very simple logic. For instance, multiplication can be performed with a single 
AND gate. Also, because the representation is uniform, with all bits weighted 
equally, it is highly tolerant of soft errors (i.e., bit flips). In this talk, 
we will discuss a general method for synthesizing digital circuitry that 
computes on such stochastic bit streams, targeting sensing applications. We show 
that our method produces very simple hardware that is highly tolerant of soft 
errors, so suitable for very light-weight sensors operating in harsh 
environments.
Marc Riedel 
has been an Assistant Professor of Electrical and Computer Engineering at the 
University of Minnesota since 2006. He is also a member of the Graduate Faculty 
in Biomedical Informatics and Computational Biology. In 2004-2005, he was a 
lecturer in Computation and Neural Systems at Caltech. He has held positions at 
Marconi Canada, CAE Electronics, Toshiba, and Fujitsu Research Labs. He received 
his Ph.D. and his M.Sc. in Electrical Engineering at Caltech and his B.Eng. in 
Electrical Engineering with a Minor in Mathematics at McGill University. His 
Ph.D. dissertation titled "Cyclic Combinational Circuits" received the Charles 
H. Wilts Prize for the best doctoral research in Electrical Engineering at 
Caltech. His paper "The Synthesis of Cyclic Combinational Circuits" received the 
Best Paper Award at the Design Automation Conference. He is a recipient of the 
NSF CAREER Award.
10:30-12:00 
Session II: GPU for EDA Computing
 Title: EDA Computing with OpenCL for GPU Architectures and 
Beyond 
Rasit Onur Topaloglu, GLOBALFOUNDRIES, Inc.
 Abstract: 
Graphical processing unit (GPU) computing has been an interesting area of 
research in the last decade. While initial adapters of the technology have been 
from image processing domain due to difficulties in programming the GPUs, 
research on programming languages made it possible for people without the 
knowledge of low-level programming languages such as OpenGL develop code on 
GPUs. Two main GPU architectures from AMD (former ATI) and NVIDIA acquired 
grounds. While AMD adapted Stanford’s Brook language and made it into an 
architecture-agnostic tool, NVIDIA brought CUDA framework to a wide audience. 
Until recently, it has not been possible to compile one GPU code across 
platforms. An opportunity came with idea of combining one or more CPUs and GPUs 
on the same die. Eliminating some of the interconnection bandwidth issues, this 
combination makes it possible to  
offload tasks with high parallelism to the GPU. The technological direction 
towards multicores for CPU-only architectures also requires a programming 
methodology change and act as a catalyst for better programming languages. 
Hence, a unified language that can be used both on multicore CPUs as well as 
GPUs gained interest. Open Computing Language (OpenCL), developed originally by 
the Khronos Group of Apple and supported by both AMD and NVIDIA, is seen as the 
programming language of choice for parallel programming. In this talk, I provide 
an introduction to  EDA computing 
with OpenCL for current and future architectures with GPUs. I will end my talk 
by providing research directions in this area.
 Dr. 
Rasit Onur Topaloglu received his B.S. degree with High 
Honors in Electrical and Electronic  
Engineering from Bogazici University in Turkey. He received his M.S. degree in 
Computer Science and Ph.D. degree in Computer Engineering from University of 
California at San Diego. Since 2005, he has been with Advanced Micro Devices and 
later on with GLOBALFOUNDRIES, which is an advanced semiconductor manufacturer 
with technology coverage down to 28 nm and research down to 14 nm. 
Dr. Topaloglu’s research interests are VLSI design and CAD. He has over 
40 international and refereed publications, four book chapters and an edited 
book, three granted and four pending patents, and a best paper award at IEEE 
International Symposium on Quality Electronic Design. He has chaired DAC 
Workshop on Parallel Algorithms, Programming, and Architectures (PAPA) in 2011. 
He serves in technical program committees for DAC and International Symposium on 
Physical Design. He serves as Semiconductor Research Corporation Science Area 
Coordinator for Computer Aided Design and Test as well as Circuits and Systems 
for GLOBALFOUNDRIES. He is also a Semiconductor Research Corporation Focus 
Center Research Program Associate for Concurrent Systems.
Title: Leveraging GPU Computing for VLSI CAD and Beyond 
Peng Li, Texas A & M University
Abstract:
The recent emergence of general-purpose GPU (graphics processing units) 
computing platforms offer appealing opportunities in leveraging these powerful 
SIMD (single instruction multiple data) machines for addressing a range of 
computational challenges. However, to be successful, one has to properly expose 
the “hidden” data parallelism in an application and develop “smart” algorithms 
to penetrate the SIMD barrier of the GPU architecture. A proper interplay 
between algorithm design and SIMD architecture consideration is essential for 
achieving good runtime performance. In this talk, we will show GPU-based 
algorithms can be developed to analyze large power delivery networks with 
excellent efficiency. We will also demonstrate the application of GPU computing 
to the simulation of large neuronal networks with biophysically plausible 
Hodgkin-Huxley type neuron models.
Peng Li received the Ph.D. degree in electrical and computer engineering from 
CMU in 2003, respectively. He is presently an associate professor at Department 
of Electrical and Computer Engineering, Texas A&M University. His research 
interests center on VLSI systems, design automation, parallel computing, and 
aspects of computational neuroscience and biology. His work has received several 
recognitions including an NSF CAREER Award and three IEEE/ACM Design Automation 
Conference (DAC) Best Paper Awards. 
 Title: GPU Evolution - Simplifying the Step to 
Heterogeneous Computing 
Stephen Jones, NVIDIA, Inc.
 Abstract:
Heterogeneous computing - running a single program on two (or more) different 
computer architectures - offers the potential efficiency and performance of both 
worlds, but at the cost of increased program complexity. Ideally a program could 
achieve heterogeneous execution within a single, homogeneous programming model; 
in practice this requires a combination of software-aware hardware design, and 
heterogeneous-aware software design. In this talk, we will look at the 
co-evolution of GPU hardware and programming model design and where that 
trajectory will take us in the future - in particular with application to the 
large legacy code-bases typical in the EDA world.
 Stephen 
Jones is an engineer in NVIDIA's high-performance computing group, working on 
developing the CUDA language for programming the GPU. His background is in 
computational fluid dynamics and plasma physics and he is a member of CUDA's 
parallel algorithms group, which develops the mathematical libraries which form 
part of the CUDA programming toolkit. His current work is focused on the 
evolution of the CUDA language to better support high performance computing 
applications, and the future GPU architectures required to enable it.
 12:00-1:15 
Lunch and Lunch Keynote 
 Title:
EDA or the 
art of trading off apples, peaches and oranges 
(... and why many-core 
GPU will barely move the needle) 
Patrick Groeneveld, Magma
 Patrick 
Groeneveld is Chief Technologist at Magma Design Automation in San Jose, 
California. Since joining Magma in 1997, he has developed key parts of Magma's 
flagship physical synthesis products. Patrick received his MSc and PhD degrees 
from Delft university of technology (1987, 1991). Before starting at Magma, 
Patrick was associate professor at Delft 
University of 
Technology, where he was also associated with DIMES. From 2001 until 2005 he was 
full professor of Electrical Engineering at Eindhoven University. In his spare 
time, Patrick is the general Chair of the 2012 Design Automation Conference in 
San Francisco. Otherwise, Patrick enjoys being with his family, hiking, reading 
useless information and flying a Cessna 172.
 3:15-4:15 
Keynote Session 
 Title:
Many-core GPU Computing - 
Current Victories and Coming Battles 
Wen-Mei Hwu, University of Illinois, Urbana-Champaign
Wen-mei W. Hwu is the Walter J. ("Jerry") Sanders III-Advanced Micro Devices 
Endowed Chair in Electrical and Computer Engineering in the Coordinated Science 
Laboratory of the University of Illinois at Urbana-Champaign. Dr. Hwu received 
his Ph.D. degree in Computer Science from the University of California, 
Berkeley, 1987. His research interests are in the areas of architecture, 
implementation, software for high-performance computer systems, and parallel 
processing. 
He is a Principal Investigator (PI) for the petascale Blue Waters system, is 
co-director of the Intel and Microsoft funded Universal Parallel Computing Research Center (UPCRC), and PI for 
the world's first NVIDIA CUDA Center of Excellence. At the Illinois Coordinated 
Science Lab, he is the director of the OpenIMPACT project, which has delivered 
new compiler and computer architecture technologies to the computer industry 
since 1987. He also serves as the Soft Systems Theme leader of the MARCO/DARPA 
Gigascale Silicon Research Center (GSRC) and on the Executive Committees of both 
the GSRC and the MARCO/DARPA Center for Circuit and System Solutions (C2S2). For 
his contributions to the areas of compiler optimization and computer 
architecture, he received the 1993 Eta Kappa Nu Outstanding Young Electrical 
Engineer Award, the 1994 Xerox Award for Faculty Research, the 1994 University 
Scholar Award of the University of Illinois, the 1997 Eta Kappa Nu Holmes 
MacDonald Outstanding Teaching Award, the 1998 ACM SigArch Maurice Wilkes Award, 
the 1999 ACM Grace Murray Hopper Award, the 2001 Tau Beta Pi Daniel C. Drucker 
Eminent Faculty Award, and the 2002 ComputerWorld Honors Archive Medal. He is a 
fellow of IEEE and of the ACM.
4:30-5:30 
Session IV: New FPGA Advances
 Title: 
System-level Trends in FPGA Devices 
Mike Hutton, Altera, Inc.
 Abstract:
Recent years have seen many changes in FPGA technology, and one of the more 
pronounced ones is that FPGAs are no longer just glue logic but a central 
component of systems.  This talk will describe recent solutions and 
promising new directions along with some remaining challenges.  Specific 
topics will include newly announced embedded SOC devices, changes to FPGA CAD 
for system-level design (floorplanning, interconnect generation), new directions 
such as OpenCL design entry and commercial application of dynamic 
reconfiguration.
Mike Hutton is a Principal Design Engineer in the Office of the CTO at Altera 
Corporation. His research interests include FPGA architecture, CAD and 
high-performance applications on FPGAs.  He is a member of the ACM SIGDA 
Technical Committee on FPGAs and Reconfigurable Computing, Associate Editor of 
IEEE Transactions on CAD, serves on the Program Committees of DAC, FPGA, FPL, 
FPT and other conferences and is past TPC-Chair for FPGA and SLIP.  He has 
a Ph.D. from the University of Toronto, and has published 35 academic papers and 
holds more than 50 US Patents on FPGA and related technology.
 Title: 
Recent Advances in Xilinx FPGAs 
Tim Tuan, Xilinx, Inc.
 FPGAs 
have evolved from simple, small arrays of logic blocks to massive, complex 
systems comprising billions of devices. With this fantastic growth comes a 
number of significant challenges, including manufacturability, power 
limitations, and design productivity. In 28nm
process node, Xilinx is implementing several key 
technology innovations, such as 2.5D Stacked Silicon Interconnect, new power 
management solutions, and Extensible Processing Platforms. These technologies 
will help Xilinx FPGAs to cope with the challenges of ultra deep submicron 
technologies and to continue riding the waves of scaling.
Tim Tuan is a Sr. Staff Engineer in the CTO Office (aka Xilinx Research
Labs) at Xilinx Inc. in San Jose. At Xilinx, he 
has worked extensively on analyzing and mitigating deep submicron challenges 
facing FPGAs using a wide range of techniques spanning process technology, 
circuits, architectures and CAD. Currently, he's working on high performance 
architectures and power management solutions for the next generation FPGAs. 
Prior to Xilinx, he was with Berkeley Wireless Research Center working on ultra 
low power wireless network processors. He has a MS in EECS from UC Berkeley and 
BS in Computer Engineering from University of
Washington. He has published over 20 technical 
papers and 20 issued patents.