PublicationsCollection of our publications
2016

Diffraction pattern simulation of cellulose fibrils using distributed and quantized pair distances [bibtex] [link]
Yan Zhang, Hideyo Inouye, Michael Crowley, Leiming Yu, David Kaeli, Lee Makowski
Journal of Applied Crystallography, 2016.

Hetero-Mark, A Benchmark Suite for CPU-GPU Collaborative Computing [bibtex]
Yifan Sun, Xiang Gong, Amir Kavyan Ziabari, Leiming Yu, Xiangyu Li, Saoni Mukherjee, Carter McCardwell, Alejandro Villegas, David Kaeli
IEEE International Symposium on Workload Characterization (IISWC), 2016.

Modeling Player Decisions in a Supply Chain Game [bibtex]
Yifan Sun, Chisheng Liang, Stephen Sutherland, Casper Harteveld, David Kaeli
IEEE Computational Intelligence and Games Conference

Hardware Thread Reordering to Boost OpenCL Throughput on FPGAs [bibtex]
Amir Momeni, Hamed Tabkhi, Gunar Schirner, David Kaeli
IEEE International Conference on Computer Design

A Framework for Studying New Approaches to Anomaly Detection [bibtex]
Esra Nergis Yolaçan, David Kaeli
International Journal of Information Security Science

OpenCL-based optimizations for acceleration of object tracking on FPGAs and GPUs [ bibtex ]
Amir Momeni, Hamed Tabkhi, Gunar Schirner, David Kaeli
Architectures and Systems for Real-time Mobile Vision Applications.

Patterns of temporal scaling of groundwater level fluctuation [link] [bibtex]
Xue Yu, Reza Ghasemizadeh, Ingrid Y Padilla, David Kaeli, Akram Alshawabkeh
Journal of Hydrology.

A Complete Key Recovery Timing Attack on a GPU [bibtex]
Zhen Hang Jiang, Yunsi Fei, David Kaeli
The 22nd International IEEE Symposium on High Performance Computer Architecture.

Mystic: Predictive Scheduling for GPU Based Cloud Servers Using Machine Learning [bibtex]
Yash Ukidave, Xiangyu Li, David Kaeli
The 30th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2016.

Balancing Scalar and Vector Execution on GPU Architectures [bibtex]
Zhongliang Chen, David Kaeli
The 30th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2016

Portable Performance for Monte Carlo Simulations of Photon Migration in 3D Turbid Media for Single and Multiple GPUs [bibtex]
Fanny Nina-Paravecino, Leiming Yu, Qianqian Fang, David Kaeli
Nvidia GPU Technology Conference, 2016.

A Comprehensive Performance Analysis of HSA and OpenCL 2.0 [bibtex]
Saoni Mukherjee, Yifan Sun, Paul Blinzer, Amir Kavyan Ziabari, David Kaeli
IEEE International Symposium on Performance Analysis of Systems and Software, 2016.

2015

Heterogeneous System Architecture: A New Compute Platform Infrastructure [bibtex]
S.-H. Hung, Thomas Jablin, David Kaeli, Yifan Sun, Rafael Ubal
Morgan Kaufmann, 2015

Using Benchmarks for Radiation Testing of Microprocessors and FPGAs [link]
Heather Quinn, William H Robinson, Paolo Rech, Miguel Aguirre, Arno Barnard, Marco Desogus, Luis Entrena, Mario Garcia-Valderas, Steven M Guertin, David Kaeli, Fernanda Lima Kastensmidt, Bradley T Kiddie, Antonio Sanchez-Clemente, Matteo Sonza Reorda, Luca Sterpone, Michael Wirthlin
IEEE Transactions on Nuclear Science, 2015.

Performance of the NVIDIA Jetson TK1 in HPC [link]
Yash Ukidave, David Kaeli, Umesh Gupta, Kurt Keville
IEEE International Conference on Cluster Computing, 2015.

Bridging Architecture and Programming for Throughput-Oriented Vision Processing [link]
Amir Momeni, Hamed Tabkhi, Gunar Schirner, David Kaeli
Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2015.

A reuse-based refresh policy for energy-aware eDRAM caches [link]
Alejandro Valero, Salvador Petit, Julio Sahuquillo, David R Kaeli, José Duato
Microprocessors and Microsystems, 2015.

Real-time Image Segmentation for Homeland Security Exploiting Hyper-Q Concurrency [link]
Fanny Nina-Paravecino, David Kaeli
NVIDIA GPU Technology Conference, 2015.

Side-channel power analysis of a GPU AES implementation [link]
Chao Luo, Yunsi Fei, Pei Luo, Saoni Mukherjee, David Kaeli
33rd IEEE International Conference on Computer Design (ICCD), New York City, 2015.

Modeling Flash Translation Layers to Enhance System Lifetime [link]
Thomas McCormick, David Kaeli
2015 Flash Memory Summit. Santa Clara, 2015.

Asymmetric NoC Architectures for GPU Systems [link]
Amir Kavyan Ziabari, José L. Abellán, Yenai Ma, Ajay Joshi, David R. Kaeli
9th International Symposium on Networks-on-Chip (NoCs). Vancouver, 2015.

Spatiotemporal changes of CVOC concentrations in karst aquifers: Analysis of three decades of data from Puerto Rico [link]
Xue Yu, Reza Ghasemizadeh, Ingrid Padilla, Celys Irizarry, David Kaeli, Akram Alshawabkeh
Science of the Total Environment, 2015.

Field, experimental, and analytical data on large-scale HPC systems and evaluation of the implications for exascale system design [link]
Nathan DeBardeleben, Sean Blanchard, David Kaeli, Paolo Rech
IEEE 33rd VLSI Test Symposium (VTS), 2015.

Leveraging Silicon-photonic NoC for Designing Scalable GPUs [link]
Amir Kavyan Ziabari, José L. Abellán, Rafael Ubal, Chao Chen, Ajay Joshi, David R. Kaeli
In 29th International Conference on Supercomputing. ACM, 2015.

Exploring the Efficiency of the OpenCL Pipe on an FPGA [link]
Amir Momeni, Hamed Tabkhi, Yash Ukidave, Gunar Schirner, David Kaeli
International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, 2015.

Bridging Architecture and Programming for Throughput-Oriented Vision Processing [link]
Amir Momeni, Hamed Tabkhi, Gunar Schirner, David Kaeli
23rd ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2015.

Big Data Analysis on Puerto Rico Testsite for Exploring Contamination Threats [link]
Xiangyu Li, Leiming Yu, David Kaeli, Yuanyuan Yao, Poguang Wang, Roger Giese, Akram Alshawabkeh
ALLDATA, 2015.

NUPAR: A Benchmark Suite for Modern GPU Architectures [link]
Yash Ukidave, Fanny Nina Paravecino, Leiming Yu, Charu Kalra, Amir Momeni, Zhongliang Chen, Nick Materise, Brett Daley, Perhaad Mistry, David Kaeli
6th ACM/SPEC International Conference on Performance Engineering (ICPE). 2015

2014

Calculating Architectural Vulnerability Factors for Spatial Multi-Bit Transient Faults [link]
Mark Wilkening, Vilas Sridharan, Si Li, Fritz Previlon, Sudhanva Gurumurthi, David Kaeli
47th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2014

Runtime Support for Adaptive Spatial Partitioning and Inter-Kernel Communication on GPUs [link]
Yash Ukidave, Charu Kalra, David Kaeli, Perhaad Mistry, Dana Schaa
26th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2014

Accelerated Connected Component Labeling Using CUDA Framework [link]
Fanny Nina Paravecino, David Kaeli
International Conference on Computer vision and Graphics, ICCVG 2014

System Call Anomaly Detection using multi-HMMs [link]
Esra N Yolacan, Jennifer G Dy, David R Kaeli
Eighth International Conference on Software Security and Reliability-Companion(SERE-C), 2014

Heterogeneous Computing with OpenCL 2.0 [link]
David Kaeli, Perhaad Mistry, Dana Schaa, Dong Ping Zhang
Morgan Kaufmann, 2014

Harnessing the Power of GPUs to Speed Up Feature Selection for Outlier Detection [link]
Fatemeh Azmandian, Ayse Yilmazer, Jennifer G. Dy, Javed A. Aslam, David R. Kaeli
Journal of Computer Science and Technology, 2014

Fast Fourier Transform (FFT) on GPUs [link]
Yash Ukidave, Gunar Schirner, David Kaeli
Book Chapter: Numerical Computations with GPUs, pp 339-361, 2014

Aggressive Value Prediction on a GPU [link]
Enqiang Sun, David Kaeli
International Journal of Parallel Programming, 2014

Empirical FTL Evaluation and Modeling [link]
Thomas McCormick, David Kaeli
2014 Flash Memory Summit. 2014.

A Parallel Clustering Algorithm for Placement [link]
Amir Momeni, Perhaad Mistry, David Kaeli
International Society for Quality Electronic Design, ISQED 2014

Exploring the Heterogeneous Design Space for both Performance and Reliability [link]
Rafael Ubal, Dana Schaa, Perhaad Mistry, Xiang Gong, Yash Ukidave, Zhongliang Chen, Gunar Schirner, David Kaeli
Proceedings of the The 51st Annual Design Automation Conference on Design Automation Conference, DAC 2014

Analyzing Power Efficiency of Optimization Techniques and Algorithm Design Methods for Applications on Heterogeneous Platforms [link]
Yash Ukidave, Amir Kavyan Ziabari, Perhaad Mistry, Gunar Schirner, David Kaeli
International Journal of High Performance Computing Applications, IJHPCA 2014

Scalar Waving: Improving the Efficiency of SIMD Execution on GPUs [pdf]
Ayse Yilmazer, Zhongliang Chen, David Kaeli
28th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2014

2013

HQL: A Scalable Synchronization Mechanism for GPUs [link]
Ayse Yilmazer, David Kaeli
27th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2013

Architecture-Independent Dynamic Information Flow Tracking [link]
Ryan Whelan, Tim Leek, David Kaeli
22nd International Conference on Compiler Construction, Rome, Italy, March 2013.

Quantifying the energy efficiency of FFT on heterogeneous platforms [link]
Yash Ukidave, Amir Kavyan Ziabari, Mistry Perhaad, Schirner Gunar, Kaeli David R.
International Symposium on Performance Analysis of Systems and Software, 2013

Characterizing Scalar Opportunities in GPGPU Applications [link]
Zhongliang Chen, David Kaeli, Norman Rubin
IEEE International Symposium on Performance Analysis of Systems and Software, April 2013

2012

Heterogeneous Computing with OpenCL: Revised OpenCL 1.2 Edition [bibtex] [link]
Benedict Gaster, Lee Howes, David R. Kaeli, Perhaad Mistry, Dana Schaa
Morgan Kaufmann, 2012

Flash Challenges for Embedded Computing [pdf]
Thomas McCormick Flash Memory Summit. 2012

Feature Weighting and Selection Using Hypothesis Margin of Boosting Characterizing Scalar Opportunities in GPGPU Applications [link]
Malak Alshawabkeh, Javed A. Aslam, Jennifer G. Dy, David R. Kaeli
The IEEE International Conference on Data Mining (ICDM). In press. 2012.

Local Kernel Density Ratio-Based Feature Selection for Outlier Detection [pdf]
Fatemeh Azmandian, Jennifer Dy, Javed A. Aslam, David Kaeli
Asian Conference on Machine Learning (ACML)

Dione: A Flexible Disk Monitoring and Analysis Framework [pdf] [bibtex]
Jennifer Mankin, David Kaeli
15th International Symposium on Research in Attacks, Intrusions, and Defenses (RAID'12) Amsterdam, The Netherlands

Multi2Sim: A Simulation Framework for CPU-GPU Computing [pdf] [bibtex]
Rafael Ubal, Byunghyun Jang, Perhaad Mistry, Dana Schaa, David Kaeli
Proc. of the 21st International Conference on Parallel Architectures and Compilation Techniques (PACT)

2011

Feature Selection Metric Using AUC Margin for Small Samples and Imbalanced Data Classification Problems [link]
Malak Alshawabkeh, Javed A. Aslam, Jennifer G. Dy, David R. Kaeli
International Conference on Machine Learning and Applications (ICMLA). 2011: 145-150

A Novel Feature Selection for Intrusion Detection in Virtual Machine Environments [link
Malak Alshawabkeh, Javed A. Aslam, David R. Kaeli, Jennifer G. Dy
The IEEE International Conference on Tools with Artificial Intelligence (ICTAI). 2011: 879-881

Heterogeneous Computing with OpenCL [bibtex] [link]
Benedict Gaster, Lee Howes, David R. Kaeli, Perhaad Mistry, Dana Schaa
Morgan Kaufmann, 2011

Virtual Machine Monitor-Based Lightweight Intrusion Detection [link]
F. Azmandian, M. Moffie, M. Alshawabkeh, J. G. Dy, J. A. Aslam, and D. R. Kaeli
Operating Systems Review, 45(2):38-53, 2011.

GPU Acceleration of Iterative Digital Breast Tomosynthesis [link]
Dana Schaa, Benjamin Brown, Byunghyun Jang, Perhaad Mistry, Rodrigo Dominguez, David Kaeli, Richard Moore, Daniel B. Kopans
GPU Computing Gems Emerald Edition, Chapter 40, Wen-Mei Hwu, editor, Morgan Kaufmann, 2011

Aggressive Value Prediction on a GPU pdf
Enqiang Sun and David Kaeli
Proceedings of the 23rd International Symposium on Computer Architecture and High Performance Computing(SBAC-PAD), Vitoria, ES, Brazil, Oct. 2011

2010

Effective Virtual Machine Monitor Intrusion Detection Using Feature Selection on Highly Imbalanced Data []
M. Alshawabkeh, M. Moffie, F. Azmandian, J.A. Aslam, J. Dy, and D. Kaeli
Machine Learning and Applications (ICMLA), Washington, DC, USA, December 12-14, 2010

Exploiting Memory Access Patterns to Improve Memory Performance in Data Parallel Architectures []
Byunghyun Jang, Dana Schaa, Perhaad Mistry, David R. Kaeli
IEEE Transaction on Parallel and Distributed Computing, 2010

Using Hardware Vulnerability Factors to Improve AVF Analysis []
Vilas Sridharan, David R. Kaeli
International Symposium on Computer Architecture (ISCA-37), Saint-Malo, France, June 21-23, 2010

Data Structures and Transformations for Physically Based Simulation on a GPU [pdf]
Perhaad Mistry, Dana Schaa, Byunghyun Jang, David Kaeli, Albert Dvornik, Dwight Meglan
9th International Meeting - High Performance Computing for Computational Science (VECPAR'10)

Quantifying Load Imbalance on Virtualized Enterprise Servers [pdf] [bibtex]
Emmanuel Arzuaga, David Kaeli
Proceedings of the First Joint WOSP/SIPEW International Conference on Performance Engineering 2010, San Jose, California, USA January 28 - 30, 2010

Data Transformations Enabling Loop Vectorization on Multithreaded Data Parallel Architectures [pdf] [bibtex]
Byunghyun Jang, Perhaad Mistry, Dana Schaa, Rodrigo Dominguez, David Kaeli
15th ACM SIGPLAN Symposium on Principles and Practices of Parallel Programming (PPoPP'10), Bangalore, India, 2010

2009

AGAMOS: A Graph-Based Approach to Modulo Scheduling for Clustered Microarchitecture [pdf] [bibtex]
Alex Aletà, Josep M. Codina, Jesús Sánchez, Antonio González, David Kaeli
IEEE Transactions on Computers, 2009.

Multi GPU Implementation of Iterative Tomographic Reconstruction Algorithms [pdf] [bibtex]
Byunghyun Jang, David Kaeli, Synho Do, Homer Pien
IEEE International Symposium on Biomedical Imaging, Jun. 2009

Software Transactional Memory for Multicore Embedded Systems [pdf] [bibtex]
Jennifer Mankin, David Kaeli
Proceedings of the ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems, June 2009

Exploring the Multiple-GPU Design Space [pdf] [bibtex]
Dana Schaa, David Kaeli
23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS-09), Best Paper Award, May. 2009

Eliminating Microarchitectural Dependency from Architectural Vulnerability [pdf] [bibtex]
Vilas Sridharan, David R. Kaeli
International Symposium on High-Performance Computer Architecture (HPCA-15), Feb 2009

2008

A Field Analysis of System-Level Effects of Soft Errors Occurring in Microprocessors used in Information Systems [pdf]
Syed Z. Shazli, Mohammed Abdul-Aziz, Mehdi B. Tahoori, David R. Kaeli
International Test Conference, November 2008

A Field Analysis of Soft Errors Occurring in Microprocessors used in Information Systems
S. Shazli, M.B. Tahoori, D. Kaeli
North Atlantic Test Conference, May 2008.

Interactive Deformable Registration Visualization and Analysis of 4D Computed Tomography [pdf] [bibtex]
Burak Erem, Gregory C. Sharp, Ziji Wu, and David Kaeli
Proceedings of the 1st International Conference on Medical Biometrics, Jan. 2008

2007

Soft Error Susceptibility Analysis of SRAM-Based FPGAs in High-Performance Information Systems [pdf] [bibtex]
Hossein Asadi, Mehdi B. Tahoori, Brian Mullins, David Kaeli, and Kevin Granlund
IEEE Transactions on Nuclear Science (TNS), Dec. 2007

Characterization of File IO Activity for SPEC CPU2006 [pdf] [bibtex]
Dong Ye, Joydeep Ray, David Kaeli
Special Issue of ACM SIGARCH Computer Architecture News: SPEC CPU2006 Analysis, 2007

Towards the Development of an Error Checker for Radiotherapy Treatment Plans: A Preliminary Study [pdf] [bibtex]
Fatemeh Azmandian, David Kaeli, Jennifer G Dy, Elizabeth Hutchinson, Marek Ancukiewicz, Andrzej Niemierko, and Steve B Jiang
Physics in Medicine and Biology, 2007

Heterogeneous Clustered VLIW Microarchitectures [pdf] [bibtex]
Aletà, A., Codina, J. M., González, A., and Kaeli, D
Proceedings of the 5th IEEE International Symposium on Code Generation and Optimization, March 2007

Case Study: Soft Error Rate Analysis in Storage Systems [pdf] [bibtex]
Brian Mullins, Hossein Asadi, Mehdi B. Tahoori, David Kaeli
Proceedings of the 25th IEEE VLSI Test Symposium, May 2007

Characterizing the Relationship Between ILU-based Preconditioners and the Storage Hierarchy [pdf]
D. Rivera, D. Kaeli, M. Kilmer
Proceedings of the International Conference on Preconditioning Techniques for Large Sparse Matrix Problems in Scientific and Industrial Applications, 2007

Exploring Novel Parallelization Technologies for 3-D Imaging Applications [pdf] [bibtex]
Diego Rivera, Dana Schaa, Micha Moffie, David Kaeli
In 19th International Symposium on Computer Architecture and High Performance Computing, Oct. 2007

External Memory Page Remapping for Embedded Multimedia Systems [pdf] [bibtex ]
Ke Ning, David Kaeli
Proceedings of the ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems, June 2007.

2006

Addressing a Workload Characterization Study to the Design of Consistency Protocols [pdf] [bibtex]
Salvador Petit, Julio Sahuquillo, Ana Pont, David Kaeli
Journal of Supercomputing , 2006.

Reducing Data Cache Susceptibility to Soft Errors [pdf] [bibtex]
Hossein Asadi, Vilas Sridharan, Mehdi B. Tahoori, David Kaeli
IEEE Transactions on Dependable and Secure Computing , 2006.

An Adjustable Linear Time Parallel Algorithm for Maximum Weight Bipartite Matching [pdf] [bibtex]
Morteza Fayyazi, David Kaeli, Waleed Meleis
Information Processing Letters , 2006.

Profile-guided File Partitioning on Beowulf Clusters [pdf] [bibtex]
Yijian Wang and David Kaeli
Journal of Cluster Computing, Special Issue on Parallel I/O, 2006.

Acceleration of Maximum Likelihood Estimation for Tomosynthesis Mammography [pdf] [bibtex]
Juemin Zhang, Waleed Meleis, David Kaeli, Tao Wu
Proceedings of the International Conference on Parallel and Distributed Systems, July 2006

Performance Characterization of SPEC CPU2006 Integer Benchmarks on the x86-64 Architecture [pdf] [bibtex]
Dong Ye, Joydeep Ray, Christophe Harle, and David Kaeli
Proceedings of the IEEE Symposium on Workload Characterization, invited paper, October 2006

Vulnerability Analysis of L2 Cache Elements to Single Event Upsets [pdf] [bibtex]
Hossein Asadi, Vilas Sridharan, Mehdi B. Tahoori, David Kaeli
Proceedings of Design and Test in Europe (DATE), 2006.

2005

Speculative Execution in High Performance Computer Architectures
CRC Press, Chapman and Hall,
D. Kaeli and P. Yew, editors, ISBN-1-58488-447-9, 2005.

Balancing Performance and Reliability in the Memory Hierarchy [pdf]
Ghazanfar-Hossein Asadi, Vilas Sridharan, Mehdi B. Tahoori, David Kaeli
International Symposium on Performance Analysis of Systems and Software (ISPASS-05), 2005.

Load Balancing using Grid-based Peer-to-Peer Parallel I/O [pdf]
Yijian Wang and David Kaeli
Proceedings of the IEEE Cluster Computing Conference, September 2005.

Power Aware External Bus Arbitration for System-on-a-Chip Embedded Systems [pdf]
Ke Ning and David Kaeli
Proceedings of the International Conference on High Performance Embedded Architectures and Compilers November 2005.

Exploiting Temporal Locality in Drowsy Cache Policies [pdf]
Salvador Petit, Julio Sahuquillo, Jose M. Such, David Kaeli
Proceedings of IEEE Computing Frontiers, 2005, pp. 371-377.

Demystifying On-the-Fly Spill Code [pdf]
Alex Aletà, Josep M. Codina, Antonio González, David Kaeli
Proceedings of the ACM Conference on Programming Languages, Design and Implementation (PLDI) 2005, pp. 180-189.

A Multinomial Clustering Model for Fast Simulation of Computer Architecture Designs [pdf]
Kaushal Sanghai, Ting Su, Jennifer Dy, David Kaeli
Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Aug. 2005, pp. 808-813.

Subsequence Matching on Structured Time Series Data [pdf]
Wu, H., Salzberg, B., Sharp, G. C., Jiang, S. B., Shirato, H., and Kaeli, D.
Proceedings of ACM Conference on the Management of Data (SIGMOD), 2005, pp. 682-693.

2004

Removing Communications in Clustered Microarchitectures Through Instruction Replication [pdf]
Aletà, A., Codina, J. M., González, A., and Kaeli, D
ACM Transactions on Architecture and Code Optimization, Vol. 1, No. 2, June 2004, pp. 127-151.

A Finite State Model for Respiratory Motion Analysis in Image-guided Radiation Therapy [pdf]
Huanmei Wu, Gregory C Sharp, Betty Salzberg, David Kaeli, Hiroki Shirato and Steve B Jiang
Journal of Physics in Medicine and Biology , 49(23), 2004, pp. 5357-5372.

An Object-Oriented Parallel Library [pdf] [bibtex]
Chakib Ouarraui and David Kaeli
Journal of High Performance Computing and Networking , Vol. 1, Issue 1/2/3, 2004, pp. 85-90.

Developing Energy-Aware Strategies for the Blackfin Processor [pdf] [bibtex]
Steven VanderSanden, David R. Kaeli, Giuseppe Olivadoti, Richard Gentile
Proceedings of the 2006 High Performance Embedded Computing Conference, MIT Lincoln Labs, Sept. 2004.

2003

Levo - A Scalable Processor With High IPC [pdf] [bibtex]
Augustus K. Uht, David Morano, Alireza Khalafi and David R. Kaeli
Journal of Instruction Level Parallelism , August 2003.

A Database System to Advance Subsurface Sensing and Imaging [pdf]
Huanmei Wu, Becky Norum, Betty Salzberg and David Kaeli
Journal of Subsurface Sensing Technologies and Applications , October 2003, pp. 395-408.

2002

Profile-Based Characterization and Tuning for Subsurface Sensing and Imaging Applications [pdf]
Ashouei Jiang, M. Ashouei, D. Jiang, W. Meleis, D. Kaeli, M. El-shenawee, E. Mizan, Y. Wang, C. Dimarzio
International Journal of Systems, Science and Technology , September 2002, pp. 40-55.

Electromagnetics Computations Using the MPI Parallel Implementation of the Steepest Descent Fast Multipole Method (SDFMM) [pdf]
M. El-Shenawee, C. Rappaport, D. Jiang, W. Meleis and D. Kaeli
ACES Journal, , Vol. 17, No. 2, July 2002, pp. 112-122.

Realizing High IPC Using Time-Tagged Resource-Flow Computing [pdf]
Augustus Uht, Alireza Khalafi, David Morano, Marcos de Alba, and David Kaeli
Proceedings of Europar 2002, Springer-Verlag, August, 2002, pp. 490-499.

Path-based Hardware Loop Prediction [pdf]
Marcos R. de Alba, David R. Kaeli
4th International Conference on Control, Virtual Instrumention and Digital Systems, Mexico City, Mexico, August, 2002, pp. 29-38.

Exploiting Pseudo-schedules to Guide Data Dependence Graph Partitioning [pdf]
Alex Aletà, Josep M. Codina, Jesús Sánchez, Antonio González, David Kaeli
Proceedings of IEEE Parallel Architectures and Compilation Techniques, Sept. 2002, pp. 281-290.

Register Pressure-Based Modulo Scheduling for Clustered VLIW Architectures [pdf]
Alex Aletà, Josep M. Codina, Jesús Sánchez, Antonio González, David Kaeli
Proceedings of Journadas de Concurrencia, June 2002, pp. 1-10.

Localized Message Passing Structures for High Speed Ethernet Packet Switching [pdf]
Morteza Fayyazi, David Kaeli
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, June 2002, pp. 1551-1557.

2001

Introduction to the Special Issue on High Performance Memory Systems [pdf]
Haldun Hadimioglu, David Kaeli and Fabrizio Lombardi
IEEE Transactions on Computers , Vol. 50, No. 11, November 2001, pp. 1103-1105.

Analysis of Dynamic Loops (in Spanish) [pdf]
Marcos R. de Alba, David R. Kaeli, Eun-Sung Kim
3rd International Conference on Control, Virtual Instrumention and Digital Systems, Mexico City, Mexico, August, 2001, pp. 93-106.

2000

Welcome to the Opportunities of Binary Translation [pdf]
Erik R. Altman, David Kaeli, Yaron Sheffer
IEEE Computer Magazine , March 2000, pp. 40-46.

1999

Analysis of Temporal-based Program Behavior for Improved Instruction Cache Performance [pdf]
John Kalamatianos, Alireza Khalafi, David R. Kaeli, Waleed Meleis
IEEE Transactions on Computers , Vol. 10, No. 2, February 1999, pp. 168-175.

Indirect Branch Predication Using Data Compression Techniques [pdf]
John Kalamatianos, David R. Kaeli
Journal of Instruction Level Parallelism , Vol. 1, 1999.

Cache Line Coloring Using Real and Estimated Profiles
Amir Hooshang Hashemi, David R. Kaeli, Brad Calder
Digital Technical Journal Special Issue on Tools and Languages , February 1999.

Branch-directed and Pointer-based Data Cache Prefetching [ps]
Yue Liu, Mona Dimitri, David R. Kaeli
Journal of Systems Architecture: Special Issue on Microprocessor Architecture, Vol. 45, 1999, pp. 1047-1073.

Using Cache Line Coloring to Perform Aggressive Procedure Inlining [pdf]
Hakan Aydin, David Kaeli
ACM SIGARCH News, 28(1) March 2000, pp. 62-71.

Accurate Simulation and Evaluation of Code Reordering [ps]
John Kalamatianos, David R. Kaeli
Proceedings of the IEEE International Symposium on the Performance Analysis of Systems and Software , Austin, TX, April 2000.

Parameter Value Characterization of Windows NT-based Applications [ps]
John Kalamatianos, David R. Kaeli, Ronnie Chaiken
Workload Characterization: Methodology and Case Studies , IEEE Computer Society, 1999, pp.142-149.

1998

Tracing and Characterization of NT-based System Workloads [pdf]
Jason Casmira, David Kaeli, David Hunter
Digital Technical Journal Special Issue on Tools and Languages , Vol. 10, No. 1, December 1998, pp. 6-21.

VLSI Design in the 3rd Dimension [ps]
Strickland, S., Ergin, E., Kaeli, D. R., and Zavracky, P.
Integration: the VLSI Journal, Vol. 25/1, September 1998, pp. 1-16.

Predicting Indirect Branches via Data Compression [ps]
John Kalamatianos, David R. Kaeli
Proc. of the 31st International Symposium on Microarchitecture , Dallas, TX, December 1998, pp.272-281.

Temporal-Based Procedure Reordering for Improved Instruction Cache Performance [ps]
John Kalamatianos, David R. Kaeli
Proc. of the 4th International Conference High Performance Computer Architecture , Las Vegas, NV, February 1998, pp. 244-253.

Operating System Impact on Trace-Driven Simulation [ps]
Jason Casmira, John Fraser, David Kaeli, Waleed Meleis
Proceedings of the 31st Simulation Symposium, Boston, MA, April 1998, pp. 76-82.

1997

Performance Analysis on a CC-NUMA Prototype [pdf]
Kaeli, D. R., Fong, L. L., Booth, R. C., Imming, K. C., and Weigel, J. P.
IBM Journal of Research and Development, Vol. 41, No. 3, May 1997, pp. 205-214.

Improving the Accuracy of History-Based Branch Prediction [pdf]
David R. Kaeli and Philip G. Emma
IEEE Transactions on Computers, Vol. 46, No. 4, April 1997, pp. 469-472.

Three Dimensional Circuits Using Transferred Films
P. Sailer, P. Singhal, J. Hopwood, D. R. Kaeli, P. M. Zavracky, K. Warner, and D. P. Vu,
IEEE Circuits and Devices Magazine, November 1997, pp. 27-30. Reprints available by request.

Efficient Procedure Mapping Using Cache Line Coloring [ps]
Amir H. Hashemi, David R. Kaeli, Brad Calder
Proceedings of the SIGPLAN Conference on Programming Language Design and Implementation, June 1997, pp. 171-182.

Analytic Models of Workload Behavior and Pipeline Performance [ps]
Mark S. Squillante, David R. Kaeli, Himanshu Sinha
Proceedings of IEEE MASCOTTS, Haifa, Israel , January 1997, pp. 91-96.


2016

A Fast Level-Set Segmentation Algorithm for Image Processing Designed For Parallel Architectures [bibtex]
Julian Gutierrez, Fanny Nina-Paravecino, David Kaeli
Sixth Workshop on Irregular Applications: Architectures and Algorithms, 2016.

Performance Evaluation of compiler-based Redundant Multithreading in an HSA environment [bibtex]
Charu Kalra, Daniel Lowell, John Kalamatianos, Vilas Sridharan, David Kaeli
IEEE Workshop on Silicon Errors in Logic-System Effects (SELSE), 2016.

Evaluating the Resilience of Highly Parallel Applications [bibtex]
Mark Wilkening, Fritz Previlon, Vilas Sridharan, David Kaeli
IEEE Workshop on Silicon Errors in Logic-System Effects (SELSE), 2016.

Multi2C: an open GPU compiler backed by LLVM [bibtex]
Xiang Gong, Rafael Ubal, David Kaeli
Boston Area Architecture Workshop (BARC). Boston, 2016.

CLIP: An IPB-based GPU compute clustering framework, [bibtex]
Carter McCardwell, David Kaeli
Boston Area Architecture Workshop (BARC). Boston, 2016.

Memory Characterization of Embedded Applications [bibtex]
Trevor Gale, Spencer Hance, Yash Ukidave, Charu Kalra, Kaushal Sanghai, David Kaeli
Boston Area Architecture Workshop (BARC). Boston, 2016.

Correlation Timing Attack on a GPU [bibtex]
Zhen Jiang, Yunsi Fei, David Kaeli
Boston Area Architecture Workshop (BARC). Boston, 2016.

2015

Hardware support for Local Memory Transactions on GPU Architectures [link]
Alejandro Villegas, Angeles Navarro, Rafael Asenjo Plaza, Oscar Plata, Rafael Ubal, David Kaeli
10th ACM SIGPLAN Workshop on Transactional Computing (TRANSACT), Portland, 2015.

Securing virtual execution environments through machine learning-based intrusion detection [link]
Fatemeh Azmandian, David R Kaeli, Jennifer G Dy, Javed A Aslam
IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), 2015.

The Use of Benchmarks for High-Reliability Systems [pdf]
Heather Marie Quinn, William Robinson, Paolo Rech, Arno Barnard, Miguel Aquirre, Marco Desogus, Luis Entrena, Mario Garcia-Valderas, Steve Michael Guertin, David Kaeli, Fernanda Lima Lima Kastensmidt, Bradley Kiddie, Antonio Sanchez-Clement, Matteo Sonza Reorda, Sterpone, Michael Wirthlin
Los Alamos National Laboratory (LANL), 2015.

Side-Channel Analysis of MAC-Keccak Hardware Implementation [link]
Pei Luo, Yunsi Fei, Xin Fang, A. Adam Ding, David Kaeli, Miriam Leeser
Proceedings of the 4th Workshop on Hardware and Architectural Support for Security and Privacy (HASP). ACM, Portland, 2015.

Mahout on heterogeneous clusters using HadoopCL [link]
Xiangyu Li, Max Grossman, David Kaeli
Proceedings of the 2nd Workshop on Parallel Programming for Analytics Applications (PPAA). ACM, San Fransisco, 2015.

Visualization of OpenCL Application Execution on CPU-GPU Systems [link]
Amir Kavyan Ziabari, Rafael Ubal, Dana Schaa, David Kaeli, Rafael Ubal, Dana Schaa, David Kaeli
Workshop on Computer Architecture Education (WCAE). ACM, Portland 2015.

Examining the Impact of ACE interference on Multi-Bit AVF Estimates [link]
Fritz Previlon, Mark Wilkening, Vilas Sridharan, Sudhanva Gurumurthi, David R Kaeli
IEEE Workshop on Silicon Errors in Logic-System Effects (SELSE), 2015.

Engaging Sophomores in Embedded Design using Robotics [link]
Amir Momeni, Fritz Previlon, Agamemnon Despopoulos, Gunar Schirner, John Kimani, David Kaeli
Workshop on Computer Architecture Education (WCAE). ACM, Portland 2015.

Exploring the Features of OpenCL 2.0 [link]
Saoni Mukherjee, Xiang Gong, Leiming Yu, Carter McCardwell, Yash Ukidave, Tuan Dao, Fanny Nina Paravecino, David Kaeli
International Workshop on OpenCL (IWOCL). Palo Alto, 2015.

High Performance Computing of Fiber Scattering Simulation [link]
Leiming Yu, Yan Zhang, Xiang Gong, Nilay Roy, Lee Makowski, David Kaeli
Workshop on General Purpose Processing Using GPUs (GPGPU). ACM, 2015.

Side Channel Attack on GPUs [link]
SSaoni Mukherjee, Chao Luo, Colleen Finnegan, Yunsi Fei, David Kaeli
Boston Area Architecture Workshop (BARC). Boston, 2015.

FIR filtering and AES encryption with OpenCL 2.0 [link]
Carter McCardwell, Tuan Dao, Saoni Mukherjee, David Kaeli
Boston Area Architecture Workshop (BARC). Boston, 2015.

Speech Recognition on Modern Graphic Processing Units [link]
Leiming Yu, John Magrath, Ajey Pandey, Matthew Sears, David Kaeli
Boston Area Architecture Workshop (BARC). Boston, 2015.

2014

GPU-accelerated HMM for Speech Recognition [link]
Leiming Yu, Yash Ukidave, David Kaeli
Workshop on Heterogeneous and Unconventional Cluster Architectures and Applications (HUCAA). 2014.

Scalable and efficient implementation of correlation power analysis using graphics processing units (GPUs) [link]
Tushar Swamy, Neel Shah, Pei Luo, Yunsi Fei, David Kaeli
Third Workshop on Hardware and Architectural Support for Security and Privacy (HASP). 2014

Performance Evaluation and Optimization Mechanisms for Inter-operable Graphics and Computation on GPUs [link]
Yash Ukidave, Xiang Gong, David Kaeli
Proceedings of the 7th Workshop on General Purpose Processing Using GPUs

2013

Valar: a benchmark suite to study the dynamic behavior of heterogeneous systems [pdf]
Perhaad Mistry, Yash Ukidave, Dana Schaa, David Kaeli
Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units

Architectural Vulnerability Modeling and Analysis of Integrated Graphics Processors [link]
Jeon, Hyeran, Mark Wilkening, Vilas Sridharan, Sudhanva Gurumurthi, and G. Loh
In Workshop on System Effects of Logic Soft Errors (SELSE-9), Stanford, CA, March 2013

Analyzing Optimization Techniques for Power Efficiency on Heterogeneous Platforms [link]
Yash Ukidave, David Kaeli
Proceeding of the 3rd Workshop on Accelerators and Hybrid Exascale Systems, IPDPS'13, May 2013,

A Framework for Profiling and Performance Monitoring of Heterogeneous Applications [pdf]
Perhaad Mistry, Yash Ukidave, David Kaeli
6th Workshop on Programmability Issues for Heterogeneous Multicores, 2013

2012

Enabling Task-level Scheduling on Heterogeneous Platforms [pdf]
Enqiang Sun, Dana Schaa, Richard Bagley, Norman Rubin and David Kaeli
Proceedings of the 5th Workshop on General Purpose Processing on Graphics Processing Units(GPGPU5), London, UK, Mar. 2012

Securing Cloud Storage Systems through a Virtual Machine Monitor [link]
Fatemeh Azmandian, David R Kaeli, Jennifer G Dy, Javed A Aslam, Dana Schaa
Workshop on Secure and Resilient Architectures and Systems (WSRAS), 2012.

Enhanced Boosting-based Algorithm for Intrusion Detection in Virtual Machine Environments [link]
Malak Alshawabkeh, David Kaeli, Javed A. Aslam, Jennifer Dy
In Workshop on Secure and Resilient Architectures and Systems (WSRAS), In press, 2012

2011

Caracal: Dynamic Translation of Runtime Environments for GPUs [link]
Rodrigo Dominguez, Dana Schaa, and David Kaeli
Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units (GPGPU-4). ACM, New York, NY, USA, Article 5 , 7 pages.

2010

Accelerating the Local Outlier Factor Algorithm on a GPU for Intrusion Detection Systems [pdf]
Malak Alshawabkeh,Byunghyun Jang and David Kaeli
3rd Workshop on GPGPU (GPGPU3), Pittsburgh, PA USA, 2010

Using PVF Traces to Accelerate AVF Modeling
Vilas Sridharan and David R. Kaeli
IEEE Workshop on Silicon Errors in Logic - System Effects (SELSE-6), Mar 2010

2009

A Binary Instrumentation Tool for the Blackfin Processor [pdf]
Enqiang Sun and David Kaeli
Workshop on Binary Instrumentation and Applications, Held in Conjunction with MICRO-42,

The Effect of Input Data on Program Vulnerability [pdf]
Vilas Sridharan and David R. Kaeli
IEEE Workshop on Silicon Errors in Logic - System Effects (SELSE-5), Mar 2009

Accelerating Phase Unwrapping and Affine Transformations for Optical Quadrature Microscopy using CUDA [pdf]
Perhaad Mistry, Sherman Braganza, David Kaeli, Miriam Lesser
2nd Workshop on General Purpose Computation on GPUs (GPGPU2), Mar. 2009

Architecture-Aware Optimization Targeting Multithreaded Stream Computing [pdf]
Byunghyun Jang, Synho Do, Homer Pien, David Kaeli
2nd Workshop on General Purpose Computation on GPUs (GPGPU2), Mar. 2009

2008

Performance Prediction in Multi-GPU Execution
Dana Schaa and David Kaeli
NVISION August, 2008

Field Failure Analysis of Microprocessors used in Information Systems [pdf]
Syed Z. Shazli, Mohammed Abdul-Aziz, Mehdi B. Tahoori, David R. Kaeli
Workshop on Resilience Assessment and Dependability Benchmarking, June 2008.

Performance Evaluation of Virtual Appliances [pdf]
Zhaoqian Chen and David Kaeli
First International Workshop on Virtualization Performance: Analysis, Characterization, and Tools (VPACT 08), April, 2008.

Resource-Conscious Optimization of Cryptographic Algorithms on an Embedded Architecture [pdf]
Wassim Bassalee and David Kaeli
Proceedings of the ACM Workshop on Optimizations for DSP and Embedded Systems, April 2008

An M/G/1 Queue Model for Multiple Application on Storage Area Networks [pdf]
Emmanuel Arzuaga and David R. Kaeli
Proceedings of the 11th Workshop on Computer Architecture Evaluation using Commercial Workloads (CAECW-11), February 2008.

A Taxonomy to Enable Error Correction and Recovery in Software [pdf]
Vilas Sridharan, Dean A. Liberty, David R. Kaeli
Workshop on Quality-Aware Design (W-QUAD) in conjunction with the 35th International Symposium on Computer Architecture (ISCA-35), June 2008

Quantifying Software Vulnerability [pdf]
Vilas Sridharan, David R. Kaeli
1st Workshop on Radiation Effects and Fault Tolerance at Nanometer Technologies (WREFT-1) in conjunction with Computing Frontiers, May 2008

2007

Stream Image Processing on a Dual-Core Embedded System [pdf]
Michael G. Benjamin and David Kaeli
Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 7th International Workshop (SAMOS 2007), July 2007

A Code Layout Framework for Embedded Processors with Configurable Memory Hierarchy [pdf]
Kaushal Sanghai, Alex Raikman, Ken Butler, David Kaeli
Proceedings of the 5th Workshop on Optimizations for DSP and Embedded Systems, March 2007

Stream Programming on the Blackfin Architecture [pdf]
Michael G. Benjamin and David Kaeli
Proceedings of the 4th Boston Area Computer Architecture Workshop, January 2007

Characterizing the Relationship Between Sparse Matrix Preconditioners and the Storage Hierarchy [pdf]
Diego Rivera, David Kaeli, and Misha Kilmer
Proceedings of the 4th Boston Area Computer Architecture Workshop, January 2007

Performance Characterization of SPEC CPU2006 Integer Benchmarks [pdf]
Dong Ye, Joydeep Ray, Christophe Harle, and David Kaeli
Proceedings of the 4th Boston Area Computer Architecture Workshop, January 2007

Instruction-Level Energy Estimation [pdf]
Seth Molloy, David Kaeli
Proceedings of the 4th Boston Area Computer Architecture Workshop, January 2007

Case Study: Soft Error Rate Analysis in Storage Systems [ps]
Brian Mullins, Hossein Asadi, Mehdi B. Tahoori, David Kaeli
Proceedings of the 4th Boston Area Computer Architecture Workshop, January 2007

Use of an Embedded Configurable Memory for Stream Image Processing [pdf]
Michael G. Benjamin and David Kaeli
Proceedings of the 5th Workshop on Optimizations for DSP and Embedded Systems, March 2007

Reliability in the Shadow of Long-Stall Instructions [pdf]
Vilas Sridharan, David Kaeli, Arijit Biswas
3rd Annual Workshop on Silicon Errors in Logic - System Effects (SELSE-3), April 2007

2006

Hunting Trojan Horses [pdf]
Micha Moffie, Winnie Cheng, David Kaeli, Qin Zhao
Proceedings of the Workshop on Architecture and System Support for Improving Software Dependability, Oct 2006

Experiences with the Blackfin Architecture for an Embedded Systems Lab [pdf]
Michael Benjamin, David Kaeli, Richard Platcow
Proceedings of the Workshop on Computer Architecture Education, July 2006

2005

A Benchmark Suite for Behavior-Based Security Mechanisms [pdf]
Dong Ye, Micha Moffie and David Kaeli
Proceedings of the Workshop on Software Security Assurance Tools, Techniques and Metrics, November 2005.

ASM: An Application Security Monitor [pdf]
Micha Moffie and David Kaeli
Proceedings of the Workshop on Binary Instrumentation adn Applications, September 2005, pp. 31-36.

Reliability Tradeoffs in Design of Cache Memories [pdf]
Hossein Asadi, Vilas Sridharan, Mehdi B. Tahoori, David Kaeli
1st Workshop on Architectural Reliability (WAR) in conjunction with the International Symposium on Microarchitecture (MICRO-38), 2005.

2002

Realizing High IPC Through a Scalable Memory-Latency Tolerant Multipath Microarchitecture [pdf]
D. Morano, A. Khalafi, D.R. Kaeli, A. K. Uht
Presented at MEDEA Workshop, Charlottesville, VA, September 2002.

2001

Runtime Predictability of Loops [pdf]
Marcos R. de Alba, David R. Kaeli
IEEE 4th Annual Workshop on Workload Characterization, Austin, Tx, December, 2001, pp. 91-98.

Profile-guided Tuning of Heap-based Memory Access [pdf] July 2001
Efe Yardimci, David Kaeli
2nd Workshop on Memory Performance Issues, Goteberg, Sweden,

1999

Studying the Performance of the FX!32 Binary Translation System [ps]
Paul J. Drongowski, David Hunter, Morteza Fayyazi, David Kaeli, Jason Casmira
Proceedings of the 1st Workshop on Binary Translation , Newport Beach, CA, Oct. 1999.

A Study of Dynamic Branch Predication for SHARC DSPs [ps]
Suleyman Sair, David R. Kaeli, Jose Fridman
Proceedings of the 2nd International Workshop on Compiler and Architecture Support for Embedded Systems (CASES'99), Washington, D.C., Oct. 1999.

1998

A Study of Loop Unrolling for VLIW-based DSP Processors [ps]
Suleyman Sair David, David R. Kaeli, Waleed Meleis
Proc. of the 1998 IEEE Workshop on Signal Processing Systems (SiPS '98) , October 1998, pp. 519-527.

1997

Procedure Mapping Using Static Call Graph Estimation [ps]
Amir H. Hashemi, David R. Kaeli, Brad Calder
Proceedings of the Workshop on the Interaction between Compilers and Computer Architectures, San Antonio, Texas, February 1997, also appearing in the IEEE TCCA Newsletter, 1997.

IISWC 2016 [link]

IWOCL 2015 [link]

ASPLOS 2014 [link]

ISCA 2013 [link]

IWOCL 2013 [link]

PACT 2012 [link]

ICPE 2012 [link]

PACT 2011 [link]

SBAC-PAD 2007 [link]

2006

Hunting Trojan Horses [pdf]
M. Moffie, David Kaeli, NUCAR Technical Report, January 2006.

2002

A Software Communications Architecture Compliant Software Defined Radio Implementation [pdf]
Sabri Murat Bicer, MS Thesis, June 2002.

1997

A Code Annotation Tool for Capturing Operating System Execution June 1997. [ps]