Curriculum Vitae
Prabhu Vellaisamy · pvellais@andrew.cmu.edu · Download PDF
Education
Sep 2021 – Dec 2026 (Expected)
Doctor of Philosophy, Electrical & Computer Engineering
Carnegie Mellon University — Pittsburgh, PA
Co-Advisors: Prof. J.P. Shen & Prof. Shawn Blanton · CMU NCAL, CMU ACTL, UCF UNARY Research Groups
CIT Dean's Fellowship · 2023 Qualcomm Innovation Fellowship Winner
CIT Dean's Fellowship · 2023 Qualcomm Innovation Fellowship Winner
Jan 2020 – May 2021
Master of Science, Electrical & Computer Engineering
Carnegie Mellon University — Pittsburgh, PA
Jun 2014 – Jul 2018
Bachelor of Technology, Electrical & Electronics Engineering
SRM Institute of Science and Technology — Chennai, India
Research Interests
Academic and Industry Experience
Jun 2026 – Sep 2026
Artificial Intelligence (AI) Research Scientist Intern Offer Accepted
Samsung Semiconductor Inc. — San Jose, CA
Mar 2026 – Jun 2026
Silicon Solution Engineering Intern Offer Accepted
NVIDIA Corporation — Santa Clara, CA
Jun 2024 – Aug 2024
AI Characterization & Tight Coupling Analysis Intern
Samsung Semiconductor Inc. — San Jose, CA
- Built SKIP, a PyTorch Profiler framework that uncovered critical LLM inference bottlenecks, revealing GH200 suffers 2.8× higher prefill latency and 4× larger CPU-bounded regions vs. Intel x86+H100 and AMD x86+A100.
- Spearheaded a 5-person CMU-Samsung research collaboration from concept to publication; first-authored paper accepted at ISPASS 2025.
Jun 2022 – Dec 2022
AI Architecture & Algorithm Intern Exemplary Performance Award
MediaTek USA Inc. — San Jose, CA (Full-time Jun–Sep; Part-time Remote Aug–Dec)
- Developed TubGEMM (ISVLSI 2023) and OzMAC (VLSI-SoC 2024), compute units for edge AI that reduced power consumption by 40%+ while maintaining throughput on TSMC N5 process.
PhD Research Highlights
2021 – Present
Doctoral Researcher
CMU NCAL / CMU ACTL — Co-Advisors: Prof. J.P. Shen & Prof. Shawn Blanton
- Collaborated with research teams across 4 research groups (CMUNCAL, CMU-ACTL, UCF-UNARY, NEXUS), delivering 12 peer-reviewed publications and mentoring graduate and undergraduate students.
- Characterized performance bottlenecks in LLM inference on GH200 through systematic profiling of model configurations; research funded by Samsung Semiconductor.
- Created Tempus Core, an INT8 temporal-unary convolution accelerator that achieved 53% area reduction, 44% power savings, and 5× iso-area throughput improvement over the baseline NVDLA convolution core.
- Created TNNGen, an automation framework that compiles PyTorch models to layout-ready netlists and was validated across 7 modalities.
- Developed TNN7, a set of 9 macros for a 7nm PDK extension to ASAP7, reducing energy-delay product (EDP) by 45% against the baseline design.
Presentations
- Invited Talk. “Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures.” Jülich Supercomputing Center, Jülich, Germany (Remote), May 20, 2025.
- Conference Presentation. “Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures.” IEEE ISPASS 2025, Ghent, Belgium, May 12, 2025.
- Conference Presentation. “Tempus Core: Area-Power Efficient Temporal-Unary Convolution Core for Low-Precision Edge DLAs.” IEEE DATE 2025, Lyon, France, April 1, 2025.
- Conference Presentation. “OzMAC: An Energy-Efficient Sparsity-Exploiting Multiply-Accumulate-Unit Design for DL Inference.” IEEE VLSI-SoC 2024, Tangier, Morocco, October 7, 2024.
- Conference Presentation. “Exploration of Unary Arithmetic-Based Matrix Multiply Units for Low Precision DL Accelerators.” IEEE ISVLSI 2024, Knoxville, TN, July 2, 2024.
- Conference Presentation. “TNNGen: Automated Design of Neuromorphic Sensory Processing Units for Time-Series Clustering.” IEEE ISCAS 2024, Singapore, May 21, 2024.
Fellowships, Awards, and Honors
- Amar Mukherjee Best Paper Award, ISVLSI 2025.
- ISVLSI 2024 Travel Grant.
- CMU GSA Conference Grant.
- Qualcomm Innovation Fellowship 2023.
- DAC 2022 Young Fellow.
- ASPLOS Young Architect 2022.
- Carnegie Institute of Technology Dean's Fellowship.
Teaching Experience
Carnegie Mellon University — Department of Electrical and Computer Engineering
| Course | Role | Semesters |
|---|---|---|
| 18-340/640: Hardware Arithmetic for Machine Learning | Teaching Instructor | 4 semesters (approximately 50 students per semester) |
| 18-743: Neuromorphic Computer Architecture & Processor Design | Teaching Instructor | 5 semesters (approximately 20 graduate students per semester) |
| 18-740: Modern Computer Architecture | Teaching Instructor | 1 semester (approximately 100 students) |
Technical Skills
Tools
Programming
Languages
Relevant Coursework
Professional Service
- IEEE Transactions on Very Large-Scale Integration (VLSI) Systems (IEEE TVLSI) reviewer.
- IEEE Journal of Exploratory Solid-State Computational Devices and Circuits (IEEE JXCDC) reviewer.
Professional Memberships and Honor Societies
- IEEE-Eta Kappa Nu (HKN).
- Sigma Xi Scientific Research Honor Society.