Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale
Kunal Jain, A. Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Ruehle, Saravan Rajmohan, Shashwat Jaiswal, Yogesh Simmhan, Anoop Kulkarni, Steve Kofsky
arXiv | February 2025