Abstract
The use of generative AI-based coding assistants like ChatGPT and Github Copilot is a reality in contemporary software development. Many of these tools are provided as remote APIs. Using third-party APIs raises data privacy and security concerns for client companies, which motivates the use of locallydeployed language models. In this study, we explore the tradeoff between model accuracy and energy consumption, aiming to provide valuable insights to help developers make informed decisions when selecting a language model. We investigate the performance of 18 families of LLMs in typical software development tasks on two real-world infrastructures, a commodity GPU and a powerful AI-specific GPU. Given that deploying LLMs locally requires powerful infrastructure which might not be affordable for everyone, we consider both full-precision and quantized models. Our findings reveal that employing a big LLM with a higher energy budget does not always translate to significantly improved accuracy. Additionally, quantized versions of large models generally offer better efficiency and accuracy compared to full-precision versions of medium-sized ones. Apart from that, not a single model is suitable for all types of software development tasks.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE/ACM 22nd International Conference on Mining Software Repositories, MSR 2025 |
| Publisher | IEEE |
| Pages | 725-736 |
| Number of pages | 12 |
| ISBN (Electronic) | 9798331501839 |
| DOIs | |
| Publication status | Published - 13 Jun 2025 |
| Event | 22nd IEEE/ACM International Conference on Mining Software Repositories, MSR 2025 - Ottawa, Canada Duration: 27 Apr 2025 → 29 Apr 2025 Conference number: 22 |
Conference
| Conference | 22nd IEEE/ACM International Conference on Mining Software Repositories, MSR 2025 |
|---|---|
| Abbreviated title | MSR 2025 |
| Country/Territory | Canada |
| City | Ottawa |
| Period | 27/04/25 → 29/04/25 |
Keywords
- 2025 OA procedure
- Energy Efficiency
- LLMs
- Model Quantization
- Software Development
- Trade-Offs
- Coding Assistant