logo
ResearchBunny Logo
Large Language Models for Code Analysis: Do LLMs Really Do Their Job?

Computer Science

Large Language Models for Code Analysis: Do LLMs Really Do Their Job?

C. Fang, N. Miao, et al.

This paper delivers a comprehensive evaluation of large language models (LLMs) for code analysis, including the challenging case of obfuscated code, and presents real-world case studies. Findings indicate LLMs can assist in automating code analysis while exhibiting certain limitations. Research conducted by Chongzhou Fang, Ning Miao, Shaurya Srivastav, Jialin Liu, Ruoyu Zhang, Ruijie Fang, Asmita, Ryan Tsang, Najmeh Nazari, Han Wang, and Houman Homayoun.

00:00
00:00
~3 min • Beginner • English
Abstract
Large language models (LLMs) have demonstrated significant potential in the realm of natural language understanding and programming code processing tasks. Their capacity to comprehend and generate human-like code has spurred research into harnessing LLMs for code analysis purposes. However, the existing body of literature falls short in delivering a systematic evaluation and assessment of LLMs' effectiveness in code analysis, particularly in the context of obfuscated code. This paper seeks to bridge this gap by offering a comprehensive evaluation of LLMs' capabilities in performing code analysis tasks. Additionally, it presents real-world case studies that employ LLMs for code analysis. Our findings indicate that LLMs can indeed serve as valuable tools for automating code analysis, albeit with certain limitations. Through meticulous exploration, this research contributes to a deeper understanding of the potential and constraints associated with utilizing LLMs in code analysis, paving the way for enhanced applications in this critical domain.
Publisher
Proceedings of the 33rd USENIX Security Symposium
Published On
Aug 14, 2024
Authors
Chongzhou Fang, Ning Miao, Shaurya Srivastav, Jialin Liu, Ruoyu Zhang, Ruijie Fang, Asmita, Ryan Tsang, Najmeh Nazari, Han Wang, Houman Homayoun
Tags
Large language models
Code analysis
Obfuscated code
Automated analysis
Evaluation
Real-world case studies
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny