Evaluation of the Programming Skills of Large Language Models
