cv

Basics

Name	David N. Palacio
Label	Ph.D(c) Computer Science
Email	danaderp@gmail.com
Phone	(757) 279-4265
Url	https://danaderp.github.io/danaderp/
Summary	Ph.D. candidate specializing in deep learning and software engineering, with expertise in applying causal inference techniques to interpret large language models (LLMs), aimed at automating software maintenance tasks. Eager to leverage these skills as a Research Scientist focused on AI development, trustworthiness, alignment, and explainability.

Work

2021.10 - 2022.03
Research Intern

Microsoft

Published a patent for an innovative debugging tool designed to provide recourse to practitioners in explaining deep generative models trained on code as data. Worked in collaboration with four Microsoft Senior research scientists.
- Post Hoc Interpretability
2020.05 - 2020.08
PhD Intern

Cisco Systems

Investigated an information theory approach for interpreting and evaluating software retrieval techniques.
- Information Theory, Traceability
2016.02 - 2016.12
Software Engineer (Senior Back‑end)

KSMTI

Engineered reactive and functional programming architectures for enabling fast development of any type of marketplace business.
- Scala, Functional Programming

Education

2017.08 - 2024.08

Williamsburg, VA
PhD

College of William & Mary (W&M)

Computer Science

Awards

2014.10

Indian Government Scholarship in Noida, India

Indian Government

Certificates

	Distributed Machine Learning with Apache Spark
	Berkeley University	2016-08-15

Publications

2024

Which Syntactic Capabilities Are Statistically Learned by Masked Language Models for Code?

ICSME

Limitations of evaluating Masked Language Models (MLMs) in code completion tasks.
2024

Toward a Theory of Causation for Interpreting Neural Code Models

IEEE Transactions of Software Engineering

It concerned a causal theory for understanding LLMs for code.
2022

Debugging Tool for Code Generation Neural Language Models

United States Patent

A debugging tool identifies the smallest subset of an input sequence or rationales that influenced a neural language model.

Skills

	Deep Learning for Software Engineering
	PyMC3
	PyTorch
	Foundation Models
	LLMs for Code
	Unsupervised Models
	Code Generation

Languages

	Spanish
	Native speaker

	English
	Fluent

	German
	Intermediate

Interests

	LLMs for Code
	Explainability
	Interpretability
	Trustworthiness
	Causal Inference
	Representation Learning
	Data Analysis