Curriculum Vitae

(Updated October 2025)

Huascar Sanchez

Research Summary and Skills
Computer Science Researcher with over $15$ years of experience at the intersection of AI/ML and software engineering. Proven expertise in building and operating large-scale systems for "Big Code" analytics and programming language processing. Deep knowledge of AI/ML applications in source code analysis, coding automation, and social-cyber security. Strong background in modeling, analyzing, and detecting social-cyber operations targeting open-source ecosystems. My recent work leverages multi-LLM collaboration, LLM chemistry estimation (LLM matchmaking), and multi-LLM consensus to advance the development of compound AI systems—driving progress in workflow-aware reasoning, multi-LLM orchestration, data processing, knowledge extraction, and source code analysis. Skilled in prototyping novel solutions, along with a track record of cross-disciplinary leadership, and effective communication across technical and non-technical teams. Passionate about turning cutting-edge research into practical, scalable features that drive automation and productivity.
Skills
Multi-LLM Collaboration Optimization
Programmer Productivity
Multi-LLM Recommendation
Automated Software Engineering
Machine Learning on Source Code
Source Code Curation and Analysis
Social-Cyber Security
Data Mining (Programming Language Processing)
Research Interests
Multi-LLM collaboration optimization and LLMs chemistry estimation for compound AI and software tasks.
Social-cyber security for protecting the integrity of open-source ecosystems.
Source code analysis and delta-debugging-inspired methods for identifying and characterizing emergent ML model misbehaviors.
Software engineering for AI/ML, AI/ML for automated software engineering.
Crowdsourcing, re-imagined with collaborative LLMs, software tools, and humans.
Current Position
2024 – now
SRI International · Menlo Park, CA
Senior Computer Scientist, Computer Science Laboratory
Leading applied research in AI/ML for code generation and analysis, focusing on optimal multi-LLM collaboration for complex, compound AI tasks. Establishing technical direction and developing principled methods to evaluate response quality, task accuracy, and inter-model dynamics ("chemistry") in structured collaborative tasks—advancing the design of optimal LLM ensembles.
Projects
2025 – now
Reliable Multi-AI Collaboration (Team: SRI)
Leading the development of a framework for optimizing multi-LLM collaboration in complex compound AI tasks by estimating LLM chemistry—and incorporating Bayes-driven consensus mechanisms for uncertainty quantification to enable the formation of efficient, effective, and stable LLM ensembles.
2024 – now
ARPA-H PARADIGM (Team: SRI)
Developing multi-LLM strategies and workflows for extracting data types and constraints, inferring JSON schemas, and synthesizing schema-compliant code from diverse healthcare specifications.
2024 – now
IARPA ReSCIND (Team: SRI, IHMC, TwoSix, UFL, RAD, GMU, ColumbiaU, SimSpace)
Leading the development of Cyber-Specific Computational Cognitive Models using Self-Supervised Machine Learning, Information Foraging Theory, and Fuzzy Cognitive Maps.
2024 – now
ARPA-H DIGIHEALS (Team: SRI, Confidencial Inc., Homeward Health)
Advancing the automatic generation of formal representations from natural language using neuro-symbolic AI, and leading the development of multi-LLM (Agentic) workflows for healthcare data analysis, formalization, and composition.
Previous projects*
* Order of projects is arbitrary.
Education
2015
University of California Santa Cruz · Ph.D. in Computer Science
Dissertation: Source code Curation Tooling for the Code Forager
Advisor: Prof. Jim Whitehead
2006
San Jose State University · M.S. in Software Engineering
MS Thesis title: Building Systems Using Patterns: Creating Knowledge Maps
Advisor: Prof. M.E. Fayad
2001
Catholic University of Nicaragua · B.S. in Information Systems Engineering
Professional Employment
2021 – 2023
SRI International · Menlo Park, CA
Advanced Computer Scientist II, Computer Science Laboratory
Continuing to lead DARPA Social Cyber research to secure open-source software supply chains through dynamic behavior embeddings that model social-cyber threats, while developing multimodal, multi-task deep learning models for open-source software introspection and error profiling.
2019 – 2021
SRI International · Menlo Park, CA
Advanced Computer Scientist I, Computer Science Laboratory
Guiding the integration of formal verification and program analysis for continuous assurance case construction. Leading DARPA SocialCyber as Principal Investigator, shaping project vision, delivering hybrid AI methods for securing software supply chains, mentoring junior researchers, and coordinating reporting across stakeholders.
2018 – 2019
SRI International · Menlo Park, CA
Advanced Computer Scientist, Computer Science Laboratory
Applying machine learning, large-scale graph mining, and dynamic program analysis to open-source software security, and developing integration and verification methods for systems of systems in distributed environments.
2017 – 2018
SRI International · Menlo Park, CA
Computer Scientist, Computer Science Laboratory
Leveraging Big Code analytics and machine learning on source code for code-to-code search, API recommendation, code synthesis, and source code analysis.
2015 – 2017
SRI International · Menlo Park, CA
Postdoctoral Fellow, Computer Science Laboratory, Host: Hassen Saïdi
Developing learning-to-rank models for large-scale code search and recommendation, and lightweight source code summarization techniques for improved source code understanding.
2014 – 2015
SRI International · Menlo Park, CA
Visiting Research Fellow, Computer Science Laboratory, Host: John Murray
Making formal verification scalable and accessible to broader audiences via crowd-sourced approaches—crowd-sourced formal verification—that enable participation without formal methods expertise.
2013 – 2015
University of California Santa Cruz · Santa Cruz, CA
Research Assistant, Software Introspection Laboratory
Empirical validation of crowd-sourced formal verification methods enabled by online games.
2012 – 2013
University of California Santa Cruz · Santa Cruz, CA
Research Assistant, Information Retrieval Laboratory
Developing latent representation learning methods from social media data for multimodal cyberbullying detection.
2011 – 2012
University of California Santa Cruz · Santa Cruz, CA
Research Assistant, Software Introspection Laboratory
Developing AI-guided, mixed-initiative content creation tools for automatic robot components design.
2010 – 2011
University of California Santa Cruz · Santa Cruz, CA
Research Assistant, Database Group
Advancing semi-automatic index tuning in DBMSs to make it easier and more effective.
2010 – 2012
Infomotor, Inc. · San Francisco, CA
Software Engineer (Consultant)
Streamlining both reporting and presentation of key performance metrics to deliver data that is both understandable and actionable.
2006 – 2009
Pearson VUE · Bloomington, MN
Software Engineer
Developing and maintaining distributed applications that deliver over four million computer-based tests annually worldwide, supporting clients in licensure, certification, and assessment markets.
Publications
Conference Proceedings
2024
Continuous Safety & Security Evidence Generation, Curation and Assurance Case Construction Using the Evidential Tool Bus.
N. Shankar, M. Kim, H. Sanchez, H. Ruess, T. Beyene, R. Bouchekir, D. Bhatt, S. Varadarajan, A. Murugesan, H. Ren, I. Hong-Wong, K. Siu, S. Varanasi, M. D. Ernst.
43rd AIAA/IEEE Digital Avionics Systems Conference (DASC), 2024.
2016
JayHorn: A Framework for Verifying Java programs.
T. Kahsai, P. Rümmer, H. Sanchez, and M. Schäf
CAV: 28th International Conference on Computer Aided Verification, 2016.
2016
Design Lessons From Binary Fission: A Crowd Sourced Game for Precondition Discovery.
K. Compton, H. Logas, J. Osborn, C. Chakrabortti, K. Coffman, D. Fava, D. Lederle-Ensign, Z. Lin, J. Mazeika, A. Mobramaein, J. Pagnutti, H. Sanchez, J. Whitehead, J. Murray, and B. Laurel.
CFDG: 11th International Joint Conference of DiGRA and FDG, 2016.
2016
Detecting Similar Programs via the Weisfeiler-Lehman Graph Kernel.
W. Li, H. Saïdi, H. Sanchez, M. Schäf, and P. Schweitzer.
ICSR: 15th International Conference on Software Reuse, 2016.
2016
Multistaging to Understand: Distilling the Essence of Java Code Examples.
H. Sanchez, J. Whitehead, and M. Schäf.
ICPC: 24th IEEE International Conference on Program Comprehension, 2016.
2015
Source Code Curation on StackOverflow: The Vesperin System.
H. Sanchez and J. Whitehead.
ICSE: 37th IEEE/ACM International Conference on Software Engineering, 2015.
2014
Software Verification Games: Designing Xylem, the Code of Plants.
H. Logas, J. Whitehead, M. Mateas, R. Vallejos, L. Scott, D. Shapiro, J. Murray, K. Compton, J. Osborn, O. Salvatore, Z. Lin, H. Sanchez, M. Shavlovsky, D. Cetina, S. Clementi, and C. Lewis.
FDG: 9th International Conference on the Foundations of Digital Games, 2014.
2014
Xylem: The Code of Plants.
H. Logas, J. Whitehead, M. Mateas, R. Vallejos, L. Scott, D. Shapiro, J. Murray, K. Compton, J. Osborn, O. Salvatore, Z. Lin, H. Sanchez, M. Shavlovsky, C. Lewis, D. Cetina, and S. Clementi.
FDG: 9th International Conference on the Foundations of Digital Games, 2014.
2013
SNIPR: Complementing Code Search with Code Retargeting Capabilities.
H. Sanchez.
ICSE: 35th International Conference on Software Engineering, 2013.
2012
Kaizen: A Semi-Automatic Index Advisor.
I. Jimenez, H. Sanchez, Q.T. Tran, and N. Polyzotis.
SIGMOD: The 2012 ACM International Conference on Management of Data, 2012.
2012
Twitter Bullying Detection.
H. Sanchez and S. Kumar.
NSDI: Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation, 2012.
2010
Knowledge Maps — Fundamentally Modular Approach to Software Architecture, Design, Development and Deployment.
M. E. Fayad, H.A. Sanchez, and S.K. Singh.
SEDE: 19th International Conference on Software Engineering and Data Engineering, 2010.
2006
The Branding Analysis Pattern.
M. E. Fayad, H.A. Sanchez, and S.K. Singh.
AICCSA: Proceedings of the IEEE International Conference on Computer Systems and Applications, 2006.
2005
Towards Scalable and Adaptable Software Architectures.
M.E. Fayad, H.S. Hamza, and H.A. Sanchez.
IRI: The 2005 IEEE International Conference on Information Reuse and Integration, 2005.
2005
The Dynamic Mapping Design Pattern.
J. Gresh, J. McKim, and H. Sanchez.
PLOP: 12th Pattern Language of Programs Conference, 2005.
2004
A Pattern Language for CRC Cards.
M. Fayad, H. Sanchez, and H. Hamza.
PLOP: 11th Conference on Pattern Languages of Programs, 2004.
2003
The Sampling Analysis Pattern.
H.A. Sanchez, B. Lai, and M.E. Fayad.
IRI: IEEE International Conference on Information Reuse and Integration, 2003.
Workshop Papers
2023
Software Introspection for Signaling Social-Cyber Operations.
H. Sanchez and B. Hitaj.
DESTION: 5th Workshop on Design Automation for CPS and IoT, 2023.
2016
Putting Your Passwords on Self Destruct Mode: Beating Password Fatigue.
H. Sanchez, J. Murray, and D. Sanchez.
SOUPS: 12th Symposium on Usable Privacy and Security, 2016.
2015
BotPrint: Casual Robotic Evolution.
K. Compton, D. Lederle-Ensign, Z. Lin, J. Mazeika, A. Mobramaein, J. Pagnutti, H. Sanchez, and J. Whitehead.
WCCG: 1st Workshop on Computational Creativity and Games, 2015.
Journal Articles
2011
Benchmarking Online Index-Tuning Algorithms.
I. Jimenez, J. LeFevre, N. Polyzotis, H. Sanchez, and K. Schnaitter.
IEEE Data Engineering Bulletin, 2011.
Books
2014
Software Patterns, Knowledge Maps, and Domain Analysis.
M. Fayad, H.A. Sanchez, S.G.K. Hegde, A. Basia, and A. Vakil.
CRC Press, 2014. ISBN 9781466571433
Reports and Other Publications
2025
LLM Chemistry Estimation for Multi-LLM Recommendation.
H. Sanchez and B. Hitaj.
ArXiv [Cs.LG]. arXiv:2510.03930 (Under Review)
2022
DesCert: Design for Certification.
N. Shankar, D. Bhatt, M. Ernst, M. Kim, S. Varadarajan, S. Millstein, J. Navas, J. Biatek, H. Sanchez, A. Murugesan, H. Ren.
Technical report, 2022. arxiv:2203.15178
2022
Trust in Motion: Capturing Trust Ascendancy in Open-Source Projects using Hybrid AI.
H. Sanchez and B. Hitaj.
ArXiv [Cs.SE]. arxiv:2210.02656
2021
Leveraging Team Dynamics to Predict Open-source Software Projects' Susceptibility to Social Engineering Attacks.
L. Giovanini, D. Oliveira, H. Sanchez, D. Shands.
ArXiv [Cs.SE]. arxiv:2106.16067
Patents
2023
Intelligent Collaborative Project Management.
D. Sanchez, H. Sanchez, H. Saïdi.
United States patent 11610173, 2023.
2022
Multipath verification of data transforms in a system of systems.
B. Dutertre, S. Jha, H. Sanchez, P. Lincoln, E. M. Pearson, R. Dean, I. A. Mason.
United States patent 2022/0197881, 2022.
Talks
2025
Collaborative AI for Data Type Extraction and JSON Schema Learning.
ARPA-H PARADIGM, Virtual meeting, June, 2025.
2025
Multi-level Cyber-Specific Computational Cognitive Models (C3Ms).
IARPA ReSCIND, Virtual meeting, May, 2025.
2023
Software Introspection for Signaling Social-Cyber Operations.
5th Workshop on Design Automation for CPS and IoT, TX. 2023.
2022
Graph-based Pattern Mining in Code Revisions for API Misuse Detection.
DARPA SocialCyber, Virtual meeting, 2022.
2022
Temporal Developer Activity Embeddings for Modeling Evolving Developer Behavior in Social Coding Environments.
DARPA SocialCyber, Virtual meeting, 2022.
2022
SIGNAL: Protecting the Integrity of Open-source Projects using Hybrid AI.
Computer Science Laboratory's Future Perfect Seminar, Virtual meeting. SRI International, Menlo Park, CA, 2022.
2021
A Multi-Task Learning Approach for Software Development Tasks Prediction.
DARPA SocialCyber, Virtual meeting, 2021.
2021
Change Point Analysis of Evolutionary Signatures in Social-Cyber Attacks.
DARPA SocialCyber, Virtual meeting, 2021.
2020
Continuous Certification of Open-source Software using Baseline DesCert.
DARPA ARCOS, Virtual meeting, 2020.
2019
Applying Machine Learning to Open-Source Code Retrieval and Verification.
Digital Foundations and Mathematics. Sandia National Laboratories. Livermore, CA, 2019.
2018
API Workflow Recommendations for Codebase Exploration.
DARPA MUSE. University of Texas at Austin, TX, 2018.
2017
Scaling Inter-Project Software Analysis.
DARPA MUSE, Massachusetts Institute of Technology (MIT), MA, 2017.
2015
Source Code Curation Tooling for the Code Forager.
Computer Science Laboratory Seminars. SRI International, Menlo Park, CA, 2015.
Service
Service to the Field
2025
IEEE Secure Development Conference Program Committee.
IEEE SecDev 2025. Call for Papers.
2023
NSF GRFP Reviewer.
NSF Graduate Research Fellowship Program (GRFP 2023). NSF 23-605
2022
Mining Challenge Program Committee.
19th International Conference on Mining Software Repositories (MSR 2022), Pittsburgh, PA.
2020
Artifact Evaluation Program Committee.
International Symposium on Software Testing and Analysis (ISSTA 2020), Los Angeles, CA.
2017
Paper Reviewer.
The 9th NASA Formal Methods Symposium (NFM 2017), Moffett Field, CA.
2016
Paper Reviewer.
The 13th International Colloquium on Theoretical Aspects of Computing (ICTAC 2016), Taipei, Taiwan.
2014
Paper Reviewer.
The 11th Working Conference on Mining Software Repositories (MSR 2014), Hyderabad India.
Mentorship
2022
Akshith Gunasekaran. PhD Student (Oregon State University)
Summer Internship, SRI International, Menlo Park, CA.
2017
Di Yang. PhD Student (University of California Irvine)
Summer Internship, SRI International, Menlo Park, CA.
2015
Xu Zhang. PhD Student (University of California Santa Cruz).
Software Development*
* Some projects listed above are private on GitHub; please contact me for access details.
Committer
LLM Chemistry
LLM Chemistry Estimation for Multi-LLM Recommendation; https://github.com/SRI-CSL/llm-chemistry (Python, PyTorch).
Aligner
LLM Workflows for JSON Schema learning; https://github.com/SRI-CSL/aligner (Python, PyTorch, Langchain).
Collaborative LLMs
Optimal Multi-AI Collaboration; https://github.com/hsanchez/SRI-CSL/collm (Python, PyTorch, Ollama).
Yellow-Patch
A source code and patch analysis library; https://github.com/hsanchez/yellow-patch (Python, PyTorch, HuggingFace's Accelerate).
Evidential Tool Bus
A framework for scaling assurance case construction, program verification and analysis; https://github.com/SRI-CSL/etb2 (Java, Python).
DesCert
A framework for continuous safety & security evidence generation, curation and assurance case construction; https://github.com/SRI-CSL/DesCert (Java, Python, C).
SIGNAL
Securing open-source software supply chains against social-cyber threats using hybrid AI methods; https://github.com/SRI-CSL/signal-public (Python, PyTorch).
SENSOR
Graph-based pattern mining in code revisions for API misuse detection; https://github.com/SRI-CSL/signal-sensor.
SimProg
Inter-project program similarity detection using graph kernels; https://github.com/SRI-CSL/simprog (Python, PyTorch).
JayHorn
Model checking for Java programs; https://github.com/jayhorn (Java).
do-like-javac
Program analysis and verification pipeline for Java projects; https://github.com/SRI-CSL/do-like-javac (Java, Scala, C, Python).
Callspace
Reliable API recommendation using likely program invariants and graph kernels; https://github.com/SRI-CSL/smac/callspace (Scala, Java).
Randoop Gradle Plugin
Gradle plugin for running Randoop unit test generator; https://github.com/SRI-CSL/randoop-gradle-plugin (Kotlin, Java).
Daikon Gradle Plugin
Gradle plugin for running Daikon invariant detector; https://github.com/SRI-CSL/daikon-gradle-plugin (Kotlin, Java).
Contributor
CHEKOFV
A framework for crowd-sourced formal verification; https://github.com/SoftwareIntrospectionLab/chekofv (Python, Nodejs, TypeScript, ActionScript).
PASCALI
A set of tools for leveraging big code analytics in program analysis, code search, and program synthesis; https://github.com/SRI-CSL/pascali (Python, Scala, Java).