(Updated July 2025)

Huascar Sanchez

Summary and Relevant Skills

Senior Computer Scientist with over 15 years of experience at the intersection of AI/ML and software engineering. Proven expertise in building and operating large-scale software systems for "Big Code" analytics and software introspection, with deep knowledge of AI/ML applications to software development and vice versa. Strong background in compound AI systems, with a special focus on multi-AI collaboration for data processing, knowledge extraction, information prediction, source code analysis, and software analytics. Skilled in prototyping novel solutions, solving complex technical problems, communicating across technical and non-technical audiences, and leading interdisciplinary teams in both research and applied domains.

Skills

Software Development & Introspection

Software Engineering for AI/ML

AI/ML for Software Engineering

Software Analytics

Recommendation Systems

Machine Learning for Source Code

Multi-LLM strategies for Schema Learning from Healthcare Specifications

Multi-LLM Collaboration for Coding Automation

Compound AI Systems for Health Data Analysis

Graph-based Pattern Mining

Research Interests

Multi-LLM Collaboration for coding automation, software introspection, and software analytics.

Program analysis methods for introspecting emergent AI/ML model behaviors.

AI/ML for software engineering, software engineering for AI/ML, and social-cyber security.

Crowdsourcing, re-imagined with collaborative LLMs, software tools, and humans.

Current Position

2024 – now

SRI International · Menlo Park, CA
Senior Computer Scientist, Computer Science Laboratory
Leading applied research in AI/ML for code generation and analysis using Multi-AI collaboration and compound AI systems. Defining technical direction and developing principled methods to estimate response quality, task accuracy, and the degree of interaction between independent LLMs in structured collaborative tasks—enabling optimal coordination and multi-LLM recommendation.

Projects

2024 – now

ARPA-H PARADIGM (Team: SRI)
Developing multi-LLM strategies and tooling for extracting data types and value constraints, learning JSON schemas, and generating schema-compliant code from diverse healthcare specification documents.

2024 – now

IARPA ReSCIND (Team: SRI, IHMC, TwoSix, UFL, RAD, GMU, ColumbiaU, SimSpace)
Leading the development of cyber-specific computational cognitive models using self-supervised machine learning, Information Foraging Theory, and Fuzzy Cognitive Maps.

2024 – now

ARPA-H DIGIHEALS (Team: SRI, Confidencial Inc., Homeward Health) ç Working on automatic generation of formal representations from natural language, and leading the development of Agentic workflows for health data analysis, formalization, and composition.

Previous projects^*

* Order of projects is arbitrary.

DARPA ARCOS, DARPA SocialCyber, DARPA SoSITE, DARPA MUSE, DARPA CSFV.

Education

2015

University of California Santa Cruz · Ph.D. in Computer Science
Dissertation: Source code Curation Tooling for the Code Forager
Advisor: Prof. Jim Whitehead

2006

San Jose State University · M.S. in Software Engineering
MS Thesis title: Building Systems Using Patterns: Creating Knowledge Maps
Advisor: Prof. M.E. Fayad

2001

Catholic University of Nicaragua · B.S. in Information Systems Engineering

Professional Employment

2021 – 2023

SRI International · Menlo Park, CA
Advanced Computer Scientist II, Computer Science Laboratory
Developing multimodal and multi-task deep learning models for open-source software introspection and error profiling; continuing to lead efforts as a DARPA Social Cyber PI to secure open-source software supply chains by developing dynamic behavior embeddings to model social-cyber threats.

2019 – 2021

SRI International · Menlo Park, CA
Advanced Computer Scientist I, Computer Science Laboratory
Guiding the integration of formal verification and program analysis for continuous assurance case construction. Leading DARPA SocialCyber as Principal Investigator—defining project vision, delivering hybrid AI methods for securing software supply chains, mentoring junior researchers, and coordinating reporting across stakeholders.

2018 – 2019

SRI International · Menlo Park, CA
Advanced Computer Scientist, Computer Science Laboratory
Applying machine intelligence, graph mining, and dynamic program analysis to open-source software security; developing approaches for the integration and verification of systems of systems in distributed environments.

2016 – 2018

SRI International · Menlo Park, CA
Computer Scientist, Computer Science Laboratory
Leveraging Big Code Analytics and AI/ML for code search, recommendation, synthesis, and analysis.

2015 – 2016

SRI International · Menlo Park, CA
Postdoctoral Fellow, Computer Science Laboratory, Host: Hassen Saïdi
Developing learning-to-rank models for large-scale code search and recommendation, and novel lightweight source code summarization techniques for improved code understanding.

2014 – 2015

SRI International · Menlo Park, CA
Visiting Research Fellow, Computer Science Laboratory, Host: John Murray
Making formal verification scalable and accessible to broader audiences, including those without formal methods expertise, through crowd-sourced formal verification.

2013 – 2015

University of California Santa Cruz · Santa Cruz, CA
Research Assistant, Software Introspection Laboratory
Empirical validation of Crowd-sourced formal verification methods.

2012 – 2013

University of California Santa Cruz · Santa Cruz, CA
Research Assistant, Information Retrieval Laboratory
Learning representations for multimodal cyber bullying detection.

2011 – 2012

University of California Santa Cruz · Santa Cruz, CA
Research Assistant, Software Introspection Laboratory
Developing AI-guided mixed-initiative content creation tools for automatic robot components design.

2010 – 2011

University of California Santa Cruz · Santa Cruz, CA
Research Assistant, Database Group
Making semi-automatic index tuning in DBMS easier and more effective.

2010 – 2012

Infomotor, Inc. · San Francisco, CA
Software Engineer (Consultant)
Streamlining both reporting and presentation of key performance metrics and delivering understandable and actionable data.

2006 – 2009

Pearson VUE · Bloomington, MN
Software Engineer
Developing and maintaining distributed applications delivering over four million computer-based tests annually worldwide, supporting clients in licensure, certification, and assessment markets.

Publications

Conference Proceedings

2024

Continuous Safety & Security Evidence Generation, Curation and Assurance Case Construction Using the Evidential Tool Bus.
N. Shankar, M. Kim, H. Sanchez, H. Ruess, T. Beyene, R. Bouchekir, D. Bhatt, S. Varadarajan, A. Murugesan, H. Ren, I. Hong-Wong, K. Siu, S. Varanasi, M. D. Ernst.
43rd AIAA/IEEE Digital Avionics Systems Conference (DASC), 2024.

2016

JayHorn: A Framework for Verifying Java programs.
T. Kahsai, P. Rümmer, H. Sanchez, and M. Schäf
CAV: 28th International Conference on Computer Aided Verification, 2016.

2016

Design Lessons From Binary Fission: A Crowd Sourced Game for Precondition Discovery.
K. Compton, H. Logas, J. Osborn, C. Chakrabortti, K. Coffman, D. Fava, D. Lederle-Ensign, Z. Lin, J. Mazeika, A. Mobramaein, J. Pagnutti, H. Sanchez, J. Whitehead, J. Murray, and B. Laurel.
CFDG: 11th International Joint Conference of DiGRA and FDG, 2016.

2016

Detecting Similar Programs via the Weisfeiler-Lehman Graph Kernel.
W. Li, H. Saïdi, H. Sanchez, M. Schäf, and P. Schweitzer.
ICSR: 15th International Conference on Software Reuse, 2016.

2016

Multistaging to Understand: Distilling the Essence of Java Code Examples.
H. Sanchez, J. Whitehead, and M. Schäf.
ICPC: 24th IEEE International Conference on Program Comprehension, 2016.

2015

Source Code Curation on StackOverflow: The Vesperin System.
H. Sanchez and J. Whitehead.
ICSE: 37th IEEE/ACM International Conference on Software Engineering, 2015.

2014

Software Verification Games: Designing Xylem, the Code of Plants.
H. Logas, J. Whitehead, M. Mateas, R. Vallejos, L. Scott, D. Shapiro, J. Murray, K. Compton, J. Osborn, O. Salvatore, Z. Lin, H. Sanchez, M. Shavlovsky, D. Cetina, S. Clementi, and C. Lewis.
FDG: 9th International Conference on the Foundations of Digital Games, 2014.

2014

Xylem: The Code of Plants.
H. Logas, J. Whitehead, M. Mateas, R. Vallejos, L. Scott, D. Shapiro, J. Murray, K. Compton, J. Osborn, O. Salvatore, Z. Lin, H. Sanchez, M. Shavlovsky, C. Lewis, D. Cetina, and S. Clementi.
FDG: 9th International Conference on the Foundations of Digital Games, 2014.

2013

SNIPR: Complementing Code Search with Code Retargeting Capabilities.
H. Sanchez.
ICSE: 35th International Conference on Software Engineering, 2013.

2012

Kaizen: A Semi-Automatic Index Advisor.
I. Jimenez, H. Sanchez, Q.T. Tran, and N. Polyzotis.
SIGMOD: The 2012 ACM International Conference on Management of Data, 2012.

2012

Twitter Bullying Detection.
H. Sanchez and S. Kumar.
NSDI: Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation, 2012.

2010

Knowledge Maps — Fundamentally Modular Approach to Software Architecture, Design, Development and Deployment.
M. E. Fayad, H.A. Sanchez, and S.K. Singh.
SEDE: 19th International Conference on Software Engineering and Data Engineering, 2010.

2006

The Branding Analysis Pattern.
M. E. Fayad, H.A. Sanchez, and S.K. Singh.
AICCSA: Proceedings of the IEEE International Conference on Computer Systems and Applications, 2006.

2005

Towards Scalable and Adaptable Software Architectures.
M.E. Fayad, H.S. Hamza, and H.A. Sanchez.
IRI: The 2005 IEEE International Conference on Information Reuse and Integration, 2005.

2005

The Dynamic Mapping Design Pattern.
J. Gresh, J. McKim, and H. Sanchez.
PLOP: 12th Pattern Language of Programs Conference, 2005.

2004

A Pattern Language for CRC Cards.
M. Fayad, H. Sanchez, and H. Hamza.
PLOP: 11th Conference on Pattern Languages of Programs, 2004.

2003

The Sampling Analysis Pattern.
H.A. Sanchez, B. Lai, and M.E. Fayad.
IRI: IEEE International Conference on Information Reuse and Integration, 2003.

Workshop Papers

2023

Software Introspection for Signaling Social-Cyber Operations.
H. Sanchez and B. Hitaj.
DESTION: 5th Workshop on Design Automation for CPS and IoT, 2023.

2016

Putting Your Passwords on Self Destruct Mode: Beating Password Fatigue.
H. Sanchez, J. Murray, and D. Sanchez.
SOUPS: 12th Symposium on Usable Privacy and Security, 2016.

2015

BotPrint: Casual Robotic Evolution.
K. Compton, D. Lederle-Ensign, Z. Lin, J. Mazeika, A. Mobramaein, J. Pagnutti, H. Sanchez, and J. Whitehead.
WCCG: 1st Workshop on Computational Creativity and Games, 2015.

Journal Articles

2011

Benchmarking Online Index-Tuning Algorithms.
I. Jimenez, J. LeFevre, N. Polyzotis, H. Sanchez, and K. Schnaitter.
IEEE Data Engineering Bulletin, 2011.

Books

2014

Software Patterns, Knowledge Maps, and Domain Analysis.
M. Fayad, H.A. Sanchez, S.G.K. Hegde, A. Basia, and A. Vakil.
CRC Press, 2014. ISBN 9781466571433

Reports and Other Publications

2022

DesCert: Design for Certification.
N. Shankar, D. Bhatt, M. Ernst, M. Kim, S. Varadarajan, S. Millstein, J. Navas, J. Biatek, H. Sanchez, A. Murugesan, H. Ren.
Technical report, 2022. arxiv:2203.15178

2022

Trust in Motion: Capturing Trust Ascendancy in Open-Source Projects using Hybrid AI.
H. Sanchez and B. Hitaj.
ArXiv [Cs.SE]. arxiv:2210.02656

2021

Leveraging Team Dynamics to Predict Open-source Software Projects' Susceptibility to Social Engineering Attacks.
L. Giovanini, D. Oliveira, H. Sanchez, D. Shands.
ArXiv [Cs.SE]. arxiv:2106.16067

Patents

2023

Intelligent Collaborative Project Management.
D. Sanchez, H. Sanchez, H. Saïdi.
United States patent 11610173, 2023.

2022

Multipath verification of data transforms in a system of systems.
B. Dutertre, S. Jha, H. Sanchez, P. Lincoln, E. M. Pearson, R. Dean, I. A. Mason.
United States patent 2022/0197881, 2022.

Talks

2025

Collaborative AI for Data Type Extraction and JSON Schema Learning.
ARPA-H PARADIGM, Virtual meeting, June, 2025.

2025

Multi-level Cyber-Specific Computational Cognitive Models (C3Ms).
IARPA ReSCIND, Virtual meeting, May, 2025.

2023

Software Introspection for Signaling Social-Cyber Operations.
5th Workshop on Design Automation for CPS and IoT, TX. 2023.

2022

Graph-based Pattern Mining in Code Revisions for API Misuse Detection.
DARPA SocialCyber, Virtual meeting, 2022.

2022

Temporal Developer Activity Embeddings for Modeling Evolving Developer Behavior in Social Coding Environments.
DARPA SocialCyber, Virtual meeting, 2022.

2022

SIGNAL: Protecting the Integrity of Open-source Projects using Hybrid AI.
Computer Science Laboratory's Future Perfect Seminar, Virtual meeting. SRI International, Menlo Park, CA, 2022.

2021

A Multi-Task Learning Approach for Software Development Tasks Prediction.
DARPA SocialCyber, Virtual meeting, 2021.

2021

Change Point Analysis of Evolutionary Signatures in Social-Cyber Attacks.
DARPA SocialCyber, Virtual meeting, 2021.

2020

Continuous Certification of Open-source Software using Baseline DesCert.
DARPA ARCOS, Virtual meeting, 2020.

2019

Applying Machine Learning to Open-Source Code Retrieval and Verification.
Digital Foundations and Mathematics. Sandia National Laboratories. Livermore, CA, 2019.

2018

API Workflow Recommendations for Codebase Exploration.
DARPA MUSE. University of Texas at Austin, TX, 2018.

2017

Scaling Inter-Project Software Analysis.
DARPA MUSE, Massachusetts Institute of Technology (MIT), MA, 2017.

2015

Source Code Curation Tooling for the Code Forager.
Computer Science Laboratory Seminars. SRI International, Menlo Park, CA, 2015.

Service

Service to the Field

2025

IEEE Secure Development Conference Program Committee.
IEEE SecDev 2025. Call for Papers.

2023

NSF GRFP Reviewer.
NSF Graduate Research Fellowship Program (GRFP 2023). NSF 23-605

2022

Mining Challenge Program Committee.
19th International Conference on Mining Software Repositories (MSR 2022), Pittsburgh, PA.

2020

Artifact Evaluation Program Committee.
International Symposium on Software Testing and Analysis (ISSTA 2020), Los Angeles, CA.

2017

Paper Reviewer.
The 9th NASA Formal Methods Symposium (NFM 2017), Moffett Field, CA.

2016

Paper Reviewer.
The 13th International Colloquium on Theoretical Aspects of Computing (ICTAC 2016), Taipei, Taiwan.

2014

Paper Reviewer.
The 11th Working Conference on Mining Software Repositories (MSR 2014), Hyderabad India.

Mentorship

2022

Akshith Gunasekaran. PhD Student (Oregon State University)
Summer Internship, SRI International, Menlo Park, CA.

2017

Di Yang. PhD Student (University of California Irvine)
Summer Internship, SRI International, Menlo Park, CA.

2015

Xu Zhang. PhD Student (University of California Santa Cruz).

Software Development^*

* Some projects above are private on GitHub; please contact me for access.

Committer

Multi-AI
Multi-AI Collaboration; https://github.com/hsanchez/SRI-CSL/multi-ai (Python, PyTorch).

Yellow-Patch
A source code and patch analysis library; https://github.com/hsanchez/yellow-patch (Python, PyTorch, HuggingFace's Accelerate).

Evidential Tool Bus
A framework for scaling assurance case construction, program verification and analysis; https://github.com/SRI-CSL/etb2 (Java, Python).

DesCert
A framework for continuous safety & security evidence generation, curation and assurance case construction; https://github.com/SRI-CSL/DesCert (Java, Python, C).

SIGNAL
Securing open-source software supply chains against social-cyber threats using hybrid AI methods; https://github.com/SRI-CSL/signal-public (Python, PyTorch).

SENSOR
Graph-based pattern mining in code revisions for API misuse detection; https://github.com/SRI-CSL/signal-sensor.

SimProg
Inter-project program similarity detection using graph kernels; https://github.com/SRI-CSL/simprog (Python, PyTorch).

JayHorn
Model checking for Java programs; https://github.com/jayhorn (Java).

do-like-javac
Program analysis and verification pipeline for Java projects; https://github.com/SRI-CSL/do-like-javac (Java, Scala, C, Python).

Callspace
Reliable API recommendation using likely program invariants and graph kernels; https://github.com/SRI-CSL/smac/callspace (Scala, Java).

Randoop Gradle Plugin
Gradle plugin for running Randoop unit test generator; https://github.com/SRI-CSL/randoop-gradle-plugin (Kotlin, Java).

Daikon Gradle Plugin
Gradle plugin for running Daikon invariant detector; https://github.com/SRI-CSL/daikon-gradle-plugin (Kotlin, Java).

Contributor

CHEKOFV
A framework for crowd-sourced formal verification; https://github.com/SoftwareIntrospectionLab/chekofv (Python, Nodejs, TypeScript, ActionScript).

PASCALI
A set of tools for leveraging big code analytics in program analysis, code search, and program synthesis; https://github.com/SRI-CSL/pascali (Python, Scala, Java).