(Updated October 2025)
      
    Huascar Sanchez
Research Summary and Skills
    
            Computer Science Researcher with over $15$ years of experience at the intersection of AI/ML and software engineering. Proven expertise in building and operating large-scale systems for "Big Code" analytics and programming language processing. Deep knowledge of AI/ML applications in source code analysis, coding automation, and social-cyber security. Strong background in modeling, analyzing, and detecting social-cyber operations targeting open-source ecosystems. My recent work leverages multi-LLM collaboration, LLM chemistry estimation (LLM matchmaking), and multi-LLM consensus to advance the development of compound AI systems—driving progress in workflow-aware reasoning, multi-LLM orchestration, data processing, knowledge extraction, and source code analysis. Skilled in prototyping novel solutions, along with a track record of cross-disciplinary leadership, and effective communication across technical and non-technical teams. Passionate about turning cutting-edge research into practical, scalable features that drive automation and productivity.
        
 
    Skills
    Multi-LLM Collaboration Optimization
          Programmer Productivity
          Multi-LLM Recommendation
          Automated Software Engineering
          Machine Learning on Source Code
          Source Code Curation and Analysis
          Social-Cyber Security
          Data Mining (Programming Language Processing)
          
        Research Interests
  Multi-LLM collaboration optimization and LLMs chemistry estimation for compound AI and software tasks.
        Social-cyber security for protecting the integrity of open-source ecosystems.
        Source code analysis and delta-debugging-inspired methods for identifying and characterizing emergent ML model misbehaviors.
        Software engineering for AI/ML, AI/ML for automated software engineering.
        Crowdsourcing, re-imagined with collaborative LLMs, software tools, and humans.
    Current Position
    2024 – now
        
            SRI International · Menlo Park, CA 
Senior Computer Scientist, Computer Science Laboratory
Leading applied research in AI/ML for code generation and analysis, focusing on optimal multi-LLM collaboration for complex, compound AI tasks. Establishing technical direction and developing principled methods to evaluate response quality, task accuracy, and inter-model dynamics ("chemistry") in structured collaborative tasks—advancing the design of optimal LLM ensembles.
    Senior Computer Scientist, Computer Science Laboratory
Leading applied research in AI/ML for code generation and analysis, focusing on optimal multi-LLM collaboration for complex, compound AI tasks. Establishing technical direction and developing principled methods to evaluate response quality, task accuracy, and inter-model dynamics ("chemistry") in structured collaborative tasks—advancing the design of optimal LLM ensembles.
Projects
    2025 – now
        
            Reliable Multi-AI Collaboration (Team: SRI) 
Leading the development of a framework for optimizing multi-LLM collaboration in complex compound AI tasks by estimating LLM chemistry—and incorporating Bayes-driven consensus mechanisms for uncertainty quantification to enable the formation of efficient, effective, and stable LLM ensembles.
    Leading the development of a framework for optimizing multi-LLM collaboration in complex compound AI tasks by estimating LLM chemistry—and incorporating Bayes-driven consensus mechanisms for uncertainty quantification to enable the formation of efficient, effective, and stable LLM ensembles.
2024 – now
        
            ARPA-H PARADIGM (Team: SRI) 
Developing multi-LLM strategies and workflows for extracting data types and constraints, inferring JSON schemas, and synthesizing schema-compliant code from diverse healthcare specifications.
    Developing multi-LLM strategies and workflows for extracting data types and constraints, inferring JSON schemas, and synthesizing schema-compliant code from diverse healthcare specifications.
2024 – now
        
            IARPA ReSCIND (Team: SRI, IHMC, TwoSix, UFL, RAD, GMU, ColumbiaU, SimSpace) 
Leading the development of Cyber-Specific Computational Cognitive Models using Self-Supervised Machine Learning, Information Foraging Theory, and Fuzzy Cognitive Maps.
    Leading the development of Cyber-Specific Computational Cognitive Models using Self-Supervised Machine Learning, Information Foraging Theory, and Fuzzy Cognitive Maps.
2024 – now
        
            ARPA-H DIGIHEALS (Team: SRI, Confidencial Inc., Homeward Health) 
Advancing the automatic generation of formal representations from natural language using neuro-symbolic AI, and leading the development of multi-LLM (Agentic) workflows for healthcare data analysis, formalization, and composition.
    Advancing the automatic generation of formal representations from natural language using neuro-symbolic AI, and leading the development of multi-LLM (Agentic) workflows for healthcare data analysis, formalization, and composition.
Previous projects*
    * Order of projects is arbitrary.
    
Education
	2015
		
			University of California Santa Cruz · Ph.D. in Computer Science 
Dissertation: Source code Curation Tooling for the Code Forager
Advisor: Prof. Jim Whitehead
	Dissertation: Source code Curation Tooling for the Code Forager
Advisor: Prof. Jim Whitehead
2006
		
			San Jose State University · M.S. in Software Engineering 
MS Thesis title: Building Systems Using Patterns: Creating Knowledge Maps
Advisor: Prof. M.E. Fayad
	MS Thesis title: Building Systems Using Patterns: Creating Knowledge Maps
Advisor: Prof. M.E. Fayad
2001
		
			Catholic University of Nicaragua · B.S. in Information Systems Engineering 
		
	Professional Employment
    2021 – 2023
        
            SRI International · Menlo Park, CA 
Advanced Computer Scientist II, Computer Science Laboratory
Continuing to lead DARPA Social Cyber research to secure open-source software supply chains through dynamic behavior embeddings that model social-cyber threats, while developing multimodal, multi-task deep learning models for open-source software introspection and error profiling.
    Advanced Computer Scientist II, Computer Science Laboratory
Continuing to lead DARPA Social Cyber research to secure open-source software supply chains through dynamic behavior embeddings that model social-cyber threats, while developing multimodal, multi-task deep learning models for open-source software introspection and error profiling.
2019 – 2021
        
            SRI International · Menlo Park, CA 
Advanced Computer Scientist I, Computer Science Laboratory
Guiding the integration of formal verification and program analysis for continuous assurance case construction. Leading DARPA SocialCyber as Principal Investigator, shaping project vision, delivering hybrid AI methods for securing software supply chains, mentoring junior researchers, and coordinating reporting across stakeholders.
    Advanced Computer Scientist I, Computer Science Laboratory
Guiding the integration of formal verification and program analysis for continuous assurance case construction. Leading DARPA SocialCyber as Principal Investigator, shaping project vision, delivering hybrid AI methods for securing software supply chains, mentoring junior researchers, and coordinating reporting across stakeholders.
2018 – 2019
        
            SRI International · Menlo Park, CA 
Advanced Computer Scientist, Computer Science Laboratory
Applying machine learning, large-scale graph mining, and dynamic program analysis to open-source software security, and developing integration and verification methods for systems of systems in distributed environments.
    Advanced Computer Scientist, Computer Science Laboratory
Applying machine learning, large-scale graph mining, and dynamic program analysis to open-source software security, and developing integration and verification methods for systems of systems in distributed environments.
2017 – 2018
        
            SRI International · Menlo Park, CA 
Computer Scientist, Computer Science Laboratory
Leveraging Big Code analytics and machine learning on source code for code-to-code search, API recommendation, code synthesis, and source code analysis.
    Computer Scientist, Computer Science Laboratory
Leveraging Big Code analytics and machine learning on source code for code-to-code search, API recommendation, code synthesis, and source code analysis.
2015 – 2017
        
            SRI International · Menlo Park, CA 
Postdoctoral Fellow, Computer Science Laboratory, Host: Hassen Saïdi
Developing learning-to-rank models for large-scale code search and recommendation, and lightweight source code summarization techniques for improved source code understanding.
    Postdoctoral Fellow, Computer Science Laboratory, Host: Hassen Saïdi
Developing learning-to-rank models for large-scale code search and recommendation, and lightweight source code summarization techniques for improved source code understanding.
2014 – 2015
        
            SRI International · Menlo Park, CA 
Visiting Research Fellow, Computer Science Laboratory, Host: John Murray
Making formal verification scalable and accessible to broader audiences via crowd-sourced approaches—crowd-sourced formal verification—that enable participation without formal methods expertise.
    Visiting Research Fellow, Computer Science Laboratory, Host: John Murray
Making formal verification scalable and accessible to broader audiences via crowd-sourced approaches—crowd-sourced formal verification—that enable participation without formal methods expertise.
2013 – 2015
        
          University of California Santa Cruz · Santa Cruz, CA 
Research Assistant, Software Introspection Laboratory
Empirical validation of crowd-sourced formal verification methods enabled by online games.
      Research Assistant, Software Introspection Laboratory
Empirical validation of crowd-sourced formal verification methods enabled by online games.
2012 – 2013
        
          University of California Santa Cruz · Santa Cruz, CA 
Research Assistant, Information Retrieval Laboratory
Developing latent representation learning methods from social media data for multimodal cyberbullying detection.
      Research Assistant, Information Retrieval Laboratory
Developing latent representation learning methods from social media data for multimodal cyberbullying detection.
2011 – 2012
        
          University of California Santa Cruz · Santa Cruz, CA 
Research Assistant, Software Introspection Laboratory
Developing AI-guided, mixed-initiative content creation tools for automatic robot components design.
      Research Assistant, Software Introspection Laboratory
Developing AI-guided, mixed-initiative content creation tools for automatic robot components design.
2010 – 2011
        
          University of California Santa Cruz · Santa Cruz, CA 
Research Assistant, Database Group
Advancing semi-automatic index tuning in DBMSs to make it easier and more effective.
      Research Assistant, Database Group
Advancing semi-automatic index tuning in DBMSs to make it easier and more effective.
2010 – 2012
        
          Infomotor, Inc. · San Francisco, CA 
Software Engineer (Consultant)
Streamlining both reporting and presentation of key performance metrics to deliver data that is both understandable and actionable.
      Software Engineer (Consultant)
Streamlining both reporting and presentation of key performance metrics to deliver data that is both understandable and actionable.
2006 – 2009
        
          Pearson VUE · Bloomington, MN 
Software Engineer
Developing and maintaining distributed applications that deliver over four million computer-based tests annually worldwide, supporting clients in licensure, certification, and assessment markets.
      Software Engineer
Developing and maintaining distributed applications that deliver over four million computer-based tests annually worldwide, supporting clients in licensure, certification, and assessment markets.
Publications
  Conference Proceedings
  2024
    
      Continuous Safety & Security Evidence Generation, Curation and Assurance Case Construction Using the Evidential Tool Bus.
N. Shankar, M. Kim, H. Sanchez, H. Ruess, T. Beyene, R. Bouchekir, D. Bhatt, S. Varadarajan, A. Murugesan, H. Ren, I. Hong-Wong, K. Siu, S. Varanasi, M. D. Ernst.
43rd AIAA/IEEE Digital Avionics Systems Conference (DASC), 2024.
  N. Shankar, M. Kim, H. Sanchez, H. Ruess, T. Beyene, R. Bouchekir, D. Bhatt, S. Varadarajan, A. Murugesan, H. Ren, I. Hong-Wong, K. Siu, S. Varanasi, M. D. Ernst.
43rd AIAA/IEEE Digital Avionics Systems Conference (DASC), 2024.
2016
    
      JayHorn: A Framework for Verifying Java programs.
T. Kahsai, P. Rümmer, H. Sanchez, and M. Schäf
CAV: 28th International Conference on Computer Aided Verification, 2016.
  T. Kahsai, P. Rümmer, H. Sanchez, and M. Schäf
CAV: 28th International Conference on Computer Aided Verification, 2016.
2016
    
      Design Lessons From Binary Fission: A Crowd Sourced Game for Precondition Discovery.
K. Compton, H. Logas, J. Osborn, C. Chakrabortti, K. Coffman, D. Fava, D. Lederle-Ensign, Z. Lin, J. Mazeika, A. Mobramaein, J. Pagnutti, H. Sanchez, J. Whitehead, J. Murray, and B. Laurel.
CFDG: 11th International Joint Conference of DiGRA and FDG, 2016.
  K. Compton, H. Logas, J. Osborn, C. Chakrabortti, K. Coffman, D. Fava, D. Lederle-Ensign, Z. Lin, J. Mazeika, A. Mobramaein, J. Pagnutti, H. Sanchez, J. Whitehead, J. Murray, and B. Laurel.
CFDG: 11th International Joint Conference of DiGRA and FDG, 2016.
2016
    
      Detecting Similar Programs via the Weisfeiler-Lehman Graph Kernel.
W. Li, H. Saïdi, H. Sanchez, M. Schäf, and P. Schweitzer.
ICSR: 15th International Conference on Software Reuse, 2016.
  W. Li, H. Saïdi, H. Sanchez, M. Schäf, and P. Schweitzer.
ICSR: 15th International Conference on Software Reuse, 2016.
2016
    
      Multistaging to Understand: Distilling the Essence of Java Code Examples.
H. Sanchez, J. Whitehead, and M. Schäf.
ICPC: 24th IEEE International Conference on Program Comprehension, 2016.
  H. Sanchez, J. Whitehead, and M. Schäf.
ICPC: 24th IEEE International Conference on Program Comprehension, 2016.
2015
    
      Source Code Curation on StackOverflow: The Vesperin System.
H. Sanchez and J. Whitehead.
ICSE: 37th IEEE/ACM International Conference on Software Engineering, 2015.
  H. Sanchez and J. Whitehead.
ICSE: 37th IEEE/ACM International Conference on Software Engineering, 2015.
2014
    
      Software Verification Games: Designing Xylem, the Code of Plants.
H. Logas, J. Whitehead, M. Mateas, R. Vallejos, L. Scott, D. Shapiro, J. Murray, K. Compton, J. Osborn, O. Salvatore, Z. Lin, H. Sanchez, M. Shavlovsky, D. Cetina, S. Clementi, and C. Lewis.
FDG: 9th International Conference on the Foundations of Digital Games, 2014.
  H. Logas, J. Whitehead, M. Mateas, R. Vallejos, L. Scott, D. Shapiro, J. Murray, K. Compton, J. Osborn, O. Salvatore, Z. Lin, H. Sanchez, M. Shavlovsky, D. Cetina, S. Clementi, and C. Lewis.
FDG: 9th International Conference on the Foundations of Digital Games, 2014.
2014
    
      Xylem: The Code of Plants.
H. Logas, J. Whitehead, M. Mateas, R. Vallejos, L. Scott, D. Shapiro, J. Murray, K. Compton, J. Osborn, O. Salvatore, Z. Lin, H. Sanchez, M. Shavlovsky, C. Lewis, D. Cetina, and S. Clementi.
FDG: 9th International Conference on the Foundations of Digital Games, 2014.
  H. Logas, J. Whitehead, M. Mateas, R. Vallejos, L. Scott, D. Shapiro, J. Murray, K. Compton, J. Osborn, O. Salvatore, Z. Lin, H. Sanchez, M. Shavlovsky, C. Lewis, D. Cetina, and S. Clementi.
FDG: 9th International Conference on the Foundations of Digital Games, 2014.
2013
    
      SNIPR: Complementing Code Search with Code Retargeting Capabilities.
H. Sanchez.
ICSE: 35th International Conference on Software Engineering, 2013.
  H. Sanchez.
ICSE: 35th International Conference on Software Engineering, 2013.
2012
    
      Kaizen: A Semi-Automatic Index Advisor.
I. Jimenez, H. Sanchez, Q.T. Tran, and N. Polyzotis.
SIGMOD: The 2012 ACM International Conference on Management of Data, 2012.
  I. Jimenez, H. Sanchez, Q.T. Tran, and N. Polyzotis.
SIGMOD: The 2012 ACM International Conference on Management of Data, 2012.
2012
    
      Twitter Bullying Detection.
H. Sanchez and S. Kumar.
NSDI: Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation, 2012.
  H. Sanchez and S. Kumar.
NSDI: Proceedings of the 8th USENIX Conference on Networked Systems Design and Implementation, 2012.
2010
    
      Knowledge Maps — Fundamentally Modular Approach to Software Architecture, Design, Development and Deployment.
M. E. Fayad, H.A. Sanchez, and S.K. Singh.
SEDE: 19th International Conference on Software Engineering and Data Engineering, 2010.
  M. E. Fayad, H.A. Sanchez, and S.K. Singh.
SEDE: 19th International Conference on Software Engineering and Data Engineering, 2010.
2006
    
      The Branding Analysis Pattern.
M. E. Fayad, H.A. Sanchez, and S.K. Singh.
AICCSA: Proceedings of the IEEE International Conference on Computer Systems and Applications, 2006.
  M. E. Fayad, H.A. Sanchez, and S.K. Singh.
AICCSA: Proceedings of the IEEE International Conference on Computer Systems and Applications, 2006.
2005
    
      Towards Scalable and Adaptable Software Architectures.
M.E. Fayad, H.S. Hamza, and H.A. Sanchez.
IRI: The 2005 IEEE International Conference on Information Reuse and Integration, 2005.
  M.E. Fayad, H.S. Hamza, and H.A. Sanchez.
IRI: The 2005 IEEE International Conference on Information Reuse and Integration, 2005.
2005
    
      The Dynamic Mapping Design Pattern.
J. Gresh, J. McKim, and H. Sanchez.
PLOP: 12th Pattern Language of Programs Conference, 2005.
  J. Gresh, J. McKim, and H. Sanchez.
PLOP: 12th Pattern Language of Programs Conference, 2005.
2004
    
      A Pattern Language for CRC Cards.
M. Fayad, H. Sanchez, and H. Hamza.
PLOP: 11th Conference on Pattern Languages of Programs, 2004.
  M. Fayad, H. Sanchez, and H. Hamza.
PLOP: 11th Conference on Pattern Languages of Programs, 2004.
2003
    
      The Sampling Analysis Pattern.
H.A. Sanchez, B. Lai, and M.E. Fayad.
IRI: IEEE International Conference on Information Reuse and Integration, 2003.
  H.A. Sanchez, B. Lai, and M.E. Fayad.
IRI: IEEE International Conference on Information Reuse and Integration, 2003.
Workshop Papers
  2023
    
      Software Introspection for Signaling Social-Cyber Operations.
H. Sanchez and B. Hitaj.
DESTION: 5th Workshop on Design Automation for CPS and IoT, 2023.
  H. Sanchez and B. Hitaj.
DESTION: 5th Workshop on Design Automation for CPS and IoT, 2023.
2016
    
      Putting Your Passwords on Self Destruct Mode: Beating Password Fatigue.
H. Sanchez, J. Murray, and D. Sanchez.
SOUPS: 12th Symposium on Usable Privacy and Security, 2016.
  H. Sanchez, J. Murray, and D. Sanchez.
SOUPS: 12th Symposium on Usable Privacy and Security, 2016.
2015
    
      BotPrint: Casual Robotic Evolution.
K. Compton, D. Lederle-Ensign, Z. Lin, J. Mazeika, A. Mobramaein, J. Pagnutti, H. Sanchez, and J. Whitehead.
WCCG: 1st Workshop on Computational Creativity and Games, 2015.
  K. Compton, D. Lederle-Ensign, Z. Lin, J. Mazeika, A. Mobramaein, J. Pagnutti, H. Sanchez, and J. Whitehead.
WCCG: 1st Workshop on Computational Creativity and Games, 2015.
Journal Articles
  2011
    
      Benchmarking Online Index-Tuning Algorithms.
I. Jimenez, J. LeFevre, N. Polyzotis, H. Sanchez, and K. Schnaitter.
IEEE Data Engineering Bulletin, 2011.
  I. Jimenez, J. LeFevre, N. Polyzotis, H. Sanchez, and K. Schnaitter.
IEEE Data Engineering Bulletin, 2011.
Books
  2014
    
      Software Patterns, Knowledge Maps, and Domain Analysis.
M. Fayad, H.A. Sanchez, S.G.K. Hegde, A. Basia, and A. Vakil.
CRC Press, 2014. ISBN 9781466571433
  M. Fayad, H.A. Sanchez, S.G.K. Hegde, A. Basia, and A. Vakil.
CRC Press, 2014. ISBN 9781466571433
Reports and Other Publications
  2025
    
      LLM Chemistry Estimation for Multi-LLM Recommendation.
H. Sanchez and B. Hitaj.
ArXiv [Cs.LG]. arXiv:2510.03930 (Under Review)
  H. Sanchez and B. Hitaj.
ArXiv [Cs.LG]. arXiv:2510.03930 (Under Review)
2022
    
      DesCert: Design for Certification.
N. Shankar, D. Bhatt, M. Ernst, M. Kim, S. Varadarajan, S. Millstein, J. Navas, J. Biatek, H. Sanchez, A. Murugesan, H. Ren.
Technical report, 2022. arxiv:2203.15178
  N. Shankar, D. Bhatt, M. Ernst, M. Kim, S. Varadarajan, S. Millstein, J. Navas, J. Biatek, H. Sanchez, A. Murugesan, H. Ren.
Technical report, 2022. arxiv:2203.15178
2022
    
      Trust in Motion: Capturing Trust Ascendancy in Open-Source Projects using Hybrid AI.
H. Sanchez and B. Hitaj.
ArXiv [Cs.SE]. arxiv:2210.02656
  H. Sanchez and B. Hitaj.
ArXiv [Cs.SE]. arxiv:2210.02656
2021
    
      Leveraging Team Dynamics to Predict Open-source Software Projects' Susceptibility to Social Engineering Attacks.
L. Giovanini, D. Oliveira, H. Sanchez, D. Shands.
ArXiv [Cs.SE]. arxiv:2106.16067
  L. Giovanini, D. Oliveira, H. Sanchez, D. Shands.
ArXiv [Cs.SE]. arxiv:2106.16067
Patents
  2023
    
      Intelligent Collaborative Project Management.
D. Sanchez, H. Sanchez, H. Saïdi.
United States patent 11610173, 2023.
  D. Sanchez, H. Sanchez, H. Saïdi.
United States patent 11610173, 2023.
2022
    
      Multipath verification of data transforms in a system of systems.
B. Dutertre, S. Jha, H. Sanchez, P. Lincoln, E. M. Pearson, R. Dean, I. A. Mason.
United States patent 2022/0197881, 2022.
  B. Dutertre, S. Jha, H. Sanchez, P. Lincoln, E. M. Pearson, R. Dean, I. A. Mason.
United States patent 2022/0197881, 2022.
Talks
  2025
    
      Collaborative AI for Data Type Extraction and JSON Schema Learning.
ARPA-H PARADIGM, Virtual meeting, June, 2025.
    
  ARPA-H PARADIGM, Virtual meeting, June, 2025.
2025
    
      Multi-level Cyber-Specific Computational Cognitive Models (C3Ms).
IARPA ReSCIND, Virtual meeting, May, 2025.
    
  IARPA ReSCIND, Virtual meeting, May, 2025.
2023
    
      Software Introspection for Signaling Social-Cyber Operations.
5th Workshop on Design Automation for CPS and IoT, TX. 2023.
    
  5th Workshop on Design Automation for CPS and IoT, TX. 2023.
2022
    
      Graph-based Pattern Mining in Code Revisions for API Misuse Detection.
DARPA SocialCyber, Virtual meeting, 2022.
    
  DARPA SocialCyber, Virtual meeting, 2022.
2022
    
      Temporal Developer Activity Embeddings for Modeling Evolving Developer Behavior in Social Coding Environments.
DARPA SocialCyber, Virtual meeting, 2022.
    
  DARPA SocialCyber, Virtual meeting, 2022.
2022
    
      SIGNAL: Protecting the Integrity of Open-source Projects using Hybrid AI.
Computer Science Laboratory's Future Perfect Seminar, Virtual meeting. SRI International, Menlo Park, CA, 2022.
    
  Computer Science Laboratory's Future Perfect Seminar, Virtual meeting. SRI International, Menlo Park, CA, 2022.
2021
    
      A Multi-Task Learning Approach for Software Development Tasks Prediction.
DARPA SocialCyber, Virtual meeting, 2021.
    
  DARPA SocialCyber, Virtual meeting, 2021.
2021
    
      Change Point Analysis of Evolutionary Signatures in Social-Cyber Attacks.
DARPA SocialCyber, Virtual meeting, 2021.
    
  DARPA SocialCyber, Virtual meeting, 2021.
2020
    
      Continuous Certification of Open-source Software using Baseline DesCert.
DARPA ARCOS, Virtual meeting, 2020.
    
  DARPA ARCOS, Virtual meeting, 2020.
2019
    
      Applying Machine Learning to Open-Source Code Retrieval and Verification.
Digital Foundations and Mathematics. Sandia National Laboratories. Livermore, CA, 2019.
    
  Digital Foundations and Mathematics. Sandia National Laboratories. Livermore, CA, 2019.
2018
    
      API Workflow Recommendations for Codebase Exploration.
DARPA MUSE. University of Texas at Austin, TX, 2018.
    
  DARPA MUSE. University of Texas at Austin, TX, 2018.
2017
    
      Scaling Inter-Project Software Analysis.
DARPA MUSE, Massachusetts Institute of Technology (MIT), MA, 2017.
    
  DARPA MUSE, Massachusetts Institute of Technology (MIT), MA, 2017.
2015
    
      Source Code Curation Tooling for the Code Forager.
Computer Science Laboratory Seminars. SRI International, Menlo Park, CA, 2015.
    
  Computer Science Laboratory Seminars. SRI International, Menlo Park, CA, 2015.
Service
  Service to the Field
  2025
    
  2023
    
  2022
    
      Mining Challenge Program Committee.
19th International Conference on Mining Software Repositories (MSR 2022), Pittsburgh, PA.
  19th International Conference on Mining Software Repositories (MSR 2022), Pittsburgh, PA.
2020
    
      Artifact Evaluation Program Committee.
International Symposium on Software Testing and Analysis (ISSTA 2020), Los Angeles, CA.
  International Symposium on Software Testing and Analysis (ISSTA 2020), Los Angeles, CA.
2017
    
      Paper Reviewer.
The 9th NASA Formal Methods Symposium (NFM 2017), Moffett Field, CA.
  The 9th NASA Formal Methods Symposium (NFM 2017), Moffett Field, CA.
2016
    
      Paper Reviewer.
The 13th International Colloquium on Theoretical Aspects of Computing (ICTAC 2016), Taipei, Taiwan.
  The 13th International Colloquium on Theoretical Aspects of Computing (ICTAC 2016), Taipei, Taiwan.
2014
    
      Paper Reviewer.
The 11th Working Conference on Mining Software Repositories (MSR 2014), Hyderabad India.
  The 11th Working Conference on Mining Software Repositories (MSR 2014), Hyderabad India.
Mentorship
  2022
    
      Akshith Gunasekaran. PhD Student (Oregon State University)
Summer Internship, SRI International, Menlo Park, CA.
  Summer Internship, SRI International, Menlo Park, CA.
2017
    
      Di Yang. PhD Student (University of California Irvine)
Summer Internship, SRI International, Menlo Park, CA.
  Summer Internship, SRI International, Menlo Park, CA.
2015
    
      Xu Zhang. PhD Student (University of California Santa Cruz).
    
  Software Development*
  * Some projects listed above are private on GitHub; please contact me for access details.
  Committer
  
      LLM Chemistry
LLM Chemistry Estimation for Multi-LLM Recommendation; https://github.com/SRI-CSL/llm-chemistry (Python, PyTorch).
  LLM Chemistry Estimation for Multi-LLM Recommendation; https://github.com/SRI-CSL/llm-chemistry (Python, PyTorch).
      Aligner
LLM Workflows for JSON Schema learning; https://github.com/SRI-CSL/aligner (Python, PyTorch, Langchain).
  LLM Workflows for JSON Schema learning; https://github.com/SRI-CSL/aligner (Python, PyTorch, Langchain).
      Collaborative LLMs
Optimal Multi-AI Collaboration; https://github.com/hsanchez/SRI-CSL/collm (Python, PyTorch, Ollama).
  Optimal Multi-AI Collaboration; https://github.com/hsanchez/SRI-CSL/collm (Python, PyTorch, Ollama).
      Yellow-Patch
A source code and patch analysis library; https://github.com/hsanchez/yellow-patch (Python, PyTorch, HuggingFace's Accelerate).
  A source code and patch analysis library; https://github.com/hsanchez/yellow-patch (Python, PyTorch, HuggingFace's Accelerate).
      Evidential Tool Bus
A framework for scaling assurance case construction, program verification and analysis; https://github.com/SRI-CSL/etb2 (Java, Python).
  A framework for scaling assurance case construction, program verification and analysis; https://github.com/SRI-CSL/etb2 (Java, Python).
      DesCert
A framework for continuous safety & security evidence generation, curation and assurance case construction; https://github.com/SRI-CSL/DesCert (Java, Python, C).
  A framework for continuous safety & security evidence generation, curation and assurance case construction; https://github.com/SRI-CSL/DesCert (Java, Python, C).
      SIGNAL
Securing open-source software supply chains against social-cyber threats using hybrid AI methods; https://github.com/SRI-CSL/signal-public (Python, PyTorch).
  Securing open-source software supply chains against social-cyber threats using hybrid AI methods; https://github.com/SRI-CSL/signal-public (Python, PyTorch).
      SENSOR
Graph-based pattern mining in code revisions for API misuse detection; https://github.com/SRI-CSL/signal-sensor.
  Graph-based pattern mining in code revisions for API misuse detection; https://github.com/SRI-CSL/signal-sensor.
      SimProg
Inter-project program similarity detection using graph kernels; https://github.com/SRI-CSL/simprog (Python, PyTorch).
  Inter-project program similarity detection using graph kernels; https://github.com/SRI-CSL/simprog (Python, PyTorch).
      do-like-javac
Program analysis and verification pipeline for Java projects; https://github.com/SRI-CSL/do-like-javac (Java, Scala, C, Python).
  Program analysis and verification pipeline for Java projects; https://github.com/SRI-CSL/do-like-javac (Java, Scala, C, Python).
      Callspace
Reliable API recommendation using likely program invariants and graph kernels; https://github.com/SRI-CSL/smac/callspace (Scala, Java).
  Reliable API recommendation using likely program invariants and graph kernels; https://github.com/SRI-CSL/smac/callspace (Scala, Java).
      Randoop Gradle Plugin
Gradle plugin for running Randoop unit test generator; https://github.com/SRI-CSL/randoop-gradle-plugin (Kotlin, Java).
  Gradle plugin for running Randoop unit test generator; https://github.com/SRI-CSL/randoop-gradle-plugin (Kotlin, Java).
      Daikon Gradle Plugin
Gradle plugin for running Daikon invariant detector; https://github.com/SRI-CSL/daikon-gradle-plugin (Kotlin, Java).
  Gradle plugin for running Daikon invariant detector; https://github.com/SRI-CSL/daikon-gradle-plugin (Kotlin, Java).
Contributor
  
      CHEKOFV
A framework for crowd-sourced formal verification; https://github.com/SoftwareIntrospectionLab/chekofv (Python, Nodejs, TypeScript, ActionScript).
  A framework for crowd-sourced formal verification; https://github.com/SoftwareIntrospectionLab/chekofv (Python, Nodejs, TypeScript, ActionScript).
      PASCALI
A set of tools for leveraging big code analytics in program analysis, code search, and program synthesis; https://github.com/SRI-CSL/pascali (Python, Scala, Java).
  A set of tools for leveraging big code analytics in program analysis, code search, and program synthesis; https://github.com/SRI-CSL/pascali (Python, Scala, Java).