Patent-Pending ML Validation
First-of-its-kind Bayesian Optimization framework that discovers worst-case hardware stress in <1% of search space at Arm
Computer Science & Mathematics Student | Aspiring Software Engineer & Machine Learning Researcher
First-of-its-kind Bayesian Optimization framework that discovers worst-case hardware stress in <1% of search space at Arm
Core AI developer for UT Austin Villa's 7v7 robot soccer team, building RL-powered agent skills that won 3rd place globally
Built end-to-end data/ML infrastructure processing 50M+ articles and 70M+ comments with 99%+ accuracy BERT models at UT's Center for Media Engagement
Hi, I'm Ayman. I'm a cs student at UT Austin (Hook 'em) pursuing a concentration in AI and ML. I've worked across 4 labs at UT as a undergrad research assistant, from building RL envs for robot soccer to implementing and benchmarking ViTs and LLMs to aid medical research. I currently do ML research at ARM to optimize post-silicon validation. I love building impactful projects and solving complex problems, and I'm extremely passionate about deep learning.
Bachelor of Science
Location: Austin, TX, USA
Major: Computer Science
Concentration: Artificial Intelligence and Machine Learning
Relevant Coursework:
ML Research Engineer (Part-Time) | Austin, TX • May 2025 – Present
Independently proposed and built patent-pending ML framework using Bayesian Optimization to automatically discover worst-case hardware stress tests, achieving 99.8th-percentile stress levels while exploring <1% of configuration space. Automated 10,000+ hours of validation testing for next-generation AI platforms.
Research Assistant | Austin, TX • Jan 2025 – Present
Developed AI-driven agent skills that helped secure 3rd place at RoboCup 2025. Built RL-based policies for walking, dribbling, and attacking in 400K-line C++ codebase, slashing training time 70% through GPU optimization.
Data Engineer Intern | North Bethesda, MD • Jan 2025 – March 2025
Built high-performance data pipelines for Legis-1 Platform, processing millions of legislative documents. Developed LLM pipelines using RAG and embeddings to analyze 500K+ legal records for AI-driven policy research.
Software Engineer, Research Assistant | Austin, TX • Sep 2023 – May 2025
Built 150M-entry dataset processing 50M+ articles and 70M+ comments. Fine-tuned BERT models achieving 99% accuracy for NLP tasks. Designed React/Flask/Firebase platform serving 1,000+ participants with 99.99% uptime.
ML Research Assistant | Austin, TX • Feb 2024 – Jan 2025
Built containerized 3D pancreas MRI segmentation pipeline on H100 supercomputer using CNNs and transformers. Achieved +12% Dice gain matching SOTA performance across 1000+ scans with 5-fold cross-validation.
Research Assistant | Austin, TX • Feb 2024 – Jan 2025
Designed multiagent LLM research project studying diagnostic consistency in medical reasoning. Applied Cohen's Kappa, Chi-square tests, and logistic regression to assess agreement and bias. Presented at UT AI Health Conference as youngest speaker.
Software Engineer Intern | Remote • Jun 2022 – Oct 2022
Optimized CRM workflows with JavaScript and RPA, achieving centralized device data framework. Refined Configuration Database, purging redundant records and presenting data-driven insights to executives.
Research Intern | Remote • Jun 2023 – May 2024
Built NLP-driven chatbot for online news engagement using deep learning. Executed text analytics with POS tagging, LIWC, and sentence embeddings. Published research at CHI 2024 conference.
Software Engineer Intern | Austin, TX • Jun 2021 – Aug 2021
Improved post-COVID loan processing workflows for small businesses using Python scripting and data visualization to streamline operations.
Summer Learning Academy | Austin, TX • Jun 2021 – Aug 2021
As youngest participant, gained exposure to AI, business strategies, and professional development while collaborating on tech-focused initiatives.
Full-stack LLM implementation featuring RoPE positional embeddings, RMSNorm, SwiGLU activations, and attention sinks. Includes training pipeline, HuggingFace fine-tuning with LoRA on SST-2/SAMSum/GSM8K, and DPO alignment.
Tech Stack: Python, PyTorch, Transformers, PEFT, WikiText-2
Click the title to visit the repository.
Complete implementation of Stable Diffusion v1 architecture from scratch, including CLIP text encoder, U-Net with cross-attention, VAE encoder/decoder, and DDPM/DDIM schedulers. Trained on custom datasets with gradient accumulation and mixed precision training.
Tech Stack: Python, PyTorch, CLIP, Diffusion Models, VAE
Click the title to visit the repository.
A dynamic book-sharing platform that allows users to explore and share books freely while empowering authors to earn more by bypassing traditional publishers.
Tech Stack: React, PostgreSQL, Django, AWS S3, Django Rest Framework, Vercel
Click the title to visit the live site.
An intelligent networking tool that allows users to find and email professionals that align with their career interests and automatically drafts and sends emails to them using the user's resume, their intent, and the reciever's experiences. It has 200+ users on it right now.
Tech Stack: Flask, React, Tailwind, OpenAI API, Vercel, Render
Click the title to visit the live site.
A web application that helps users discover LeetCode problems similar to a given one, aiding in interview preparation.
Tech Stack: React, Flask, Scikit-learn, PostgreSQL
Click the title to visit the live site.
Code X-Ray is an interactive, graph-powered codebase explorer that lets developers instantly understand large monorepos—without reading thousands of lines first. By combining static analysis with GPT-4o, it visualizes architecture, highlights complexity hotspots, and answers deep code questions like “what breaks if I touch this?” right inside your IDE or browser.
Tech Stack: React, Tailwind, OpenAI API, Vercel, Render, AST parsing
Click the title to visit the live site.
An implementation of the EAGLE: Efficient Training-Free Router for Multi-LLM Inference paper. This project builds a routing system that combines global and local model abilities via ELO ratings and OpenAI embeddings, enabling adaptive selection between GPT-4o and GPT-4o-mini. Includes dataset parsing, embedding similarity search, grid search over routing weights, and analysis of performance under non-tie and skewed distributions.
Tech Stack: Python, OpenAI API, scikit-learn, Hugging Face Datasets, Pandas, NumPy
I'm passionate about learning and keeping up with the latest advancements in deep learning, from new architectures to real-world applications.
I love spending time in the gym and hitting new maxes
I've been playing soccer since I could walk. If I'm not working, you can find me on the nearest field!
I cherish spending quality time with friends and family, whether it's a casual hangout or a special gathering.
I spend a lot of time working on my startups. Outside of coding, it's asking people I'm close with for advice and feedback on my current projects.