About
About Youssef Ateya
Welcome to my personal website! I’m Youssef Ateya, a passionate Software Engineer with a strong background in Computer Science and a Minor in Mathematics from the University of Houston. My journey in tech has been focused on building robust, scalable, and intelligent solutions.
Education
University Of Houston - Houston, TX Bachelor of Science in Computer Science, Minor in Mathematics
- Aug. 2022 - May 2025*
- Relevant Coursework: Data Structures and Algorithms, Operating Systems, Computer Architecture, Automata Theory, Database Systems*
Experience
Software Engineer – Digital Wildcatters
June 2024 – July 2025
- Architected and deployed RAG-powered LLMs for 5 enterprise energy clients, processing 100,000+ oil & gas documents across PDFs, Word, Excel, and PowerPoint using distributed microservices with Kafka.
- Engineered end-to-end Azure ML pipelines in Python leveraging Document Intelligence, Computer Vision, and Cognitive Search to automate ingestion and retrieval, achieving 3× faster document search.
- Developed full-stack features for Collide.io using Ruby on Rails, React, and JavaScript; implemented real-time chat, caching, and analytics dashboards in a Linux production environment.
- Containerized backend and AI services with Docker and deployed via Kubernetes, improving scalability and reliability of production workloads.
- Fine-tuned transformer models in PyTorch for domain-specific retrieval and summarization; integrated custom embeddings and metadata tagging into RAG pipelines.
- Built internal monitoring tools using FastAPI and Prometheus to visualize ingestion throughput, latency, and system health.
- Created comprehensive test suites using pytest, Jest, and RSpec, achieving 90% coverage; automated builds and deployments via Jenkins and GitHub Actions.
- Collaborated cross-functionally with product managers, data scientists, and 5 enterprise clients to align feature development with business goals.
Meta MLH Fellow — Compiler Systems – Meta
June 2024 – September 2024
- Optimized Clang/LLVM compiler passes for 30% faster compilation in specific workloads, demonstrating measurable impact on developer productivity.
- Collaborated with Meta’s Programming Languages Research team on compiler performance experiments and regression testing pipelines.
- Authored and published hundreds of pages of LLVM and Clangd documentation used by 100+ new contributors, showing leadership in open-source knowledge sharing.
- Implemented automated cross-platform build and test workflows (Linux, macOS, Windows) using CMake and LLVM’s LIT testing framework.
- Led weekly code review sessions within fellowship cohorts to promote best practices and mentorship-driven collaboration.
Clang/LLVM Open Source Contributor – Open Source
September 2024 – Present
- Contributing to Clangd, Clang-Tidy, and ClangIR under the LLVM project, improving diagnostics, static analysis, and IR optimization workflows.
- Enhanced developer experience for VS Code users via new Clangd Language Server features used by 2M+ developers globally.
- Implemented and tested new code analysis checks and refactoring tools merged into Clang-Tidy.
- Authored design discussions and documentation in collaboration with LLVM maintainers across multiple subprojects.
Projects
University of Houston Cougar Chronicles | HTML/CSS, Node.JS, MySQL, Chart.JS, Azure (Github)
- Architected comprehensive library management system with automated book tracking, real-time inventory monitoring, and advanced analytics dashboard featuring interactive charts and usage statistics for administrators.
DEltectives, HackTX 2023 Winner | Python, Flask, React, Node.js (Github)
- Developed award-winning diversity and inclusion platform with intelligent campus matching algorithms, real-time geolocation services, and comprehensive user analytics that successfully connected 200+ students during the hackathon event.
Technical Skills
Proficient in: Python, JavaScript, Ruby, C/C++, Ruby on Rails, React, Node.js, Azure (AI Services, Document Intelligence), Docker, MySQL, Git
Experience with: Swift, Rust, Kotlin, Angular, Flask, AWS EC2, Kubernetes, PostgreSQL, MongoDB, Qdrant, NumPy, Pandas, Hugging Face, AI/ML: Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Vector Databases, OCR, Haystack