Allen Thomas
+1 217-298-6572 | misalignedmodel.com | github.com/AllenThomasDev
allenthomasdev@gmail.com | linkedin.com/in/AllenThomasDev
AI Engineer with 4+ years experience in large-scale systems, specializing in LLM applications and ML infrastructure
Education
University of Illinois at Urbana-Champaign
Master's in Computer Science | 3.85/4.0
August 2024 - May 2026
Distributed Systems, Systems for Gen AI, Topics in LLM Agents, Deep Learning
University of Pune
Bachelor's in Computer Engineering | 3.40/4.0
August 2017 - May 2021
Reinforcement Learning
Experience
Full-Stack Engineer
Cerebion LLC - Post-quantum cryptography compliance platform
May 2025 - Present
- Developing AI-powered code refactoring platform to address enterprise post-quantum cryptography compliance requirements, targeting organizations facing 2030+ migration deadlines.
- Built ML engine detecting 25+ cryptographic vulnerability patterns with 90%+ accuracy on test datasets.
- Developed AI-powered code analysis platform using LLMs and Ghidra decompilation, reducing manual code review time by 80% for post-quantum cryptography compliance.
- Implemented multi-language parser enabling analysis across 10+ programming languages, expanding compatibility from Java-only to enterprise-wide coverage.
Software Engineer
Helpshift - Customer support platform installed on over 2 billion devices
June 2022 - June 2024
- Raised analytics uptime from 99.0% to 99.99% and cut $250,000/yr by leading analytics infrastructure migration to AWS
- Migrated analytics pipelines from HBase to Redshift, enabling 10× traffic growth for 200+ customers with zero downtime
- Eliminated stream processing bottlenecks affecting real-time analytics by migrating legacy Storm infrastructure to Flink, reducing event latency by 35% for 40K+ support agents.
- Preserved 350+ TB of historical data during migration, maintaining 6+ years of customer analytics access
- Established Airflow standards and documentation across 5+ teams, saving 15 developer hours weekly
- Reduced ad-hoc engineering data requests by 40% by implementing Metabase self-service analytics platform
- Mentored 10+ hires on coding practices and system architecture, reducing time to first release by 35% compared to previous year.
Projects
AI Home & Appliance Management
May 2025
Presented at Chicago Booth GNVC Finals - React Native, TypeScript, FastAPI, LLMs
- Shipped a react prototype to Global New Venture Challenge at Chicago Booth School of Business judges in under 3 weeks.
- Delivered production-grade backend with async workflows, Supabase persistence, and resilient GPS/camera upload flows
- Engineered image-based appliance recognition powered by LLMs, reaching 85% identification accuracy through image processing and structured prompt templates.
- Designed type-safe APIs with Pydantic and implemented real-time sync powering a multi-property maintenance dashboard.
Control Vector-Based LLM Steering
March 2025
LLM Steering, Behavioral Analysis
- Implemented control vector techniques to steer LLM behavior across personalities to modify model outputs.
- Built pipeline to analyze model responses at varying control strengths and evaluation methods for behavioral analysis.
Distributed Stream Processing Framework
Oct - Nov 2024
Golang, Distributed Systems, Fault Tolerance
- Built and deployed HyDFS, a custom fault-tolerant distributed file system supporting replication, consistent hashing, and automated recovery, providing the foundational storage layer for distributed data processing.
- Designed a high-throughput, distributed stream processing framework with exactly-once semantics on top of HyDFS, enabling reliable real-time data handling across multiple nodes.
Awards & Research
Agentic Research Code Reproducibility System
October 2024
Autonomous Research Validation - Docker, Google Gemini, OpenAI
- Developed autonomous system that reproduced 12 out of 15 academic papers with zero human intervention, reducing research validation time from weeks to hours through intelligent Docker environment reconstruction.
Multi-Agent Reinforcement Learning: Hide and Seek
2021
IEEE Publication - Reinforcement Learning, Multi-Agent Systems
- Researched and implemented multiple reinforcement learning algorithms for multi-agent systems, developing a novel hide-and-seek simulation environment inspired by OpenAI research on Multi-agent Autocurricula
Skills
Languages: Java, Python, Golang, Clojure, JavaScript
Databases: PostgreSQL, MySQL, MongoDB, Apache HBase, Redis, Kafka, Flink
Cloud: AWS Redshift, S3, Athena, EMR