Allen Thomas

+1 217-298-6572 | misalignedmodel.com | github.com/AllenThomasDev

allenthomasdev@gmail.com | linkedin.com/in/AllenThomasDev

AI Engineer with 4+ years experience in large-scale systems, specializing in LLM applications and ML infrastructure

Education

University of Illinois at Urbana-Champaign

Master's in Computer Science | 3.85/4.0

August 2024 - May 2026

Distributed Systems, Systems for Gen AI, Topics in LLM Agents, Deep Learning

University of Pune

Bachelor's in Computer Engineering | 3.40/4.0

August 2017 - May 2021

Reinforcement Learning

Experience

Full-Stack Engineer

Cerebion LLC - Post-quantum cryptography compliance platform

May 2025 - Present

  • Developing AI-powered code refactoring platform to address enterprise post-quantum cryptography compliance requirements, targeting organizations facing 2030+ migration deadlines.
  • Built ML engine detecting 25+ cryptographic vulnerability patterns with 90%+ accuracy on test datasets.
  • Developed AI-powered code analysis platform using LLMs and Ghidra decompilation, reducing manual code review time by 80% for post-quantum cryptography compliance.
  • Implemented multi-language parser enabling analysis across 10+ programming languages, expanding compatibility from Java-only to enterprise-wide coverage.

Software Engineer

Helpshift - Customer support platform installed on over 2 billion devices

June 2022 - June 2024

  • Raised analytics uptime from 99.0% to 99.99% and cut $250,000/yr by leading analytics infrastructure migration to AWS
  • Migrated analytics pipelines from HBase to Redshift, enabling 10× traffic growth for 200+ customers with zero downtime
  • Eliminated stream processing bottlenecks affecting real-time analytics by migrating legacy Storm infrastructure to Flink, reducing event latency by 35% for 40K+ support agents.
  • Preserved 350+ TB of historical data during migration, maintaining 6+ years of customer analytics access
  • Established Airflow standards and documentation across 5+ teams, saving 15 developer hours weekly
  • Reduced ad-hoc engineering data requests by 40% by implementing Metabase self-service analytics platform
  • Mentored 10+ hires on coding practices and system architecture, reducing time to first release by 35% compared to previous year.

Projects

AI Home & Appliance Management

May 2025

Presented at Chicago Booth GNVC Finals - React Native, TypeScript, FastAPI, LLMs

  • Shipped a react prototype to Global New Venture Challenge at Chicago Booth School of Business judges in under 3 weeks.
  • Delivered production-grade backend with async workflows, Supabase persistence, and resilient GPS/camera upload flows
  • Engineered image-based appliance recognition powered by LLMs, reaching 85% identification accuracy through image processing and structured prompt templates.
  • Designed type-safe APIs with Pydantic and implemented real-time sync powering a multi-property maintenance dashboard.

Control Vector-Based LLM Steering

March 2025

LLM Steering, Behavioral Analysis

  • Implemented control vector techniques to steer LLM behavior across personalities to modify model outputs.
  • Built pipeline to analyze model responses at varying control strengths and evaluation methods for behavioral analysis.

Distributed Stream Processing Framework

Oct - Nov 2024

Golang, Distributed Systems, Fault Tolerance

  • Built and deployed HyDFS, a custom fault-tolerant distributed file system supporting replication, consistent hashing, and automated recovery, providing the foundational storage layer for distributed data processing.
  • Designed a high-throughput, distributed stream processing framework with exactly-once semantics on top of HyDFS, enabling reliable real-time data handling across multiple nodes.

Awards & Research

Agentic Research Code Reproducibility System

October 2024

Autonomous Research Validation - Docker, Google Gemini, OpenAI

  • Developed autonomous system that reproduced 12 out of 15 academic papers with zero human intervention, reducing research validation time from weeks to hours through intelligent Docker environment reconstruction.

Multi-Agent Reinforcement Learning: Hide and Seek

2021

IEEE Publication - Reinforcement Learning, Multi-Agent Systems

  • Researched and implemented multiple reinforcement learning algorithms for multi-agent systems, developing a novel hide-and-seek simulation environment inspired by OpenAI research on Multi-agent Autocurricula

Skills

Languages: Java, Python, Golang, Clojure, JavaScript

Databases: PostgreSQL, MySQL, MongoDB, Apache HBase, Redis, Kafka, Flink

Cloud: AWS Redshift, S3, Athena, EMR