Jim Bonant

ABOUT

I build the infrastructure
that makes AI reliable.

Senior AI/ML Infrastructure Architect and MLOps Engineer with 13+ years designing, deploying, and optimizing scalable hybrid cloud and on-prem AI platforms. I specialize in production LLM serving, Retrieval-Augmented Generation (RAG) systems, vector databases, and custom LLM solutions in Python, Rust, and C++.

I’ve led MLOps transformations that delivered 70–80% faster deployments, 99.99% uptime, and substantial cost and performance gains across enterprise environments.

Currently running my own experimental platform, Naturally Artificial of Pennsylvania, I architect bare-metal and containerized LLM inference platforms and containerized AI products that power real customer workloads at scale. Previously, I led machine learning teams at PNC and drove major containerization and CI/CD initiatives at BNY Mellon.

LinkedIn • GitHub ML OPS • GitHub Old

CAREER

Experience

Full resume →

March 2026 — Present

Naturally Artificial of Pennsylvania

Senior AI/ML SecOps & Platform Engineer

Architecting and deploying scalable hybrid bare-metal LLM inference platforms integrated with containerized applications and APIs. Supporting 100k+ daily inferences with sub-500ms latency (3× speedup).

• Designed and led end-to-end MLOps + DevOps CI/CD pipelines (Jenkins, GitLab) for full-lifecycle model deployment and monitoring.
• Built custom RAG systems, including the S.A.R.A. executive AI agent and domain-specific variants, achieving 95% retrieval accuracy.
• Engineered the complete infrastructure stack with comprehensive AI/ML SecOps (zero-trust, vulnerability scanning, encryption, AI-powered security gates).
• Currently operating live multi-cloud environments on AWS, Azure, and Google Cloud.
• Deploying high-availability infrastructure for RAG using vLLM, KServe, and a gateway architecture.
• Notable side projects in blockchain development and chain hardening, with advanced study of ECL, RSAHash, and general RSA encryption practices. The focus is on finding and breaking weaknesses in order to strengthen my own chains.

March 2024 — March 2026

Travel and Life Break

Vacation

I planned to take one year off but extended it to two, thanks to favorable crypto markets.

• Sailed the Great Lakes during the summers of 2025 and 2026
• Full home remodel projects
• Yoga, skiing, art, hiking, and rock climbing
• Heavy focus on mathematical research in post-quantum cryptography
• I was fortunate to take this time and have kept many of the details private, but the gap needed to be addressed honestly.

July 2021 — March 2024

PNC

Machine Learning Team Lead

Led a cross-functional team of 4 engineers building and deploying production ML models and MLOps pipelines for financial analytics and risk applications.

• Accelerated model iteration by 50% and improved accuracy through optimized data pipelines.
• Delivered 60% faster time-to-production and 55% lower query latency for high-volume transaction systems.
• Drove enterprise MLOps adoption (model versioning, automated testing, observability) and mentored engineers on container orchestration.

Feb 2018 — Aug 2022

Bank of New York Mellon

VP – Production Release Engineering

Led major modernization of release engineering and infrastructure in highly regulated environments.

• Migrated multiple enterprise applications to Docker/Kubernetes, increasing deployment velocity 4× while maintaining 99.95% uptime.
• Reduced release times by 60% and production incidents by 85% through GitLab CI/CD migration and Jenkins cleanup.
• Implemented advanced git-flow strategies and Liquibase automation, cutting merge conflicts by 70%.

July 2012 — Feb 2018

Broadridge Financial

Senior DevOps Engineer

Principal build and automation engineer for enterprise financial reporting platforms.

• Reduced manual deployment effort by 75% through Python, Shell, Groovy, and Jenkins/Bamboo automation pipelines.
• Built custom Python deployment and QA tooling on Web2py and Django frameworks.
• Led system provisioning with Chef and Ansible while supporting high-stability financial applications.

SELECTED WORK

What I’ve shipped

View all on GitHub →

PRODUCTION AI AGENT

S.A.R.A. Executive AI Agent

Custom Retrieval-Augmented Generation system for executive decision support. Built full RAG architecture with specialized high-accuracy variants for marketing, form generation, and strategic analysis using vector databases and advanced prompting. Achieved 95% retrieval accuracy and 65% latency reduction.

Python • Rust • Vector DBs • LangChain • Secure Infrastructure

INFRASTRUCTURE

Bare-Metal LLM Inference Platform

Production hybrid platform combining bare-metal GPU inference with containerized front-ends and APIs. Supports 100k+ daily inferences at sub-500ms latency. Full MLOps lifecycle automation with comprehensive SecOps controls (zero-trust, encryption, AI-powered gates).

Kubernetes • Docker • Helm • Jenkins/GitLab CI • Prometheus/Grafana

Explore more on GitHub

GitHub Actions workflow: Terraform spin-up of a Kubernetes cluster in AWS using IaC

Overview of a gateway-style RAG vLLM deployment. KServe and GKC ready. Native full-metrics SRE data.

Video Resume & Homelab Overview

Skills & Technologies

Education

Have a challenging AI infrastructure problem?