# About Me ## Shane Zhang (张欣耕) AI/ML Engineer, LLM Specialist, RAG Expert ## Contact Information - 732-491-6378 - zhangxingeng970221@gmail.com - www.shanezhang.com ## Summary ## Professional Experience ### Citigroup Inc. US - AI/ML Engineer *Aug 2024 - Present* Location: Rutherford, NJ - Built a fast structured data processing pipeline with Python, reducing processing time from 10 seconds to under 1 second per document, enhancing document handling efficiency by 90%. - Designed and accelerated a robust RESTful API with Python FastAPI for real-time data ingestion and NLP processing into Postgres SQL database, supporting over 1 million API calls per day. - Deployed solutions at scale with OpenShift and Apache Spark, streamlining CI/CD procedures with zero downtime over 120 days. - Leveraged advanced Agentic RAG systems and Knowledge Graphs with RDF and LPG formats using Python and LangChain to improve global retrieval accuracy, increasing the F1 score of AI-driven compliance analysis from 0.32 to 0.71. - Implemented evaluation metrics and visualizations with LangChain and RAGAS to ensure LLM quality, reducing hallucination rate by 30% and error investigation time by 60%. - Utilized OpenAI GPT, Claude, and Google Gemini models and fine-tuned open-source LLM models to develop customized and scalable Agentic RAG systems handling over 200,000 API calls per day. - Developed human-centered evaluation frameworks (RLHF) to assess LLM performance in real-world scenarios, continuously ensuring alignment with user intents for over 2,000 active users. - Deployed scalable AI-driven RAG systems on AWS, using S3, EC2, Glue, Lambda, SageMaker, and Bedrock for efficient data processing and LLM integration pipelines. - Developed and deployed FastAPI interfaces and gRPC protocols for efficient API integration across cloud providers like Azure and AWS. ### Robert Wood Johnson University Hospital - AI/ML Engineer *Jan 2023 - Aug 2024* Location: New Brunswick, NJ - Designed, built, and deployed an Agentic RAG system using Python, NumPy, Pandas, JavaScript, SQL, and Chroma vector database, automating the parsing and summarization of over 5,000 research documents. - Engineered an advanced Agentic RAG system with OpenAI and React framework, reducing onboarding time by 50% and improving research efficiency by 40%. - Collaborated with a cross-functional team of 15 academic researchers, integrating over 200 feedback points into the NLP system to align the platform with current and future research objectives. - Conducted monthly workshops on ML usage for 10+ staff members, achieving 80% adoption of ML-enhanced workflow across the organization. - Utilized AWS SageMaker for model fine-tuning and AWS Bedrock for serving the models in production environments. ### Fiskkit Inc. - Machine Learning Engineer *Jan 2020 - July 2021* Location: San Francisco, CA - Integrated NLP-driven features into the Node.js backend with Python PyTorch (C++ CUDA), enabling real-time text generation and summarization functionalities, cutting response times by 70%. - Conducted data pre-processing and data exploration using Python PySpark, NumPy, and Pandas, ensuring high-quality data integration and leading to a 20% improvement in model training results using PyTorch distributed. - Optimized deep learning models through quantization and pruning techniques with TensorRT, achieving a 40% decrease in inference latency. - Produced a semantic graph database using Neo4j and Cypher queries to store and query complex relationships, improving data retrieval efficiency by 60%. ## Key Projects ### Enterprise-Scale RAG System *Citigroup* Built a comprehensive RAG system for financial document analysis using LangChain, pgvector, and AWS Bedrock Technologies: - Python - FastAPI - LangChain - Postgres - pgvector - AWS Bedrock ### Healthcare Research Assistant *RWJUH* Developed an AI assistant for medical researchers using OpenAI APIs, vector databases, and React Technologies: - Python - React - OpenAI - Chroma DB - AWS SageMaker ### NLP Text Analysis Platform *Fiskkit* Created an NLP-powered platform for text analysis with Node.js backend and PyTorch models Technologies: - Node.js - Python - PyTorch - PySpark - Neo4j ## Technical Skills ### Generative AI - LLM Fine-Tuning - Parameter-Efficient Fine-Tuning (PEFT) - Retrieval-Augmented Generation (RAG) - Agentic RAG Systems - Knowledge Graphs (RDF) - Knowledge Graphs (LPG formats) - GraphRAG - OpenAI GPT - Google Gemini - Anthropic Claude - LangChain - LangGraph - Prompt Engineering - RLHF ### Machine Learning & NLP - PyTorch - TensorFlow - Keras - Transformers - HuggingFace - Natural Language Processing (NLP) - spaCy - NLTK - Feature Engineering - Statistical Modeling - Quantization - Pruning Techniques - Model Evaluation - Performance Analysis - Neural Networks - GANs - Stable Diffusion ### Data Engineering - Postgres with pgvector - FAISS - Chroma - Neo4j - Cypher - MySQL - PostgreSQL - Oracle - Elasticsearch - MongoDB - PySpark - Pandas - NumPy - Apache Spark - Hadoop - ETL Workflows - Data Processing Pipelines ### Programming - Python - FastAPI - Streamlit - PyTest - JavaScript - Node.js - React - Scala - Java - C++ - REST API Development - gRPC - JWT - Authentication Systems ### DevOps & MLOps - Docker - Kubernetes - OpenShift - CI/CD Pipelines - Development Workflows - AWS SageMaker - AWS Bedrock - AWS S3 - AWS EC2 - AWS Lambda - AWS Glue - Azure Cloud Services - MLflow - RAGAS ### Professional & Soft Skills - Technical Communication - Documentation - Team Collaboration - Cross-functional Leadership - Stakeholder Management - Client Presentations - Mentoring - Training - Agile Methodologies - Project Management ## Education ### M.S. in Computer Science, Machine Learning Specialty - Rutgers, The State University of New Jersey *Dec 2020 - Dec 2022* GPA: GPA: 3.71 ### B.S. in Computer Science - Rutgers, The State University of New Jersey *Sep 2017 - Sep 2020* ## Languages - English (Native, Fluent) - Mandarin (Native, Fluent) ## Certifications ### AWS Certified Machine Learning - Specialty (MLS-C01) *Amazon Web Services* - Jun 2024 This certification validates expertise in building, training, tuning, and deploying machine learning, Deep Learning, Generative AI, and LLM models on AWS. ## Teaching Experience ### Teaching Assistant - Rutgers University *Dec 2020 - Dec 2022* Tutored over 400 students in Computer Science and Machine Learning topics ## Additional Information ### Publications - Berns, M. P., Nunez, G. M., Zhang, X., et al. (Sep 2024). Auditory Decision-making Deficits After Permanent Noise-induced Hearing Loss. ## Interests - Generative AI - SCUBA Diving - Reading - Philosophy - Psychology - Sociology --- Resume generated from structured data - Visit the full interactive version at: https://www.shanechang.com/about-me/