Hi, I'm Hozaifa Owaisi

AI Developer and Researcher

About Me

Hozaifa Owaisi profile picture from hackathon

Education

University of North Carolina at Pembroke

Bachelor of Science, Major: Computer Science, Minor: Cybersecurity

Expected May 2025

Wake Technical Community College

Associates in Engineering Transfer Program, Major: Computer Science

August 2022

My Skills

PythonJavaTypeScriptReactExpressSpringBootMongoDBAWSPyTorchTensorFlowGitDocker

I am a dedicated and passionate individual pursuing my interests in Computer Science, Data Science, and AI/ML. My drive stems from a deep love for tackling intricate challenges, whether in machine learning, data exploration and engineering, AI, web security, or development and operations. I thrive on the process of problem-solving, spending days searching for solutions, experimenting with different approaches, learning from failures, and eventually uncovering the right path. It's the journey, not just the result, that excites me most. I am a quick learner, often eager to join and explore new things.

Work Experience

My professional journey in AI research, development, and engineering.

LLM Research and Development Engineer

AI Institute of South Carolina – in collaboration with UNCP

Current
January 2025 - April 2025
Pembroke, NC

Project: Knowledge Editing and Testing Framework for LLMs

Python
PyTorch
TensorFlow
Keras
Scikit-learn
  • Researched and implemented LLM knowledge editing techniques to efficiently update models without full retraining, reducing computational costs while improving adaptability
  • Designed custom augmentation pipelines and benchmarking strategies to evaluate and optimize LLM performance across diverse data domains
  • Developed multi-scale AI/ML solutions with MLOps workflows using Amazon SageMaker for automated training, tuning, and deployment
  • Conducted comparative analysis of LLM architectures and contributed to system change management, ensuring seamless migrations aligned with stakeholder goals

All Technologies:

Python
PyTorch
TensorFlow
Keras
Scikit-learn
LCLangChain
RAGRAG
AWS SageMakerAWS SageMakerAWS LambdaAWS Lambda
AWS EKS
Custom Benchmarking Tools
Git
Bitbucket

Full Stack AI Engineer

NSA Laboratory for Analytic Sciences (LAS) @ NCSU

August 2023 - January 2025
Raleigh, NC

Project: Internet Routing Integrity & RPKI Visualization Platform

Angular
JavaJava
Spring Boot
Python
Django
  • Developed full stack application using Angular, Spring Boot, and REST APIs with real-time data visualizations for routing integrity analytics
  • Implemented ETL/ELT pipelines for large-scale datasets and collaborated with data engineering to model organizational metadata for AI pipelines
  • Built LLM-based classification models and an interactive RAG-powered chat interface with custom PostgreSQL pgVector database integration
  • Established scalable LLM workflows including preprocessing, fine-tuning, and deployment with RayTune optimization, presenting findings at NSA-hosted conferences

All Technologies:

Angular
JavaJava
Spring Boot
Python
Django
REST APIs
LLMs
vLLMs
RTRay Tune
AWS EC2
AWS S3
ETL/ELT
Apache Spark
MongoDB
PostgreSQL
pgVector
Git
JUnit
PyUnit
Agile
FlaskFlaskTypeScriptTypeScriptOpenAIOpenAIClaudeClaudeGeminiGemini
huggingface
Data Analysis
ARIN
RPKI
Full-Stack
Model deployment
Hyperparameter Optimization
Model Finetuning
W&B

DL Research and Development Engineer

Corvid Technologies @UNCP

January 2023 - January 2024
Pembroke, NC

Project: Foundational Model Development for Rapid Blood Clot Detection using DNNs

Python
PyTorch
TensorFlow
Scikit-learn
RTRay Tune
  • Designed and implemented PyTorch-based DNN models for blood clot prediction with Ray Tune hyperparameter optimization
  • Developed automated data preprocessing pipelines for complex CFD datasets, improving model generalization and training convergence
  • Established modular ML pipelines with CI/CD integration and migrated development to AWS cloud infrastructure for improved collaboration
  • Optimized inference latency and model deployment, contributing to peer-reviewed publications and delivering presentations at academic events

All Technologies:

Python
PyTorch
TensorFlow
Scikit-learn
RTRay Tune
Pandas
NumPy
Git
AWSAWS
Jupyter
Apache Spark
DynamoDBDynamoDB
REST APIs
CI/CD
GitHub Actions
Machine Learning
Research
XGBoost
CFD

My Projects

A selection of my research and personal projects focusing on machine learning, data analytics, and web development.

DNN Models for Blood Clot Detection

DNN Models for Blood Clot Detection

Conducted research to develop foundational DNN models using PyTorch and Ray Tune for predicting blood clots (Thrombogenesis) in COVID-19 patients, analyzing Computational Fluid Dynamics (CFD) data. Explored hyperparameter optimization, dimensionality reduction, data-streaming techniques, GPU optimization and scheduling techniques, and data encoding techniques.

Python
PyTorch
RTRay Tune
Machine Learning
Research
XGBoost
TensorFlow
Scikit-learn
AWSAWS
CFD
Pandas
Scikit-learn
NumPy
Git
Jupyter
Apache Spark
DynamoDBDynamoDB
REST APIs
CI/CD
GitHub Actions
Internet Routing Integrity Project

Internet Routing Integrity Project

Led a student team in a full-stack research project with the NSA Lab (LAS @ NCSU). Explored ARIN datasets, visualized RPKI information and helped develop a full-stack website for policymakers using Angular and Flask. Built ETL pipelines and integrated REST APIs. Later presented and demoed the project to NIST.

Angular
FlaskFlask
MongoDB
AWSAWS
Data Analysis
Python
TypeScriptTypeScript
REST-API
ETL
ARIN
RPKI
Full-Stack
Model deployment
Hyperparameter Optimization
Model Finetuning
OpenAIOpenAIClaudeClaudeGeminiGemini
huggingface
Research
W&B
JavaJava
Spring Boot
Django
REST APIs
LLMs
vLLMs
RTRay Tune
AWS EC2
AWS S3
ETL/ELT
Apache Spark
PostgreSQL
pgVector
Git
JUnit
PyUnit
Agile
AI-Udoin? Personal Therapy Multi Voice Agent

AI-Udoin? Personal Therapy Multi Voice Agent

Developed a therapy-oriented multi-call agent (HackUNCP 2025 Healthcare Track Winner). Engineered context-aware conversations using LLMs augmented via Pinecone vector DB (RAG) and built a dynamic multi-voice synthesis pipeline. We are exploring further implimentations of this project

Python
LLM
RAGRAG
Pinecone
Vector DB
Azure
Voice Synthesis
Hackathon
Car Reviews Analysis with LLMs ('Car-ing')

Car Reviews Analysis with LLMs ('Car-ing')

Built a FastAPI-based NLP service ('Car-ing') using DistilBERT, Helsinki-NLP, BART, etc. for sentiment analysis, translation, Q&A, and summarization. Deployed with Docker Compose and experimented with Azure Kubernetes. Implemented MLOps pipelines.

Python
NLP
DistilBERT
BART
Machine Learning
FastAPIFastAPIDockerDocker
Azure
KubernetesKubernetes
MLOps
1-Layer Digit Recognition from Scratch

1-Layer Digit Recognition from Scratch

Developed a single-layer neural network for digit recognition using only Pandas (83% accuracy) by manually implementing propagation and updates. Later version with PyTorch achieved 94% accuracy.

Python
Pandas
PyTorch
Neural Networks
Machine Learning

Get In Touch

Interested in collaborating on research, discussing projects, or exploring potential opportunities? Feel free to reach out!

Contact Information

Email

howaisi.h@gmail.com

Phone

Redacted

This feature is coming later :)

Location

North Carolina

Send Me a Message