Projects - Kevin Galvan Cuesta

Real-time Voice Conversion

An app I developed after being unable to test voice conversion applications on my laptop while abroad. Most systems are CUDA-based and require significant processing power, which this lightweight version avoids by running locally on Apple Silicon. It runs on an M-Series MacBook Air with 300-400ms of latency.

The system is an RVC-based pipeline using PyTorch MPS to make it viable on Apple Silicon. To allow real-time functionality, the audio is processed in 100-250ms chunks with overlapping context. Inference is performed on recent windows to keep the stream stable and reduce audio artifacts. I also built a custom low-latency audio pipeline with CoreAudio and BlackHole to handle the routing, buffering, and synchronization.

This is ongoing work. I'm currently focused on reducing latency while maintaining stability, particularly around queue management and chunk scheduling.

ARC Artificial General Intelligence Competition

I developed an AI agent capable of solving visual IQ test problems, where each problem consists of a set input images (left) and target transformations (right). After it is shown 2-4 examples of inputs and targets, the agent must be able to generate a target image using only an unpaired input image. This agent was created using many mixed techniques primarily from field of Knowledge-Based AI.

The current version is able to solve 60+ distinct visual IQ problems and each iteration is improving its ability to recognize problems, generalize them, and apply new solution techniques. It uses Generate and Test, a Domain-Specific Language, Heuristic Search, and Means-Ends Analysis, in addition to analyzing the inputs using Computer Vision methods from SciPy. Basic transformations are built into the agent where it then modifies and combines these transformations to come up with the target image. This work is not publicly available, but I am happy to discuss the project in more detail.

Reinforcement Learning Stock Trading Bot

At Georgia Tech, I spent time applying Machine Learning techniques to the stock trading problem. This work encompassed building a market simulator that accounted for commission, slippage, and other real-world factors; implementing the Q-Learning algorithm; and formatting the market simulator to fit the Reinforcement Learning framework. I researched Q-Learning theory to effectively parameterize the algorithm for Out-of-Sample performance.

The Stock Trader beat benchmarks (longing the target stock) on a number of stocks over different time periods. It did so while achieving lower levels of volatility compared to the underlying stock price. It also showed adaptation to different market conditions including varying levels of slippage in the simulations. It was able to do so using only a few indicators such as Bollinger Bands, Momentum, and Percentage Price Index. This is an ongoing work as I investigate increasing the granularity of the data (minute-data/other indicators) and using more advanced techniques such as Deep Q-Learning. This work is not publicly available, but I am happy to discuss the project in more detail.

Google Certification: ML Analytics

This certification covered several topics including statistics, regressions, and machine learning techniques. As part of the program, I developed a prediction regression model from NYC Taxi and Limousine data. The model used data from pick up and drop off locations, time stamps, fares, and other variables to predict trip durations. Furthermore, a decision tree and XGBoost model were trained to predict whether a rider would be a generous tipper. Practical applications could increase revenue and improve ride efficiency, as well as provide drivers with ways to earn more revenue.

This certification delved deeply into project planning, data cleaning, Exploratory Data Analysis, model construction with parameter tuning, visualizing results, and providing reports to stakeholders. The GitHub repository contains executive summaries for each course within the program, along with Jupyter coding books to follow along with the work. Pickle files are provided to run the models. The dataset can be found here.

SVM Ensemble: Predicting Development

This was the central project that I worked on as a consultant for the California Housing Department. Using our predicted financial metrics for building permits of different sizes, I created a Support Vector Machine ensemble to predict the likelihood that a parcel of land would be developed. A large portion of the preprocessing was completed in Stata using .do files. The regressions used as input for the SVM were also completed in Stata, but later migrated to Excel. The SVM was made using Scikit-learn's SVM function in Python. Some final preprocessing occurs here as well as calculating and displaying metrics such as Precision, Recall, and ROC curves.

Due to the size of the original model parameters and datasets, I have only included the smallest files used for initial testing. The finalized models folder contains various tuning settings and training size sets that were utilized to obtain the best-performing models. Running the best models often took several days (on a CUDA-supported NVIDIA GPU), and I have included the output of our final, highest-performing Support Vector Machine model (pictured above).

Computer Vision: Pose Estimation

Above you are seeing our team's first iteration of a web app that allows you to analyze human poses without having to install any software tools on your device! The original desktop application, which is available for download, takes in images of a person and the subject's height and outputs coordinates in a metric space using a Convolutional Neural Network (CNN) from the BlazePose and Caffe libraries. Our website is based on the original app. It includes improved features and, most importantly, enhanced ability to accurately map 68 desired body nodes and produce positional and angular time data about their coordinates.

**The web project has unfortunately been taken private by the university and is no longer publicly accessible.

Naive Bayes Classifier

During my time as an Advanced Econometrics teaching assistant, I developed a hands-on approach to teaching Machine Learning concepts to students. To achieve this, I created a problem set that demonstrated the probability mechanics behind the Naive Bayes Algorithm. I removed certain sections from the assignment version and left instructions for the students to work out on their own. The problem set involved creating synthetic samples, calculating the running sum of positive observations, and using those observations to calculate the chi-squared statistic. The students were then tasked with predicting the likelihood of the next observation being positive. Finally, I provided them with prior knowledge about the distribution of samples to train the classifier. This approach helped to enhance the students' understanding of Machine Learning concepts and techniques.

Above is the probability estimate that Box 2 (b2) is of type X (chiX) as the number of observations increase. It is a multinomial classifier with 3 classifications.