About Me

I'm Maluhia. I love working on machine learning models as well as process analysis and improvement. Currently I am a founding member of the Solutions Engineering team at Paramark, a startup in San Francisco. There, I'm a full-stack data professional, working in Python to do ETL on client marketing data, train and tune ML models, and design and analyze marketing budget experiments. I also act as a relief team, taking analysis work off the software engineers' plates so they can keep building the product I use to analyze data! I have an MS in Analytics from the Georgia Institute of Technology and a BS in Computer Science from BYU-Hawai'i.

Personal Projects

Take a look at what I've been working on

  • Breaking Grad(ients)

    Deep Learning, PyTorch, Object Oriented Programming, Neural Networks, Vision Transformers

    This Deep Learning project explores the effect of Fast Gradient Sign Method (FGSM) attacks on pre-trained models such as EfficientNet and Data-Efficient Image Transformers (DeiT), as well as our own custom JuanchitoCNN. We wanted to show how these models hold up against FGSM, a common white-box attack, when discriminating between AI-generated and natural images. (A rough sketch of the attack appears after this project list.)

    Read the Report

    GitHub & Code

  • Pokemon Team Personality Recommender

    Python, Natural Language Processing, APIs

    This project takes personality inputs from a user, compares them to the Pokedex description of each Pokemon, and returns a team based on cosine similarity. It uses natural language processing techniques such as Bag of Words, Term Frequency-Inverse Document Frequency (TF-IDF), GloVe, and BERT. (A minimal version of the matching step is sketched after this project list.)

    Check It Out

  • Sentimental

    R, R Shiny, Natural Language Processing

    This R Shiny tool lets blog writers and email marketers see the sentiment and flow of their writing. Users get sentiment counts with the words behind each count highlighted, a summary of what they wrote, ratio comparisons between sentiments, a list of ambiguous words to revise for a clearer message, and a writing trajectory that tracks message valence sentence by sentence. (The US Declaration of Independence makes a great sample; it takes about 21 seconds to process.)

    Test It Out

  • Complainalyzer

    Python, R, Tableau, Time Series Analysis, Natural Language Processing

    Our goal was to transform the way consumer finance complaints from the Consumer Financial Protection Bureau (CFPB) database are visualized, understood, and addressed. We highlight patterns and trends in customer complaints using techniques such as Latent Dirichlet Allocation (LDA), VADER sentiment scoring, and bigram analysis, and we forecast future complaint trajectories with Facebook Prophet and ARIMA. (A toy version of the sentiment and bigram step appears after this project list.)

    GitHub & Results

    Tableau Dashboard

  • Confusion Matrix Scaler

    R, R Shiny, Classification

    This R Shiny tool scales a confusion matrix up or down. I use the Scaler to carry the error rates observed in a training/testing confusion matrix over to a new set of binary classification predictions. If my training/testing accuracy is anything short of 100%, I want to break down how many of the Predicted Positives and Predicted Negatives are likely to be False Positives and False Negatives. I use this on sales pipelines after using classification methods to predict which deals are likely to end up closed-won and which closed-lost. (The underlying arithmetic is sketched after this project list.)

    Try It

  • Pandemic Simulation

    Python, Statistical Analysis

    This statistical analysis simulates different pandemic scenarios and compares their severity. The project aimed to mimic real-world virus statistics, population density statistics, and vaccine distribution statistics.

    Read It
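
A few quick sketches of the techniques behind these projects. Breaking Grad(ients) is built around the Fast Gradient Sign Method; here is a minimal sketch of an FGSM attack in PyTorch against a pre-trained torchvision EfficientNet. The epsilon value, dummy inputs, and weights name are illustrative assumptions, not the settings from the report.

    import torch
    import torch.nn.functional as F
    from torchvision import models

    def fgsm_attack(model, image, label, epsilon=0.03):
        """Perturb the image along the sign of the loss gradient (FGSM)."""
        image = image.clone().detach().requires_grad_(True)
        loss = F.cross_entropy(model(image), label)
        loss.backward()
        # Each pixel moves +/- epsilon in the direction that increases the loss.
        adversarial = image + epsilon * image.grad.sign()
        return adversarial.clamp(0, 1).detach()

    # Dummy batch; a real run would use properly preprocessed images.
    model = models.efficientnet_b0(weights="IMAGENET1K_V1").eval()
    x, y = torch.rand(1, 3, 224, 224), torch.tensor([0])
    x_adv = fgsm_attack(model, x, y)
    print(model(x).argmax(1).item(), "->", model(x_adv).argmax(1).item())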
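
The matching step of the Pokemon Team Personality Recommender boils down to text similarity. Below is a small sketch of that idea using TF-IDF and cosine similarity; the Pokedex descriptions are invented placeholders, and the real project also tries Bag of Words, GloVe, and BERT representations.

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.metrics.pairwise import cosine_similarity

    # Toy stand-ins for Pokedex entries; the real project pulls them from an API.
    pokedex = {
        "Charmander": "A fiery, determined companion that never gives up.",
        "Snorlax": "Calm and relaxed, it spends most of the day sleeping.",
        "Pikachu": "Energetic and friendly, it loves being around people.",
    }

    def recommend_team(personality, team_size=2):
        """Rank Pokemon by cosine similarity between the user's personality
        text and each Pokedex description, then return the top matches."""
        names = list(pokedex)
        tfidf = TfidfVectorizer(stop_words="english")
        matrix = tfidf.fit_transform([personality] + [pokedex[n] for n in names])
        scores = cosine_similarity(matrix[0], matrix[1:]).ravel()
        return sorted(zip(names, scores), key=lambda p: p[1], reverse=True)[:team_size]

    print(recommend_team("Calm, relaxed, and always ready for a nap"))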
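
The Complainalyzer leans on sentiment scoring and bigram counts. Here is a toy sketch of that step, assuming NLTK's VADER implementation; the complaint texts are made up, not drawn from the CFPB database.

    from collections import Counter
    import nltk
    from nltk.sentiment.vader import SentimentIntensityAnalyzer

    nltk.download("vader_lexicon", quiet=True)

    # Made-up stand-ins for CFPB complaint narratives.
    complaints = [
        "The bank charged hidden fees and never returned my calls.",
        "My credit report shows an account I never opened.",
        "Customer service resolved my billing error quickly.",
    ]

    sia = SentimentIntensityAnalyzer()
    bigram_counts = Counter()

    for text in complaints:
        # VADER compound score runs from -1 (most negative) to +1 (most positive).
        score = sia.polarity_scores(text)["compound"]
        tokens = [t.strip(".,").lower() for t in text.split()]
        bigram_counts.update(nltk.bigrams(tokens))
        print(f"{score:+.2f}  {text}")

    print("Most common bigrams:", bigram_counts.most_common(3))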
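
The Confusion Matrix Scaler rests on simple rate arithmetic: estimate how often predicted positives and predicted negatives were wrong in testing, then apply those rates to a fresh batch of predictions. This is a hedged sketch with invented numbers, not the tool's exact implementation.

    def scale_confusion(train_tp, train_fp, train_tn, train_fn,
                        new_pred_pos, new_pred_neg):
        """Project the error rates from a training/testing confusion matrix
        onto a new batch of binary predictions (e.g. a scored sales pipeline)."""
        # Share of predicted positives that were wrong in testing.
        false_discovery_rate = train_fp / (train_tp + train_fp)
        # Share of predicted negatives that were wrong in testing.
        false_omission_rate = train_fn / (train_tn + train_fn)
        return {
            "expected_true_positives": new_pred_pos * (1 - false_discovery_rate),
            "expected_false_positives": new_pred_pos * false_discovery_rate,
            "expected_true_negatives": new_pred_neg * (1 - false_omission_rate),
            "expected_false_negatives": new_pred_neg * false_omission_rate,
        }

    # Invented numbers: a test matrix of 80 TP / 20 FP / 85 TN / 15 FN scaled onto a
    # pipeline where 200 deals are predicted closed-won and 300 closed-lost.
    print(scale_confusion(80, 20, 85, 15, new_pred_pos=200, new_pred_neg=300))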

Work Projects

While my personal work is public, here is some of the more proprietary work I do:

Completed

  • Developed, validated, and deployed new time-series machine learning models, reducing average sales forecasting error from +16% to +1.5%
  • Identified waste in the sales forecasting process through data collection, ETL (Extract, Transform, Load) operations, exploration, and financial model prototyping, leading to a more scalable process and reducing time spent on forecasting tasks by 80%
  • Coordinated a new Professional Services strategy among the accounting, services, and sales operations teams, leading to a 50% reduction (so far) in processing time and a 90% reduction in manual errors
  • Developed and implemented A/B tests (DOE) for evaluating the performance of marketing assets, resulting in an increase of click-through rates from 5% to 13% on ads, and 15% to 22% on whitepapers
  • Formalized reporting, centralized data in HubSpot, and created redundancies for regular data quality checks, leading to a sales pipeline increase of over 20% and a 541% increase in sales leads

Skills

  • Languages & Technology

    Python, PyTorch
    NumPy, Pandas
    Scikit-Learn, NLTK
    R, R Shiny
    SQL, SQLite
    Excel, VBA
    Tableau, Power BI
    Jupyter Notebooks

  • Data

    Time Series Analysis
    Neural Networks
    Exploratory Data Analysis
    Natural Language Processing
    Machine Learning
    Statistical Analysis
    Classification
    Clustering
    ETL Design

  • Business

    Sales Forecasting
    Process Improvement & Automation
    Project Management
    Customer Segmentation
    Churn Analysis
    Product Development & Analytics
    A/B Testing
    KPI Benchmarking
    Marketing Experimentation