David Cardenas

Logo

View the Project on GitHub dehercar/portfolio

Data science - Data engineering

Contact me in: LinkedIn

Location: Monterrey, Mexico.

Technical Skills: Python, SQL Server, SSIS, Power BI, AWS Sagemaker, Tableau, GIT

Relevant Projects

UNSPSC Categorizer

NLP tool developed in Python using RegEx, NLTK and scikit-learn (Tfidf, SVM) and Descriptions-UNSPSC labels as datasource (csv) to retrieve UNSPSC for a given part description.

See repository.

Part Number Matcher

This Power BI dashboard uses as datasource conclusions about similarities on Part Numbers to identify duplicated developed with Python and using RegEx, fuzzywuzzy and PyODBC library to connect to SQL Server, as well as gathering data from other Power BI datasources using DAX Studio. Datasource length about 1 million rows.

Productivity Predictor

This is a Tableau dashboard developed for a Procurement Community whose datasource (3 million rows) has future productivity predictions at part number level with two main purposes:

  1. Give visibility of future impacts on productivity based on cost trends.
  2. Find money leakage opportunities that impacts productivity by getting the gap of cost based on issued invoices and past submitted price udpate requests (e.g. PIR in SAP).

1. Future Productivity

Unit Costs
a. Unit Costs predictions

Tasks to predict cost were:

b. Price Update Requests
Volumes
Market Raw Material Indices

2. Money Leakage

The Tableau dashboard was made with features such as merge datasources, blend relationships, use of parameters, LOD expressions, etc.

Sea Level Predictor

The objective of this project is to predict sea levels for 2050 based on a dataset of global sea levels since 1880.

This project in Jupyter Notebook uses Pandas, Numpy, Scipy and Matplotlib.

See repository.

Cardiac Diseases

These modules were developed to perform data analysis over a dataset of 70k patients. Variables like blood preasure, height, weigth, patologies, etc were examinated on them.

By analyzing demographic, medical and habits data we want to determine if they can lead into a cardiac disease.

See repository.

Blacklist System

This project was made for a Debt Collection team.

The objective of this project is to have a control of blocked phone numbers and emails for clients with problems in their credits.

It was implemented using Windows Forms automated with Excel Macros as GUI and connected to SQL Server database secured by Windows Authentication.

Git and Github for version control in all projects.

General automations with Power Automate, dataloader.io, DemandTools and Web Scraper Platform.