profile sourced from GitHub
Scouting report
Dominican data scientist publishing reusable pandas and dataset-similarity tools
assessed from open-source footprint
Juan is the clearest specialist here: a data/ML engineer (64% weight) working in Jupyter and Python, with the most credible original library in the batch. Pandas-Optimizer, which trims DataFrame dtypes for memory, holds 8 stars, and pysimilarity offers dataset-similarity methods. Followers (15) and a Kaggle presence add signal. A 0.8 abandoned ratio and no deployments are real caveats, but the authored, useful Python tooling makes this a genuine standout.
Authorship & open source
What they build
Industry experience
- Data, ML & AI
- Web & CMS
- Education & EdTech
Signal breakdown
9
top repo 8
15
25% forks
15
9.1 yr
0
Active
80% stale
Strengths
- Verified author — wrote 100% of commits on Pandas-Optimizer
- Original builder — 15 of their own repositories
- Data / ML focus with Backend
- Domain experience in Data, ML & AI & Web & CMS
- Core stack: Jupyter, Python, JavaScript
About
Skills
- Jupyter
- Python
- JavaScript
- Data Science
- Machine Learning
- Notebook
Featured work
Pandas Optimizer
Optimize pandas DataFrame data types for efficient memory use.
- Python
by Juan Alberto Nuñez Corporan
pysimilarity
Calculate similarity between datasets using several methods.
- Python
by Juan Alberto Nuñez Corporan
Thesis Project
You know
- Jupyter
by Juan Alberto Nuñez Corporan
Portfolio
Public Portfolio
- Jupyter
by Juan Alberto Nuñez Corporan
IPSDS
An interactive tutorial series for Data Science in Python.
- Jupyter
- Data Science
- Machine Learning
- Notebook
- Tutorial
by Juan Alberto Nuñez Corporan
TallerML
Primer commit
- Jupyter
by Juan Alberto Nuñez Corporan