profile sourced from GitHub
Scouting report
Junior building modern RAG + OCR pipelines in Python
assessed from open-source footprint
Zindzi is an early-career data/ML developer from Barbados working on genuinely current problems — a RAG chatbot using a late-chunking strategy and OCR preprocessing with Docling, Tesseract, and EasyOCR. At 4 repos and 36 commits last year with no stars or external contributions yet, she's junior but clearly hands-on with today's AI tooling. A promising entry-level ML/data engineering candidate.
Authorship & open source
What they build
Industry experience
- Data, ML & AI
- Web & CMS
Signal breakdown
0
4
0% forks
0
1.6 yr
0
Active
0% stale
Strengths
- Verified author — wrote 100% of commits on OCR-Data-Preprocessing
- Consistently active, low abandonment
- Data / ML focus with Backend
- Domain experience in Data, ML & AI & Web & CMS
- Core stack: Python, Jupyter
About
Skills
- Python
- Jupyter
Featured work
OCR Data Preprocessing
Exploring optical character recognition w/ Docling, Tesseract and EasyOCR
- Python
by Zindzi McCollin
RAG Late Chunking AI Chatbot
AI + RAG chatbot using late chunking strategy
- Python
by Zindzi McCollin
Portfolio Project Testing
Testing and Exploring
- Jupyter
by Zindzi McCollin
RAG Context Retrieval AI Chatbot
AI + RAG using context retrieval with 3 different embedding models
- Python
by Zindzi McCollin