Einstein M. Ebere.

Logo

My data science portfolio

View My GitHub Profile

Data Scientist | Machine Learning Researcher.

LinkedIn | X (FKA Twitter) | YouTube

Innovative Data Scientist, an expert in Python, SQL, PySpark, and data visualisation. Specialised in machine learning, NLP (BERT, RoBERTa, GPT), and Computer Vision (YOLO, ResNet, Diffusion Models), experienced in the use of vector databases for proficient similarity search and retrieval. Passionate about learning, and leveraging cutting-edge AI technologies including transformers and generative models. Skilled in building scalable, distributed solutions for complex data problems using Spark, driving insights and decision-making.

Skills

Databases: MySQL, PostgreSQL, MongoDB, ChromaDB
Programming & Data Processing: Python, SQL, PySpark, Hadoop.
API & MLOps: Flask, FastAPI, Streamlit, Docker, AWS, Azure, GCP, Render.
Data Visualisation & Analytics: Tableau, PowerBI, Matplotlib, Plotly, Seaborn.
Productivity Tools: MS Suite (Word, Excel, PowerPoint, Outlook), Jira, Slack, Teams.
Machine Learning & AI: PyTorck, Scikit-Learn, YOLO, Roboflow, Transformers, NLP, Computer Vision.
Non-Technical: Communication, attention to detail, self-motivation, collaboration, eagerness to learn.

Education

Research Experience

Optimising Medical Image Segmentation Through Attention Mechanisms and Custom Loss Functions @ Royal Holloway University of London.
(June 2024 - Aug 2024)

Highlight - Intensive two-month-long computer vision research focused on improving the accuracy of breast cancer, skin lesions, and lung segmentation.

Developed and applied advanced data preprocessing and augmentation techniques to enhance model performance and increase dataset diversity. Integrated CBAM and SEBlock attention mechanisms into the DeepLabV3+ architecture using PyTorch, significantly improving segmentation accuracy. Proposed and tested a new DiceBCE loss function, boosting model convergence and performance. Conducted in-depth analysis, demonstrating improvements in segmentation accuracy for breast cancer, skin lesions, and lung images. Currently preparing a research paper for publication, focusing on the impact of attention mechanisms in medical image segmentation.

Work Experience

Data Scientist @ Vault Hill (August - October 2023)
Collaborated on AI and data-driven initiatives, optimising workflows and delivering measurable business outcomes. By implementing scalable pipelines and fostering collaboration through clear documentation and insight-sharing, I ensured alignment between technical solutions and organisational goals while driving innovation and success.

Data Analyst & Backend Developer @ SunFi (Januray - June 2022)
Optimised system performance and streamlined workflows by implementing automation and scalable infrastructure solutions. Efforts included designing efficient database schemas, developing Slack bot templates, and integrating APIs for bug reporting, resulting in improved operational efficiency and team collaboration.

Projects

LookOutAI (Multi-model - Computer Vision & NLP)

LookOutAI is a sophisticated image recognition tool designed to enhance security and privacy by identifying individuals in photos or videos using advanced AI technology. It provides detailed descriptions of their actions or behaviour and offers versatile features such as selectively blurring faces to ensure privacy or un-censoring specific targets for clarity. Ideal for security applications, LookOutAI enables law enforcement or security teams to process video evidence with precision while safeguarding the privacy of uninvolved individuals.

System Ourput Sample

Robust Medical Image Segmentation - DeepLabsV3+ with Attention (Computer Vision)

Extensive research on optimising the DeepLabV3+ architecture for medical image segmentation. Through rigorous experimentation, I developed enhanced variants of DeepLabV3+ by integrating attention mechanisms, specifically Convolutional Block Attention Module (CBAM) and Squeeze-and-Excitation (SEBlock) attention, applied in parallel within the encoder. These attention-equipped models consistently outperformed the base DeepLabV3+ and other published research works across multiple medical imaging datasets, including breast cancer ultrasound, lung X-rays, and ISIC skin lesions 2017.

Lung Segmentation Predictions

Automated Attention Tracking and Reporting system (Computer Vision)

The Automated Attention Tracking and Reporting system is a sleek AI-driven dashboard that monitors and reports live distraction levels during classes, meetings, or lectures. Utilising advanced computer vision techniques to help educators and facilitators enhance focus and attention during sessions.

Distracted Image

Extractive, Abatractive document/text Summarising System. (NLP)

The document summarising system developed as my final year project for a B.Sc. in Software Engineering at Babcock University goes beyond basic summarisation. It accurately predicts and suggests the top 2 topics for any text input, detects the language of the text and features text-to-speech functionality to read summaries aloud. Additionally, users can download their summaries as Word documents (.docx).

214431479-aae08584-b96e-4934-a205-45a315d6cb94

Certifications (Proof of continuous learning)

Blogs

Volunteering

Interests and Activities

I am dedicated to staying at the forefront of artificial intelligence, I do this by regularly reading research papers and follow industry trends through newsletters such as TechCrunch, TL;DR as well as other popular tech blogs and company X and LinkedIn accounts.

Teaching

Passionate about teaching and sharing knowledge, I actively seek opportunities to educate others through small group meetings, workshops, and voluntary initiatives. I believe in fostering a collaborative learning environment, whether within my company or through community engagement, to inspire growth and empower individuals to reach their full potential.

Fitness and Working Out

I enjoy exploring various training methods, and maintaining a balanced lifestyle through activities such as running, working out as well as meal prep every now and then.

Hackathons

I enjoy hackathons. I recently attended and won a two-day AI hackathon at Heat Geeks AI where my team won the prize for Innovation by developing an AI tool to help homeowners and installers assess heat pump locations using satellite imagery, internal company API and OpenAI API. References Available on request.