Guim Casadellà
Data Scientist - McKinsey & Company
Data Science, Machine Learning and AI
As a passionate professional in the fields of Data Science, Machine Learning, and AI, I believe in the power of technology to solve complex problems and drive innovation. I’m highly interested in the research, design, and development of advanced solutions.
Below are some of the projects I’ve worked on that best showcase my skills and approach to these fields.
Featured Projects and Experience
1. DatathonFME 2024 WINNING PROJECT - Design Decoder MANGO challenge
- Awards: First prize winners for this datathon’s challenge.
- Overview: Design Decoder is an AI tool aimed at making the life of designers easier by automating the process of logging new garment samples into digital systems. The model extracts design attributes from product images and metadata using a fashion-pretrained CLIP model and traditional AI models for classification.
- Technologies Used: Python, PyTorch, CLIP, XGBoost, OpenCV, Streamlit, NumPy, Pandas, Scikit-learn
- Key Highlights: Image Embeddings, Attribute Classification, Computer Vision, AI Automation
-
2. HackUPC 2025 - DataCenter-DeCoder
- Awards: Second prize winners for this hackathon’s challenge.
- Overview: DataCenter-DeCoder is an interactive tool for designing and optimizing data center configurations. It generates valid layouts that satisfy complex constraints and optimize objectives, using modules such as transformers, water supply units, and processors. The tool combines a drag-and-drop interface with powerful optimization algorithms for resource management and layout validation.
- Technologies Used: Python, FastAPI, React, TypeScript, Vite, MongoDB
- Key Highlights: Constraint-based Optimization, Interactive Module Placement, Resource Management, Data Visualization
- GitHub link to the project
3. Bank to Notion Finance OS
- Awards: Personal project to integrate bank transaction data directly into a Notion finance dashboard.
- Overview: Bank to Notion Finance OS automates the upload, processing, and categorization of bank transactions from CSV files into a complete Notion financial dashboard. Born from the frustration of manual data entry, the project streamlines personal finance tracking with a productivity-focused UI and keyword-based categorization.
- Technologies Used: Python, FastAPI, Vue.js 3, Vite, TailwindCSS, Notion API
- Key Highlights: Notion Integration, Transaction Automation, CSV Parsing, Budget Tracking, Keyword-based Categorization
- GitHub link to the project
4. Fashion Sales prediction - Machine Learning Course Final Project
- Overview: This Data Science project explores the viability of predicting sales of fashion articles from a multimodal dataset using traditional ML techniques. It develops several techniques to extract insigths from product metadata, images and temporal features.
- Technologies Used: Python, Pandas, PyTorch, OpenCV, Matplotlib, Scikit-learn, XGBoost, SMOTE
- Key Highlights: Image Embeddings, BagOfVisualWords, TimeSeries, Bootstrapping, Bagging, Regression, Classification
-
5. Reward Optimization in Semantic Segmentation - CVC Internship
- Institution: Computer Vision Center (CVC)
- Internship Period: January 24 - July 10
- Overview: Developed a reward optimization framework for semantic segmentation tasks during my internship at CVC. This project involved creating algorithms to improve the accuracy and efficiency of semantic segmentation models.
- Technologies Used: Python, Pytorch, TensorBoard, OpenCV, OpenMMLab, DeepLabv3
- Key Highlights: Semantic Segmentation, Reward Optimization, Model Efficiency, Algorithm Development
- GitHub link to the project
6. LoRA for Semantic Segmentation Domain Adaptation - CVC Internship
- Institution: Computer Vision Center (CVC)
- Internship Period: January 24 - July 10
- Overview: Focused on domain adaptation for semantic segmentation using the LoRA (Low-Rank Adaptation) technique. The project aimed to improve model performance when transferring knowledge from synthetic to real-world datasets.
- Technologies Used: Python, Pytorch, SegFormer-B0, LoRA, HuggingFace
- Key Highlights: Domain Adaptation, Semantic Segmentation, Transformer-based Architecture, Low-Rank Adaptation
-
7. HackUPC 2024 - Inditex TECH Challenge
- Overview: Developed a solution to detect duplicate or similar images in a dataset, a clothing recommendation system based on the outfit provided by the user, and an assistant system to manage out-of-stock problems for Inditex. Utilized the pre-trained model CLIP for understanding image context and semantic segmentation with a U-NET model for detecting and segmenting clothes in images.
- Technologies Used: Python, Pytorch, Huggingface, CLIP, U-NET, Semantic Segmentation
- Key Highlights: Image Embedding, Contextual Understanding, Clothing Recommendation, Semantic Segmentation, FrontEnd
-
8. CriminalMap
- Overview: Developed a crime mapping app using Python and C. Implemented A* algorithm variation for pathfinding. When computing the best route between two points, it takes in account real time crime data in order to suggest the fastest & safest route.
- Technologies Used: Python, C, Overpass API, Selenium + BeautifulSoup, Google Maps API, Matplotlib
- Key Highlights: Algorithm Design (A*), Data Flow, Data Parsing, Geospatial Analysis, Data Visualization, Web scraping.
- GitHub link to the project
9. NovartisDatathon 2023
- Overview: Developed a ML model for forecasting a company’s sales in the healthcare industry. Focused on Time Series Analysis for extracting information of time-dependent features such as trends, seasons.
- Technologies Used: Python, Pandas, LightGBM, Latitude
- Key Highlights: Time Series Forecasting, Data cleaning, Lag features, Rolling Features, Feature engineering, LightGBM model
- GitHub link to the project
10. Datathon FME 2023 - Outfit generator
- Overview: In the Datathon FME 2023, our team developed an innovative AI tool that recommends outfits aligning with a brand’s core philosophy. The challenge was to integrate diverse data types - from tabular data to image features. Our solution involved merging product data with features extracted using a Convolutional Neural Network (CNN), feeding this rich dataset into a Transformer model to generate brand-aligned outfit suggestions.
- Technologies Used: Python, Pandas, TensorFlow, Matplotlib, Jupyter Notebook
- Key Highlights: Data cleaning, Data preprocessing, Feature extraction, Dimensionality reduction, Fill-in-the-blank, Convolutional Neural Networks, Transformer model, Deep learning
- GitHub link to the project
For more projects, check out my GitHub repositories.
🛠 Skills & Technologies
- Programming Languages: Python, R, C++, C, Dart, Java
- Data Analysis Tools: Pandas, NumPy, SciPy
- Machine Learning Libraries: Scikit-Learn, TensorFlow, Keras, Pytorch
- Deep Learning Frameworks: OpenMMLab, SLM Lab, HuggingFace, OpenAI Gym
- Data Visualization: Matplotlib, Seaborn, Tensorboard
- Database Management: SQL, NoSQL, Firebase
- Software Frameworks: Flutter, Django
- Others: Git, Jupyter Notebooks
🌐 Connect With Me And Check My Other Work
I’m always open to discussing data science and new opportunities. Let’s connect!
- LinkedIn: guimCC
- HuggingFace: guimCC
- Email: guim.casadella@gmail.com