Mirabito - Ata Gunaydin; Elif Yildrim; Ozguven Salih; Same Genc;: Difference between revisions
No edit summary (change visibility) |
No edit summary (change visibility) |
||
| Line 144: | Line 144: | ||
Fine-tune association rule thresholds and integrate top product combinations into campaign design. |
Fine-tune association rule thresholds and integrate top product combinations into campaign design. |
||
== Week 7 10/27/2025 == |
|||
'''Attendance:''' Samet GENC, Elif YILDIRIM, Ata GUNAYDIN, Salih OZGUVEN |
|||
'''Accomplishments:''' |
|||
'''Summary:''' |
|||
'''AI Loyalty Engine Team:''' |
|||
Implemented and integrated new machine learning algorithms to enhance customer segmentation within Promos. |
|||
The added models were trained on the existing datasets to improve segmentation accuracy and cluster interpretability. |
|||
Within Promos, product names are now properly mapped and displayed, ensuring more consistent data representation and facilitating model evaluation. |
|||
Preliminary tests indicate improved segmentation coherence and better alignment between predicted clusters and actual customer behavior patterns. |
|||
'''Data Infrastructure Team:''' |
|||
Revision as of 22:43, 27 October 2025
Week 1 09/09/2025
Attendance: Samet GENC, Elif YILDIRIM, Ata GUNAYDIN, Salih OZGUVEN
Summary: This week, we held our initial project meeting and outlined the scope of the two interconnected projects: the AI Loyalty Engine and the Data Infrastructure Modeling. We reviewed technical requirements, development tools, and environment setup instructions, including Node.js, Angular, and C#. Additionally, we discussed laptop preferences and extension configurations to ensure a smooth development workflow across platforms.
Accomplishments:
- Defined project responsibilities: AI Loyalty Engine vs. Data Infrastructure Modeling.
- Reviewed system architecture and collaboration needs between both groups.
- Shared setup instructions for Node.js, Angular CLI, and C#.
- Established development environment recommendations for Windows and Mac (Visual Studio / VS Code).
To-Do:
- Install required tools (Node.js, Angular CLI, VS/VS Code with extensions).
- Coordinate regular check-ins between both project teams.
Week 2 & Week 3 09/16/2025
Attendance: Samet GENC, Elif YILDIRIM, Ata GUNAYDIN, Salih OZGUVEN
Summary: This week, we focused on reviewing the provided datasets and understanding their features. We examined both the Customer Basket and Inventory data to explore potential correlations and identify missing elements necessary for modeling. Additionally, we analyzed the AI engine code from the repositories to gain insights into the existing implementation. Finally, we held discussions with Kaan Balta from last year’s team, who provided observations and context regarding the codebase and its structure.
Accomplishments:
- Reviewed Customer Basket dataset and Inventory data, working to identify correlations between them.
- Explored dataset features and clarified column definitions, missing values, and potential resolutions.
- Investigated the AI Loyalty Engine source code and associated programs in the repositories.
To-Do:
- Draft scope documents outlining tasks, timelines, milestones, constraints, and deliverables.
- Propose solutions for gaps identified in the Inventory dataset.
- Continue deep-diving into feature relationships between datasets to refine modeling strategy.
- Begin aligning Data Infrastructure outputs with AI Loyalty Engine requirements for integration.
Week 4 10/06/2025
Attendance: Samet GENC, Elif YILDIRIM, Ata GUNAYDIN, Salih OZGUVEN
Summary: This week, both teams made significant progress in shaping the foundation for the upcoming development phases. The Data Infrastructure team focused on creating a sustainable and refined mock dataset to support the AI Loyalty Engine. All columns and their importance were discussed in detail, and several new features (e.g., `expiration_date`, `seasons`, `day_time`) were engineered. Features with unclear functionality were noted for expert consultation. Meanwhile, the AI Loyalty Engine team drafted improvement options for the model, researched multiple algorithmic approaches, and mapped key integration points within the existing system. Additionally, both teams collaborated to prepare a comprehensive Project Scope Document, clarifying requirements, milestones, and objectives for the semester.
Accomplishments:
Data Infrastructure Team:
- Initiated the creation of a sustainable mock dataset for AI engine development.
- Discussed and evaluated all dataset columns based on their relevance to the model.
- Designed new features (`expiration_date`, `seasons`, `day_time`) to enhance predictive capacity.
- Identified uncertain features for expert validation.
AI Loyalty Engine Team:
- Drafted Improvement Options v0.1, researched potential models, and mapped code/extension points.
- Explored a range of candidate models: ALS, BPR-MF, LightGCN, SAR, Two-Tower , SASRec/BERT4Rec, LightGBM/XGBoost, DeepFM/DLRM, LambdaMART, ILP/Knapsack, and Uplift/CATE approaches.
- Created key artifacts including: Improvements Document, Model Survey, and Code Map.
- Defined project scope: AI engine will propose weekly campaign offers for Marketing (not real-time per user).
Collaboration:
- Conducted a joint session to document requirements and structure a unified project plan.
- Developed a detailed Project Scope Document outlining tasks, deliverables, and shared milestones.
To-Do:
Data Infrastructure Team:
- Finalize the feature schema (expiry, inventory, margin, supplier funds).
- Validate mock dataset quality and integrate with AI engine inputs.
AI Loyalty Engine Team:
- Prototype Two-Tower retrieval model and train LightGBM baseline.
- Draft portfolio optimizer and design initial geo A/B testing plan.
General:
- Address open risks, ensure reliability of inventory/expiry data, confirm store clustering and segmentations, and align objectives on profit uplift and waste reduction.
Week 5 10/13/2025
Attendance: Samet GENC, Elif YILDIRIM, Ata GUNAYDIN, Salih OZGUVEN
Summary: This week, the Data Infrastructure team validated refined transaction data and finalized schema definitions along with ER diagram, while the AI Loyalty Engine team completed the hybrid model integration and dashboard setup.
Accomplishments:
Data Infrastructure Team:
Implemented a data checklist script for structure, numeric, and key integrity checks.
Finalized the POS–Inventory One-Page Schema.
AI Loyalty Engine Team:
Built a hybrid recommendation system combining LightGBM, SAR, and rule-based scoring.
Integrated business logic: margin and discount limits, stock and expiry tracking.
Delivered the API, real-time dashboard, and CSV export tools.
To-Do:
Integrate continuous data validation into ETL flow.
Create a separate table for detailed inventory data to support stock-level tracking.
Tune model weights and perform end-to-end integration testing.
Week 6 10/20/2025
Attendance: Samet GENC, Elif YILDIRIM, Ata GUNAYDIN, Salih OZGUVEN
Summary: This week marked significant progress in data refinement and feature integration across both teams. The Data Infrastructure group focused on extensive data cleaning to improve efficiency and model readiness, while the AI Loyalty Engine team implemented dynamic promotional logic and refined customer segmentation. The teams also collaborated to align datasets and ensure seamless system integration, paving the way for advanced analytical modeling in the coming weeks.
Accomplishments:
Data Infrastructure Team:
Performed large-scale data cleaning by removing deposit/fees, fuel purchases, age verification, non-merchandise items, and car wash entries.
Successfully reduced dataset size by more than half while maintaining data consistency and analytical depth.
Applied previously tested feature engineering and time-series transformations to the real dataset.
Implemented association rule mining using the Apriori algorithm to identify frequently co-purchased product combinations.
Conducted exploratory analysis to examine purchasing traffic before, during, and after holidays.
AI Loyalty Engine Team:
Integrated the system with the newly cleaned and standardized dataset from the Data Infrastructure team.
Handled missing values, unified data formats, and validated readiness for modeling.
Implemented holiday- and season-based promotional logic within the system.
Designed adaptive mechanisms for dynamically adjusting promotional campaigns based on time-sensitive factors such as holidays and seasonal demand.
Categorized customers based on engagement activity into segments including Active, At-Risk, and Churn.
To-Do:
Continue optimizing data quality and explore further feature correlations for deeper insights.
Fine-tune association rule thresholds and integrate top product combinations into campaign design.
Week 7 10/27/2025
Attendance: Samet GENC, Elif YILDIRIM, Ata GUNAYDIN, Salih OZGUVEN
Accomplishments:
Summary:
AI Loyalty Engine Team:
Implemented and integrated new machine learning algorithms to enhance customer segmentation within Promos. The added models were trained on the existing datasets to improve segmentation accuracy and cluster interpretability. Within Promos, product names are now properly mapped and displayed, ensuring more consistent data representation and facilitating model evaluation. Preliminary tests indicate improved segmentation coherence and better alignment between predicted clusters and actual customer behavior patterns.
Data Infrastructure Team: