Дата просвещение

Search

Search IconIcon to open search

Last updated Aug 11, 2023 Edit Source

Created: 2022-10-31 21:10:49 Tags: #ML #programming #DataScience

# Note

What is end-to-end ML project? When i talk about “end-to-end” i mean these steps you need to go through before your project is ready:

  1. Create infrastructure for your project:
    • Set up remote server
    • Deploy all needed tools such as Jupyter (or Jupyterhub) for developing and quickly experiments running, MLFlow for saving models and experiments results, your own Docker registry, create K8S infrasctructure (if needed), Airflow for etl processes (data collection and processing), …
  2. Choose your system type: online (real-time system) or offline (batch) system.
  3. Choose a problem you want to solve (text santiment, face detection, analyze user behaviour, online/offline recommender system, …)
  4. Find or collect data and choose where you want to store data
  5. Preprocess data and generate features
  6. Experiments with model: training, validation, testing
  7. Deploy ML Pipeline (deploy with Docker Containers, create Airflow DAG, deploy with API, …)
  8. Set up model retraining if needed
  9. Create Web UI, or Mobile app, or API for your model
  10. Set up tools for monitoring your system (you can use Graphana for ex.):

After these steps you have end-to-end ML system with data collection, preprocessing, monitoring and model retraining.

# References