1# Machine Learning μμ
2
3Machine_Learning ν΄λμ 14κ° λ μ¨μ ν΄λΉνλ μ€ν κ°λ₯ν Jupyter Notebook μμ μ
λλ€.
4
5## ν΄λ ꡬ쑰
6
7```
8examples/
9βββ 01_linear_regression.ipynb # μ ν νκ·
10βββ 02_logistic_regression.ipynb # λ‘μ§μ€ν± νκ·
11βββ 03_model_evaluation.ipynb # λͺ¨λΈ νκ° μ§ν
12βββ 04_cross_validation.ipynb # κ΅μ°¨ κ²μ¦
13βββ 05_preprocessing.ipynb # λ°μ΄ν° μ μ²λ¦¬
14βββ 06_decision_tree.ipynb # κ²°μ νΈλ¦¬
15βββ 07_random_forest.ipynb # λλ€ ν¬λ μ€νΈ
16βββ 08_xgboost_lightgbm.ipynb # XGBoost, LightGBM
17βββ 09_svm.ipynb # SVM (Support Vector Machine)
18βββ 10_knn_naive_bayes.ipynb # k-NN, λμ΄λΈ λ² μ΄μ¦
19βββ 11_clustering.ipynb # K-Means, DBSCAN
20βββ 12_pca.ipynb # PCA, t-SNE μ°¨μ μΆμ
21βββ 13_pipeline.ipynb # sklearn νμ΄νλΌμΈ
22βββ 14_kaggle_project.ipynb # μ€μ Kaggle νλ‘μ νΈ
23βββ datasets/ # μμ λ°μ΄ν°μ
24βββ README.md
25```
26
27## μ€ν λ°©λ²
28
29### νκ²½ μ€μ
30
31```bash
32# κ°μνκ²½ μμ± (κΆμ₯)
33python -m venv ml-env
34source ml-env/bin/activate # Windows: ml-env\Scripts\activate
35
36# νμν ν¨ν€μ§ μ€μΉ
37pip install numpy pandas matplotlib seaborn scikit-learn jupyter
38
39# XGBoost, LightGBM (08 λ μ¨μ©)
40pip install xgboost lightgbm
41```
42
43### Jupyter Notebook μ€ν
44
45```bash
46cd Machine_Learning/examples
47jupyter notebook
48
49# λλ JupyterLab
50jupyter lab
51```
52
53## λ μ¨λ³ μμ λͺ©λ‘
54
55| λ μ¨ | μ£Όμ | ν΅μ¬ λ΄μ© |
56|------|------|----------|
57| 01 | μ ν νκ· | λ¨μ/λ€μ€ νκ·, MSE, RΒ² |
58| 02 | λ‘μ§μ€ν± νκ· | μ΄μ§/λ€μ€ λΆλ₯, ROC-AUC |
59| 03 | λͺ¨λΈ νκ° | μ νλ, μ λ°λ, μ¬νμ¨, F1 |
60| 04 | κ΅μ°¨ κ²μ¦ | K-Fold, Stratified, GridSearchCV |
61| 05 | μ μ²λ¦¬ | μ€μΌμΌλ§, μΈμ½λ©, κ²°μΈ‘μΉ |
62| 06 | κ²°μ νΈλ¦¬ | νΈλ¦¬ μκ°ν, κ³Όμ ν© λ°©μ§ |
63| 07 | λλ€ ν¬λ μ€νΈ | λ°°κΉ
, OOB, νΉμ± μ€μλ |
64| 08 | XGBoost/LightGBM | κ·ΈλλμΈνΈ λΆμ€ν
, μ‘°κΈ° μ’
λ£ |
65| 09 | SVM | 컀λ νΈλ¦, νμ΄νΌνλ μΈ |
66| 10 | k-NN/λμ΄λΈ λ² μ΄μ¦ | 거리 κΈ°λ°, νλ₯ κΈ°λ° λΆλ₯ |
67| 11 | ν΄λ¬μ€ν°λ§ | K-Means, DBSCAN, μ€λ£¨μ£ |
68| 12 | μ°¨μ μΆμ | PCA, t-SNE, μ€λͺ
λΆμ° |
69| 13 | νμ΄νλΌμΈ | Pipeline, ColumnTransformer |
70| 14 | Kaggle νλ‘μ νΈ | Titanic, νΉμ± 곡ν |
71
72## νμ΅ μμ
73
741. **κΈ°μ΄**: 01 β 02 β 03 β 04 β 05
752. **νΈλ¦¬ λͺ¨λΈ**: 06 β 07 β 08
763. **κΈ°ν μκ³ λ¦¬μ¦**: 09 β 10
774. **λΉμ§λ νμ΅**: 11 β 12
785. **μ€μ **: 13 β 14
79
80## λ°μ΄ν°μ
81
82μμ μμ μ¬μ©νλ λ°μ΄ν°μ
:
83
84| λ°μ΄ν°μ
| μΆμ² | μ©λ |
85|---------|------|------|
86| Iris | sklearn | λΆλ₯ (λ€μ€ ν΄λμ€) |
87| Wine | sklearn | λΆλ₯ (λ€μ€ ν΄λμ€) |
88| California Housing | sklearn | νκ· |
89| Digits | sklearn | λΆλ₯ (μ΄λ―Έμ§) |
90| Titanic | Kaggle | λΆλ₯ (μ€μ ) |
91
92## νμ ν¨ν€μ§
93
94```
95numpy>=1.21.0
96pandas>=1.3.0
97matplotlib>=3.4.0
98seaborn>=0.11.0
99scikit-learn>=1.0.0
100jupyter>=1.0.0
101xgboost>=1.5.0 # 08 λ μ¨
102lightgbm>=3.3.0 # 08 λ μ¨
103```
104
105## μ°Έκ³ μλ£
106
107- [scikit-learn 곡μ λ¬Έμ](https://scikit-learn.org/stable/)
108- [Kaggle](https://www.kaggle.com/)
109- [Machine Learning Mastery](https://machinelearningmastery.com/)