09_svm.ipynb - Examples

  1{
  2 "cells": [
  3  {
  4   "cell_type": "markdown",
  5   "id": "cell-0",
  6   "metadata": {},
  7   "source": [
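  8    "# 09. Support Vector Machine (SVM)\n",
  9    "\n",
 10    "## Learning Objectives\n",
 11    "- Understand the margin-maximization principle behind SVMs\n",
 12    "- Learn the role of support vectors\n",
 13    "- Solve nonlinear problems with the kernel trick\n",
 14    "- Tune the hyperparameters C and gamma\n",
 15    "- Solve regression problems with SVR"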
 16   ]
 17  },
 18  {
 19   "cell_type": "code",
 20   "execution_count": null,
 21   "id": "cell-1",
 22   "metadata": {},
 23   "outputs": [],
 24   "source": [
 25    "# Import libraries\n",
 26    "import numpy as np\n",
 27    "import pandas as pd\n",
 28    "import matplotlib.pyplot as plt\n",
 29    "from sklearn import svm\n",
 30    "from sklearn.svm import SVC, SVR, LinearSVC\n",
 31    "from sklearn.datasets import (\n",
 32    "    make_blobs, make_classification, make_moons, make_circles,\n",
 33    "    load_iris, load_breast_cancer, load_diabetes\n",
 34    ")\n",
 35    "from sklearn.model_selection import train_test_split, GridSearchCV, cross_val_score\n",
 36    "from sklearn.preprocessing import StandardScaler\n",
 37    "from sklearn.metrics import accuracy_score, classification_report, mean_squared_error, r2_score\n",
 38    "\n",
 39    "# Plot font settings (all plot labels in this notebook are in English)\n",
 40    "plt.rcParams['font.family'] = 'DejaVu Sans'\n",
 41    "plt.rcParams['axes.unicode_minus'] = False\n",
 42    "np.random.seed(42)"
 43   ]
 44  },
 45  {
 46   "cell_type": "markdown",
 47   "id": "cell-2",
 48   "metadata": {},
 49   "source": [
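 50    "## 1. Linear SVM - Margin Maximization\n",
 51    "\n",
 52    "The core idea of an SVM is to find the optimal hyperplane that separates two classes.\n",
 53    "It maximizes the margin, the distance from the hyperplane to the nearest training points, which tends to improve generalization."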
 54   ]
 55  },
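     {
      "cell_type": "markdown",
      "id": "cell-2a",
      "metadata": {},
      "source": [
       "As a sketch of what margin maximization means, here is the standard hard-margin formulation. The symbols $w$, $b$, $x_i$ and the labels $y_i \\in \\{-1, +1\\}$ are notation assumed for this note, not names from the notebook's code. The separating hyperplane is $w^\\top x + b = 0$, the margin boundaries are $w^\\top x + b = \\pm 1$, and the distance between them is $2 / \\|w\\|$, so maximizing the margin is equivalent to:\n",
       "\n",
       "$$\n",
       "\\min_{w,\\, b} \\ \\frac{1}{2} \\|w\\|^2 \\quad \\text{subject to} \\quad y_i (w^\\top x_i + b) \\ge 1, \\quad i = 1, \\dots, n\n",
       "$$\n",
       "\n",
       "The training points that satisfy the constraint with equality are the support vectors."
      ]
     },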
 56  {
 57   "cell_type": "code",
 58   "execution_count": null,
 59   "id": "cell-3",
 60   "metadata": {},
 61   "outputs": [],
 62   "source": [
 63    "# Generate linearly separable data\n",
 64    "X, y = make_blobs(n_samples=100, centers=2, random_state=6)\n",
 65    "\n",
 66    "# Train a linear SVM (a large C approximates a hard margin)\n",
 67    "clf = svm.SVC(kernel='linear', C=1000)\n",
 68    "clf.fit(X, y)\n",
 69    "\n",
 70    "print(f\"Number of support vectors: {len(clf.support_vectors_)}\")\n",
 71    "print(f\"Weights (w): {clf.coef_}\")\n",
 72    "print(f\"Intercept (b): {clf.intercept_}\")"
 73   ]
 74  },
 75  {
 76   "cell_type": "code",
 77   "execution_count": null,
 78   "id": "cell-4",
 79   "metadata": {},
 80   "outputs": [],
 81   "source": [
 82    "# Visualize the decision boundary and margins\n",
 83    "plt.figure(figsize=(10, 8))\n",
 84    "\n",
 85    "# Data points\n",
 86    "plt.scatter(X[:, 0], X[:, 1], c=y, cmap='coolwarm', s=100, edgecolors='black')\n",
 87    "\n",
 88    "# Read the current axis limits for the boundary and margin plot\n",
 89    "ax = plt.gca()\n",
 90    "xlim = ax.get_xlim()\n",
 91    "ylim = ax.get_ylim()\n",
 92    "\n",
 93    "# Build an evaluation grid\n",
 94    "xx = np.linspace(xlim[0], xlim[1], 30)\n",
 95    "yy = np.linspace(ylim[0], ylim[1], 30)\n",
 96    "YY, XX = np.meshgrid(yy, xx)\n",
 97    "xy = np.vstack([XX.ravel(), YY.ravel()]).T\n",
 98    "Z = clf.decision_function(xy).reshape(XX.shape)\n",
 99    "\n",
100    "# Draw the decision boundary (level 0) and the margins (levels -1 and +1)\n",
101    "ax.contour(XX, YY, Z, colors='k', levels=[-1, 0, 1],\n",
102    "           linestyles=['--', '-', '--'], linewidths=[1, 2, 1])\n",
103    "\n",
104    "# Highlight the support vectors\n",
105    "ax.scatter(clf.support_vectors_[:, 0], clf.support_vectors_[:, 1],\n",
106    "           s=200, linewidth=2, facecolors='none', edgecolors='green',\n",
107    "           label='Support Vectors')\n",
108    "\n",
109    "plt.xlabel('Feature 1')\n",
110    "plt.ylabel('Feature 2')\n",
111    "plt.title('Linear SVM: Maximum Margin Classifier')\n",
112    "plt.legend()\n",
113    "plt.show()"
114   ]
115  },
116  {
117   "cell_type": "markdown",
118   "id": "cell-5",
119   "metadata": {},
120   "source": [
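121    "## 2. Soft Margin - The C Parameter\n",
122    "\n",
123    "Real-world data is rarely perfectly linearly separable.\n",
124    "The C parameter controls the trade-off between misclassification and margin width.\n",
125    "\n",
126    "- **Large C**: heavy penalty on misclassification → narrow margin, risk of overfitting\n",
127    "- **Small C**: tolerates misclassification → wide margin, often better generalization (too small a C can underfit)"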
128   ]
129  },
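     {
      "cell_type": "markdown",
      "id": "cell-5a",
      "metadata": {},
      "source": [
       "The trade-off above can be written as the standard soft-margin objective. This is a sketch: the slack variables $\\xi_i$ and the symbols $w$, $b$, $x_i$, $y_i \\in \\{-1, +1\\}$ are notation assumed here, not names from the code.\n",
       "\n",
       "$$\n",
       "\\min_{w,\\, b,\\, \\xi} \\ \\frac{1}{2} \\|w\\|^2 + C \\sum_{i=1}^{n} \\xi_i \\quad \\text{subject to} \\quad y_i (w^\\top x_i + b) \\ge 1 - \\xi_i, \\quad \\xi_i \\ge 0\n",
       "$$\n",
       "\n",
       "Each $\\xi_i$ measures how far sample $i$ falls on the wrong side of its margin boundary. A larger $C$ makes these violations more costly, which is why the margin narrows as $C$ grows."
      ]
     },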
130  {
131   "cell_type": "code",
132   "execution_count": null,
133   "id": "cell-6",
134   "metadata": {},
135   "outputs": [],
136   "source": [
137    "# Data with label noise\n",
138    "X, y = make_classification(\n",
139    "    n_samples=200, n_features=2, n_redundant=0,\n",
140    "    n_informative=2, n_clusters_per_class=1,\n",
141    "    flip_y=0.1,  # randomly reassign about 10% of the labels (label noise)\n",
142    "    random_state=42\n",
143    ")\n",
144    "\n",
145    "# Compare several values of C\n",
146    "fig, axes = plt.subplots(1, 3, figsize=(15, 5))\n",
147    "C_values = [0.1, 1, 100]\n",
148    "\n",
149    "for ax, C in zip(axes, C_values):\n",
150    "    clf = svm.SVC(kernel='linear', C=C)\n",
151    "    clf.fit(X, y)\n",
152    "\n",
153    "    # Decision boundary\n",
154    "    xlim = [X[:, 0].min() - 0.5, X[:, 0].max() + 0.5]\n",
155    "    ylim = [X[:, 1].min() - 0.5, X[:, 1].max() + 0.5]\n",
156    "    xx, yy = np.meshgrid(np.linspace(xlim[0], xlim[1], 100),\n",
157    "                         np.linspace(ylim[0], ylim[1], 100))\n",
158    "\n",
159    "    Z = clf.decision_function(np.c_[xx.ravel(), yy.ravel()])\n",
160    "    Z = Z.reshape(xx.shape)\n",
161    "\n",
162    "    ax.contourf(xx, yy, Z, alpha=0.3, cmap='coolwarm')\n",
163    "    ax.contour(xx, yy, Z, colors='k', levels=[-1, 0, 1],\n",
164    "               linestyles=['--', '-', '--'])\n",
165    "    ax.scatter(X[:, 0], X[:, 1], c=y, cmap='coolwarm', edgecolors='black')\n",
166    "    ax.scatter(clf.support_vectors_[:, 0], clf.support_vectors_[:, 1],\n",
167    "               s=150, facecolors='none', edgecolors='green', linewidths=2)\n",
168    "    ax.set_title(f'C = {C}\\nSupport Vectors: {len(clf.support_vectors_)}')\n",
169    "\n",
170    "plt.tight_layout()\n",
171    "plt.show()"
172   ]
173  },
174  {
175   "cell_type": "markdown",
176   "id": "cell-7",
177   "metadata": {},
178   "source": [
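179    "## 3. Kernel Trick - Nonlinear Classification\n",
180    "\n",
181    "A kernel function implicitly maps the data into a higher-dimensional space, so a linear separator there can capture nonlinear patterns in the original space.\n",
182    "\n",
183    "Common kernels:\n",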
184    "- **linear**: K(x, y) = x·y\n",
185    "- **polynomial**: K(x, y) = (γ·x·y + r)^d\n",
186    "- **rbf** (Gaussian): K(x, y) = exp(-γ||x - y||²)\n",
187    "- **sigmoid**: K(x, y) = tanh(γ·x·y + r)"
188   ]
189  },
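     {
      "cell_type": "markdown",
      "id": "cell-7a",
      "metadata": {},
      "source": [
       "As a small worked example of the kernel trick, assume 2-D inputs and the polynomial kernel with $\\gamma = 1$, $r = 0$, $d = 2$. The feature map $\\phi$ is notation introduced for this note.\n",
       "\n",
       "$$\n",
       "K(x, y) = (x \\cdot y)^2 = (x_1 y_1 + x_2 y_2)^2 = x_1^2 y_1^2 + 2 x_1 x_2 y_1 y_2 + x_2^2 y_2^2 = \\phi(x) \\cdot \\phi(y), \\qquad \\phi(x) = (x_1^2, \\ \\sqrt{2}\\, x_1 x_2, \\ x_2^2)\n",
       "$$\n",
       "\n",
       "The kernel returns the inner product in the 3-D feature space without ever computing $\\phi$. For a binary problem, a trained kernel SVM then classifies a point by the sign of\n",
       "\n",
       "$$\n",
       "f(x) = \\sum_{i \\in \\mathrm{SV}} \\alpha_i \\, y_i \\, K(x_i, x) + b\n",
       "$$\n",
       "\n",
       "where the sum runs over the support vectors and $\\alpha_i > 0$ are the learned dual coefficients."
      ]
     },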
190  {
191   "cell_type": "code",
192   "execution_count": null,
193   "id": "cell-8",
194   "metadata": {},
195   "outputs": [],
196   "source": [
197    "# Generate nonlinear data\n",
198    "X_moons, y_moons = make_moons(n_samples=200, noise=0.1, random_state=42)\n",
199    "X_circles, y_circles = make_circles(n_samples=200, noise=0.1, factor=0.5, random_state=42)\n",
200    "\n",
201    "# Compare kernels\n",
202    "kernels = ['linear', 'poly', 'rbf']\n",
203    "\n",
204    "fig, axes = plt.subplots(2, 3, figsize=(15, 10))\n",
205    "\n",
206    "for row, (X_data, y_data, name) in enumerate([(X_moons, y_moons, 'Moons'),\n",
207    "                                                (X_circles, y_circles, 'Circles')]):\n",
208    "    for col, kernel in enumerate(kernels):\n",
209    "        ax = axes[row, col]\n",
210    "\n",
211    "        # Train the SVM\n",
212    "        if kernel == 'poly':\n",
213    "            clf = svm.SVC(kernel=kernel, degree=3, gamma='scale')\n",
214    "        else:\n",
215    "            clf = svm.SVC(kernel=kernel, gamma='scale')\n",
216    "        clf.fit(X_data, y_data)\n",
217    "\n",
218    "        # Decision boundary\n",
219    "        xlim = [X_data[:, 0].min() - 0.5, X_data[:, 0].max() + 0.5]\n",
220    "        ylim = [X_data[:, 1].min() - 0.5, X_data[:, 1].max() + 0.5]\n",
221    "        xx, yy = np.meshgrid(np.linspace(xlim[0], xlim[1], 100),\n",
222    "                             np.linspace(ylim[0], ylim[1], 100))\n",
223    "\n",
224    "        Z = clf.predict(np.c_[xx.ravel(), yy.ravel()])\n",
225    "        Z = Z.reshape(xx.shape)\n",
226    "\n",
227    "        ax.contourf(xx, yy, Z, alpha=0.3, cmap='coolwarm')\n",
228    "        ax.scatter(X_data[:, 0], X_data[:, 1], c=y_data, cmap='coolwarm', edgecolors='black')\n",
229    "        ax.set_title(f'{name} - {kernel}\\nTraining accuracy: {clf.score(X_data, y_data):.3f}')\n",
230    "\n",
231    "plt.tight_layout()\n",
232    "plt.show()"
233   ]
234  },
235  {
236   "cell_type": "markdown",
237   "id": "cell-9",
238   "metadata": {},
239   "source": [
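240    "## 4. RBF Kernel and the gamma Parameter\n",
241    "\n",
242    "In the RBF kernel, gamma determines how far the influence of each training point reaches.\n",
243    "\n",
244    "- **Large gamma**: narrow influence → complex boundary, risk of overfitting\n",
245    "- **Small gamma**: wide influence → simple boundary, risk of underfitting"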
246   ]
247  },
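     {
      "cell_type": "markdown",
      "id": "cell-9a",
      "metadata": {},
      "source": [
       "To make the reach of gamma concrete, here is a worked calculation (hand arithmetic, not output from the notebook). For two points at squared distance $\\|x - y\\|^2 = 1$, the RBF kernel value is $K(x, y) = e^{-\\gamma}$:\n",
       "\n",
       "$$\n",
       "\\gamma = 0.1: \\ e^{-0.1} \\approx 0.905, \\qquad \\gamma = 1: \\ e^{-1} \\approx 0.368, \\qquad \\gamma = 10: \\ e^{-10} \\approx 4.5 \\times 10^{-5}, \\qquad \\gamma = 100: \\ e^{-100} \\approx 3.7 \\times 10^{-44}\n",
       "$$\n",
       "\n",
       "With a large gamma, a training point has almost no influence on anything but its immediate neighborhood. Written with a Gaussian width $\\sigma$ (notation assumed here), $\\gamma = 1 / (2 \\sigma^2)$, so a large gamma corresponds to a narrow Gaussian around each training point."
      ]
     },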
248  {
249   "cell_type": "code",
250   "execution_count": null,
251   "id": "cell-10",
252   "metadata": {},
253   "outputs": [],
254   "source": [
255    "# Visualize the effect of gamma\n",
256    "fig, axes = plt.subplots(1, 4, figsize=(20, 5))\n",
257    "gamma_values = [0.1, 1, 10, 100]\n",
258    "\n",
259    "X, y = make_moons(n_samples=200, noise=0.1, random_state=42)\n",
260    "\n",
261    "for ax, gamma in zip(axes, gamma_values):\n",
262    "    clf = svm.SVC(kernel='rbf', gamma=gamma, C=1)\n",
263    "    clf.fit(X, y)\n",
264    "\n",
265    "    xlim = [X[:, 0].min() - 0.5, X[:, 0].max() + 0.5]\n",
266    "    ylim = [X[:, 1].min() - 0.5, X[:, 1].max() + 0.5]\n",
267    "    xx, yy = np.meshgrid(np.linspace(xlim[0], xlim[1], 100),\n",
268    "                         np.linspace(ylim[0], ylim[1], 100))\n",
269    "\n",
270    "    Z = clf.predict(np.c_[xx.ravel(), yy.ravel()])\n",
271    "    Z = Z.reshape(xx.shape)\n",
272    "\n",
273    "    ax.contourf(xx, yy, Z, alpha=0.3, cmap='coolwarm')\n",
274    "    ax.scatter(X[:, 0], X[:, 1], c=y, cmap='coolwarm', edgecolors='black')\n",
275    "    ax.scatter(clf.support_vectors_[:, 0], clf.support_vectors_[:, 1],\n",
276    "               s=100, facecolors='none', edgecolors='green', linewidths=2)\n",
277    "    ax.set_title(f'gamma = {gamma}\\nSVs: {len(clf.support_vectors_)}')\n",
278    "\n",
279    "plt.tight_layout()\n",
280    "plt.show()"
281   ]
282  },
283  {
284   "cell_type": "markdown",
285   "id": "cell-11",
286   "metadata": {},
287   "source": [
288    "## 5. SVC - Classifying Real Data\n",
289    "\n",
290    "We perform multiclass classification on the Iris dataset.\n",
291    "**Important**: SVMs are sensitive to feature scale, so scaling is essential."
292   ]
293  },
294  {
295   "cell_type": "code",
296   "execution_count": null,
297   "id": "cell-12",
298   "metadata": {},
299   "outputs": [],
300   "source": [
301    "# Load the data\n",
302    "iris = load_iris()\n",
303    "X, y = iris.data, iris.target\n",
304    "\n",
305    "# Split into training and test sets\n",
306    "X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)\n",
307    "\n",
308    "# Scale the features (fit on the training set only; SVMs are scale-sensitive)\n",
309    "scaler = StandardScaler()\n",
310    "X_train_scaled = scaler.fit_transform(X_train)\n",
311    "X_test_scaled = scaler.transform(X_test)\n",
312    "\n",
313    "# Train the SVM\n",
314    "svm_clf = SVC(\n",
315    "    C=1.0,\n",
316    "    kernel='rbf',\n",
317    "    gamma='scale',\n",
318    "    probability=True,  # enable probability estimates\n",
319    "    random_state=42\n",
320    ")\n",
321    "svm_clf.fit(X_train_scaled, y_train)\n",
322    "\n",
323    "# Predict\n",
324    "y_pred = svm_clf.predict(X_test_scaled)\n",
325    "\n",
326    "print(\"SVM classification results:\")\n",
327    "print(f\"  Accuracy: {accuracy_score(y_test, y_pred):.4f}\")\n",
328    "print(f\"  Number of support vectors: {len(svm_clf.support_vectors_)}\")\n",
329    "print(\"\\nClassification report:\")\n",
330    "print(classification_report(y_test, y_pred, target_names=iris.target_names))"
331   ]
332  },
333  {
334   "cell_type": "code",
335   "execution_count": null,
336   "id": "cell-13",
337   "metadata": {},
338   "outputs": [],
339   "source": [
340    "# Predicted probabilities\n",
341    "y_proba = svm_clf.predict_proba(X_test_scaled[:5])\n",
342    "\n",
343    "print(\"Predicted probabilities (first 5):\")\n",
344    "print(f\"Classes: {iris.target_names}\")\n",
345    "print(y_proba)\n",
346    "print(f\"\\nPredicted classes: {y_pred[:5]}\")\n",
347    "print(f\"Actual classes: {y_test[:5]}\")"
348   ]
349  },
350  {
351   "cell_type": "markdown",
352   "id": "cell-14",
353   "metadata": {},
354   "source": [
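355    "## 6. The Importance of Scaling\n",
356    "\n",
357    "SVMs rely on distances and inner products between samples, so features with larger scales dominate and performance can degrade when feature scales differ."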
358   ]
359  },
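     {
      "cell_type": "markdown",
      "id": "cell-14a",
      "metadata": {},
      "source": [
       "For reference, this is the transform that StandardScaler applies to each feature $j$. The symbols $\\mu_j$ and $\\sigma_j$ are notation assumed here for the mean and standard deviation of that feature:\n",
       "\n",
       "$$\n",
       "z_j = \\frac{x_j - \\mu_j}{\\sigma_j}\n",
       "$$\n",
       "\n",
       "Both $\\mu_j$ and $\\sigma_j$ are estimated from the training set only and then reused for the test set, which is why the code calls `fit_transform` on the training data and `transform` on the test data."
      ]
     },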
360  {
361   "cell_type": "code",
362   "execution_count": null,
363   "id": "cell-15",
364   "metadata": {},
365   "outputs": [],
366   "source": [
367    "# Compare results with and without scaling\n",
368    "cancer = load_breast_cancer()\n",
369    "X_train_c, X_test_c, y_train_c, y_test_c = train_test_split(\n",
370    "    cancer.data, cancer.target, test_size=0.2, random_state=42\n",
371    ")\n",
372    "\n",
373    "# Without scaling\n",
374    "svm_no_scale = SVC(kernel='rbf', C=1, gamma='scale')\n",
375    "svm_no_scale.fit(X_train_c, y_train_c)\n",
376    "acc_no_scale = svm_no_scale.score(X_test_c, y_test_c)\n",
377    "\n",
378    "# With scaling\n",
379    "scaler = StandardScaler()\n",
380    "X_train_c_scaled = scaler.fit_transform(X_train_c)\n",
381    "X_test_c_scaled = scaler.transform(X_test_c)\n",
382    "\n",
383    "svm_scaled = SVC(kernel='rbf', C=1, gamma='scale')\n",
384    "svm_scaled.fit(X_train_c_scaled, y_train_c)\n",
385    "acc_scaled = svm_scaled.score(X_test_c_scaled, y_test_c)\n",
386    "\n",
387    "print(\"Effect of scaling:\")\n",
388    "print(f\"  Without scaling: {acc_no_scale:.4f}\")\n",
389    "print(f\"  With scaling:    {acc_scaled:.4f}\")\n",
390    "print(f\"  Improvement:     {(acc_scaled - acc_no_scale) * 100:.2f} percentage points\")"
391   ]
392  },
393  {
394   "cell_type": "markdown",
395   "id": "cell-16",
396   "metadata": {},
397   "source": [
398    "## 7. Hyperparameter Tuning - Grid Search\n",
399    "\n",
400    "We tune C and gamma jointly to find the best combination."
401   ]
402  },
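     {
      "cell_type": "markdown",
      "id": "cell-16a",
      "metadata": {},
      "source": [
       "A quick cost estimate for the grid used in the next cell (hand arithmetic, not notebook output): 4 values of C, 5 values of gamma, and 2 kernels give\n",
       "\n",
       "$$\n",
       "4 \\times 5 \\times 2 = 40 \\ \\text{candidates}, \\qquad 40 \\times 5 \\ \\text{folds} = 200 \\ \\text{fits}\n",
       "$$\n",
       "\n",
       "plus one final refit of the best candidate on the full training set. The two string options for gamma are defined by scikit-learn as\n",
       "\n",
       "$$\n",
       "\\gamma_{\\text{scale}} = \\frac{1}{n_{\\text{features}} \\cdot \\mathrm{Var}(X)}, \\qquad \\gamma_{\\text{auto}} = \\frac{1}{n_{\\text{features}}}\n",
       "$$\n",
       "\n",
       "For Iris, $n_{\\text{features}} = 4$, so `auto` gives 0.25. On standardized data the variance is close to 1, so `scale` lands near the same value."
      ]
     },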
403  {
404   "cell_type": "code",
405   "execution_count": null,
406   "id": "cell-17",
407   "metadata": {},
408   "outputs": [],
409   "source": [
410    "# Parameter grid\n",
411    "param_grid = {\n",
412    "    'C': [0.1, 1, 10, 100],\n",
413    "    'gamma': ['scale', 'auto', 0.01, 0.1, 1],\n",
414    "    'kernel': ['rbf', 'poly']\n",
415    "}\n",
416    "\n",
417    "# Grid Search\n",
418    "grid_search = GridSearchCV(\n",
419    "    SVC(random_state=42),\n",
420    "    param_grid,\n",
421    "    cv=5,\n",
422    "    scoring='accuracy',\n",
423    "    n_jobs=-1,\n",
424    "    verbose=1\n",
425    ")\n",
426    "\n",
427    "grid_search.fit(X_train_scaled, y_train)\n",
428    "\n",
429    "print(\"\\nGrid Search results:\")\n",
430    "print(f\"  Best parameters: {grid_search.best_params_}\")\n",
431    "print(f\"  Best CV score: {grid_search.best_score_:.4f}\")\n",
432    "print(f\"  Test score: {grid_search.score(X_test_scaled, y_test):.4f}\")"
433   ]
434  },
435  {
436   "cell_type": "code",
437   "execution_count": null,
438   "id": "cell-18",
439   "metadata": {},
440   "outputs": [],
441   "source": [
442    "# Visualize joint tuning of C and gamma (RBF kernel only)\n",
443    "C_range = np.logspace(-2, 2, 5)\n",
444    "gamma_range = np.logspace(-3, 1, 5)\n",
445    "\n",
446    "# Compute scores (5-fold CV on the training set, so the test set is not used for model selection)\n",
447    "scores = np.zeros((len(C_range), len(gamma_range)))\n",
448    "\n",
449    "for i, C in enumerate(C_range):\n",
450    "    for j, gamma in enumerate(gamma_range):\n",
451    "        model = SVC(C=C, gamma=gamma, kernel='rbf')\n",
452    "        cv_scores = cross_val_score(model, X_train_c_scaled, y_train_c, cv=5)\n",
453    "        scores[i, j] = cv_scores.mean()\n",
454    "\n",
455    "# Heatmap visualization\n",
456    "plt.figure(figsize=(10, 8))\n",
457    "plt.imshow(scores, interpolation='nearest', cmap='viridis')\n",
458    "plt.xlabel('gamma')\n",
459    "plt.ylabel('C')\n",
460    "plt.colorbar(label='Accuracy')\n",
461    "plt.xticks(np.arange(len(gamma_range)), [f'{g:.3f}' for g in gamma_range])\n",
462    "plt.yticks(np.arange(len(C_range)), [f'{c:.2f}' for c in C_range])\n",
463    "plt.title('SVM Hyperparameter Tuning (RBF Kernel)')\n",
464    "\n",
465    "# Mark the best point\n",
466    "best_i, best_j = np.unravel_index(scores.argmax(), scores.shape)\n",
467    "plt.scatter(best_j, best_i, marker='*', s=300, c='red', edgecolors='white')\n",
468    "\n",
469    "plt.tight_layout()\n",
470    "plt.show()\n",
471    "\n",
472    "print(f\"Best C: {C_range[best_i]:.2f}\")\n",
473    "print(f\"Best gamma: {gamma_range[best_j]:.3f}\")\n",
474    "print(f\"Best accuracy: {scores.max():.4f}\")"
475   ]
476  },
477  {
478   "cell_type": "markdown",
479   "id": "cell-19",
480   "metadata": {},
481   "source": [
482    "## 8. SVR - Support Vector Regression\n",
483    "\n",
484    "SVR applies the SVM idea to regression problems.\n",
485    "Errors inside the epsilon-tube are ignored; only errors outside the tube are penalized."
486   ]
487  },
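     {
      "cell_type": "markdown",
      "id": "cell-19a",
      "metadata": {},
      "source": [
       "The tube idea corresponds to the epsilon-insensitive loss. This is a sketch in which $f(x)$ denotes the model's prediction (notation assumed here):\n",
       "\n",
       "$$\n",
       "L_\\varepsilon(y, f(x)) = \\max\\bigl(0, \\ |y - f(x)| - \\varepsilon\\bigr)\n",
       "$$\n",
       "\n",
       "For example, with $\\varepsilon = 0.1$ an absolute error of 0.05 costs $\\max(0, 0.05 - 0.1) = 0$, while an absolute error of 0.5 costs $0.5 - 0.1 = 0.4$. Only the training points on or outside the tube become support vectors."
      ]
     },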
488  {
489   "cell_type": "code",
490   "execution_count": null,
491   "id": "cell-20",
492   "metadata": {},
493   "outputs": [],
494   "source": [
495    "# Load the data\n",
496    "diabetes = load_diabetes()\n",
497    "X_train_d, X_test_d, y_train_d, y_test_d = train_test_split(\n",
498    "    diabetes.data, diabetes.target, test_size=0.2, random_state=42\n",
499    ")\n",
500    "\n",
501    "# Scale the features\n",
502    "scaler = StandardScaler()\n",
503    "X_train_d_scaled = scaler.fit_transform(X_train_d)\n",
504    "X_test_d_scaled = scaler.transform(X_test_d)\n",
505    "\n",
506    "# Train the SVR\n",
507    "svr = SVR(\n",
508    "    kernel='rbf',\n",
509    "    C=100,\n",
510    "    epsilon=0.1,  # tube half-width: errors within ±epsilon are ignored\n",
511    "    gamma='scale'\n",
512    ")\n",
513    "svr.fit(X_train_d_scaled, y_train_d)\n",
514    "\n",
515    "# Predict\n",
516    "y_pred_d = svr.predict(X_test_d_scaled)\n",
517    "\n",
518    "print(\"SVR regression results:\")\n",
519    "print(f\"  MSE: {mean_squared_error(y_test_d, y_pred_d):.4f}\")\n",
520    "print(f\"  RMSE: {np.sqrt(mean_squared_error(y_test_d, y_pred_d)):.4f}\")\n",
521    "print(f\"  R²: {r2_score(y_test_d, y_pred_d):.4f}\")\n",
522    "print(f\"  Number of support vectors: {len(svr.support_vectors_)}\")"
523   ]
524  },
525  {
526   "cell_type": "code",
527   "execution_count": null,
528   "id": "cell-21",
529   "metadata": {},
530   "outputs": [],
531   "source": [
532    "# Plot predicted vs. actual values\n",
533    "plt.figure(figsize=(8, 6))\n",
534    "plt.scatter(y_test_d, y_pred_d, alpha=0.7, edgecolors='black')\n",
535    "plt.plot([y_test_d.min(), y_test_d.max()], [y_test_d.min(), y_test_d.max()], 'r--', lw=2)\n",
536    "plt.xlabel('Actual')\n",
537    "plt.ylabel('Predicted')\n",
538    "plt.title(f'SVR Regression (R² = {r2_score(y_test_d, y_pred_d):.4f})')\n",
539    "plt.grid(True, alpha=0.3)\n",
540    "plt.tight_layout()\n",
541    "plt.show()"
542   ]
543  },
544  {
545   "cell_type": "markdown",
546   "id": "cell-22",
547   "metadata": {},
548   "source": [
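549    "## 9. Multiclass Classification Strategies\n",
550    "\n",
551    "An SVM is a binary classifier, so multiclass problems are handled with one of the following strategies.\n",
552    "\n",
553    "- **OvO (One-vs-One)**: k(k-1)/2 classifiers; SVC always trains this way internally\n",
554    "- **OvR (One-vs-Rest)**: k classifiers; the LinearSVC default"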
555   ]
556  },
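     {
      "cell_type": "markdown",
      "id": "cell-22a",
      "metadata": {},
      "source": [
       "A worked count of how many binary classifiers each strategy trains (hand arithmetic; $k$ is the number of classes):\n",
       "\n",
       "$$\n",
       "k = 3: \\ \\frac{3 \\cdot 2}{2} = 3 \\ \\text{(OvO)} \\ \\text{vs.} \\ 3 \\ \\text{(OvR)}, \\qquad k = 10: \\ \\frac{10 \\cdot 9}{2} = 45 \\ \\text{(OvO)} \\ \\text{vs.} \\ 10 \\ \\text{(OvR)}\n",
       "$$\n",
       "\n",
       "For the three Iris classes the counts coincide. With more classes OvO trains more models, but each one sees only the samples of its two classes."
      ]
     },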
557  {
558   "cell_type": "code",
559   "execution_count": null,
560   "id": "cell-23",
561   "metadata": {},
562   "outputs": [],
563   "source": [
564    "# OvO-shaped decision function\n",
565    "svm_ovo = SVC(kernel='rbf', decision_function_shape='ovo')\n",
566    "svm_ovo.fit(X_train_scaled, y_train)\n",
567    "print(f\"OvO accuracy: {svm_ovo.score(X_test_scaled, y_test):.4f}\")\n",
568    "\n",
569    "# OvR-shaped decision function (the SVC default); SVC still trains OvO internally\n",
570    "svm_ovr = SVC(kernel='rbf', decision_function_shape='ovr')\n",
571    "svm_ovr.fit(X_train_scaled, y_train)\n",
572    "print(f\"OvR accuracy: {svm_ovr.score(X_test_scaled, y_test):.4f}\")\n",
573    "\n",
574    "# LinearSVC (OvR by default)\n",
575    "linear_svc = LinearSVC(dual=True, max_iter=10000)\n",
576    "linear_svc.fit(X_train_scaled, y_train)\n",
577    "print(f\"LinearSVC accuracy: {linear_svc.score(X_test_scaled, y_test):.4f}\")"
578   ]
579  },
580  {
581   "cell_type": "markdown",
582   "id": "cell-24",
583   "metadata": {},
584   "source": [
585    "## 10. Kernel Comparison - In Practice\n",
586    "\n",
587    "We compare the performance of several kernels on the breast cancer dataset."
588   ]
589  },
590  {
591   "cell_type": "code",
592   "execution_count": null,
593   "id": "cell-25",
594   "metadata": {},
595   "outputs": [],
596   "source": [
597    "kernels = ['linear', 'poly', 'rbf', 'sigmoid']\n",
598    "\n",
599    "print(\"Kernel performance comparison (Breast Cancer):\")\n",
600    "print(\"-\" * 50)\n",
601    "\n",
602    "for kernel in kernels:\n",
603    "    if kernel == 'poly':\n",
604    "        svm_model = SVC(kernel=kernel, degree=3, gamma='scale')\n",
605    "    else:\n",
606    "        svm_model = SVC(kernel=kernel, gamma='scale')\n",
607    "\n",
608    "    svm_model.fit(X_train_c_scaled, y_train_c)\n",
609    "    acc = svm_model.score(X_test_c_scaled, y_test_c)\n",
610    "    print(f\"  {kernel:8s}: {acc:.4f} (SVs: {len(svm_model.support_vectors_)})\")"
611   ]
612  },
613  {
614   "cell_type": "markdown",
615   "id": "cell-26",
616   "metadata": {},
617   "source": [
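618    "## Summary\n",
619    "\n",
620    "### Key Concepts\n",
621    "\n",
622    "| Concept | Description |\n",
623    "|------|------|\n",
624    "| **Support vectors** | Training points on or inside the margin that determine the decision boundary |\n",
625    "| **Margin** | Distance between the decision boundary and the nearest training points |\n",
626    "| **C** | Regularization parameter (large: narrow margin, small: wide margin) |\n",
627    "| **gamma** | RBF kernel reach (large: narrow influence, small: wide influence) |\n",
628    "| **Kernel** | Function that computes inner products in an implicit higher-dimensional feature space |\n",
629    "\n",
630    "### SVM Usage Checklist\n",
631    "\n",
632    "1. ✅ **Scale features**: apply StandardScaler or MinMaxScaler\n",
633    "2. ✅ **Choose a kernel**: linearly separable → linear, nonlinear → rbf\n",
634    "3. ✅ **Tune parameters**: tune C and gamma with GridSearchCV\n",
635    "4. ✅ **Large datasets**: use LinearSVC or SGDClassifier\n",
636    "5. ✅ **When probabilities are needed**: set probability=True (adds training cost)\n",
637    "\n",
638    "### Pros and Cons\n",
639    "\n",
640    "**Pros**:\n",
641    "- Effective on high-dimensional data\n",
642    "- Memory-efficient (only the support vectors are stored)\n",
643    "- Handles nonlinear problems through a choice of kernels\n",
644    "\n",
645    "**Cons**:\n",
646    "- Slow on large datasets (training scales between O(n²) and O(n³))\n",
647    "- Requires feature scaling\n",
648    "- Requires hyperparameter tuning\n",
649    "\n",
650    "### Next Steps\n",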
651    "- k-Nearest Neighbors (kNN)\n",
652    "- Naive Bayes\n",
653    "- Ensemble methods"
654   ]
655  }
656 ],
657 "metadata": {
658  "kernelspec": {
659   "display_name": "Python 3",
660   "language": "python",
661   "name": "python3"
662  },
663  "language_info": {
664   "name": "python",
665   "version": "3.9.0"
666  }
667 },
668 "nbformat": 4,
669 "nbformat_minor": 5
670}