sklearn.make_classification

時間 2019-11-11

標籤 sklearn.make sklearn make classification 简体版

原文原文鏈接

sklearn.datasets.make_classification(n_samples=100, n_features=20, n_informative=2, 算法

n_redundant=2, n_repeated=0, n_classes=2, n_clusters_per_class=2, weights=None, 數組

flip_y=0.01, class_sep=1.0, hypercube=True,shift=0.0, scale=1.0, dom

shuffle=True, random_state=None) spa

功能：生成樣本集，一般用於分類算法orm

參數：ip

n_features :特徵個數= n_informative（） + n_redundant + n_repeated
n_informative：多信息特徵的個數
n_redundant：冗餘信息，informative特徵的隨機線性組合
n_repeated ：重複信息，隨機提取n_informative和n_redundant 特徵
n_classes：分類類別
n_clusters_per_class ：某一個類別是由幾個cluster構成的ci

weights:列表類型，權重比io

class_sep:乘以超立方體大小的因子。較大的值分散了簇/類，並使分類任務更容易。默認爲1form

random_state: 若是是int，random_state是隨機數發生器使用的種子; 若是RandomState實例，random_state是隨機數生成器; 若是沒有，則隨機數生成器是np.random使用的RandomState實例。class

返回值：

X：形狀數組[n_samples，n_features]
生成的樣本。

y：形狀數組[n_samples]每一個樣本的類成員的整數標籤。

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。