怎么删除50%的行分享一定列值

0

的问题

df.groupby(['target']).count()
目标 数据
103210
积极的 211082

现在,我正数据是太大。 我要删除50%的行其值的 TargetPositive. 我怎么可以这样做?

dataframe drop pandas python
2021-11-24 03:30:28
1

最好的答案

6

留一半 Positive 行, sample 50%的 Positive 行使用 frac=0.5drop 这些索引:

indexes = df[df.target == 'Positive'].sample(frac=0.5).index
df = df.drop(indexes)

保持完全100K Positive 行, sample 100K Positive 行使用 n=100_000concat 他们 Negative 行:

df = pd.concat([
    df[df.target == 'Negative'],
    df[df.target == 'Positive'].sample(n=100_000)
])
2021-11-24 04:27:20

其他语言

此页面有其他语言版本

Русский
..................................................................................................................
Italiano
..................................................................................................................
Polski
..................................................................................................................
Română
..................................................................................................................
한국어
..................................................................................................................
हिन्दी
..................................................................................................................
Français
..................................................................................................................
Türk
..................................................................................................................
Česk
..................................................................................................................
Português
..................................................................................................................
ไทย
..................................................................................................................
Español
..................................................................................................................
Slovenský
..................................................................................................................