I have two race car datasets, each with 9 columns:
df1:
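I want to set up several conditions that compare the two datasets, and print True or False for each one depending on whether the condition is met. The conditions are: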
- race_start(df1)<=race_start(df2)
- race_end(df1)>=race_end(df2)
- safety_start(df1)<=safety_start(df2)
- safety_end(df1)>=safety_end(df2)
- starting_front(df1)<=starting_front(df2)
- starting_back(df1)<=starting_back(df2)
- pitstop(df1)>=pitstop(df2)
- no_pitstop(df1)>=no_pitstop(df2)
- status(df1)==status(df2)
First, I tried this:
import numpy as np
df1['race_start_final'] = np.where(df1.race_start <= df2.race_start, 'True', 'False')
df1['race_end_final'] = np.where(df1.race_end >= df2.race_end, 'True', 'False')
df1['safety_start_final'] = np.where(df1.safety_start <= df2.safety_start, 'True', 'False')
df1['safety_end_final'] = np.where(df1.safety_end >= df2.safety_end, 'True', 'False')
df1['starting_front_final'] = np.where(df1.starting_front <= df2.starting_front, 'True', 'False')
df1['starting_back_final'] = np.where(df1.starting_back <= df2.starting_back, 'True', 'False')
df1['pitstop_final'] = np.where(df1.pitstop >= df2.pitstop, 'True', 'False')
df1['no_pitstop_final'] = np.where(df1.no_pitstop >= df2.no_pitstop, 'True', 'False')
df1['status_final'] = np.where(df1.status == df2.status, 'True', 'False')
But I got an error saying:
ValueError: Can only compare identically-labeled Series objects
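Am I doing this right? I looked for alternative solutions but could not find anything that matches what I want to do. Can anyone tell me whether my approach is the correct way to go about it? Thanks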
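For context, pandas raises this ValueError when the two Series being compared do not carry the same index labels. Below is a minimal sketch of one possible workaround, assuming the rows of df1 and df2 are meant to be compared by position and both frames have the same number of rows. The sample values, the `conditions` mapping, and the `all_conditions` column are hypothetical and only there for illustration; only three of the nine columns are shown.

```python
import operator

import pandas as pd

# Hypothetical sample data standing in for the real datasets.
df1 = pd.DataFrame(
    {"race_start": [1, 5], "race_end": [10, 8], "status": ["ok", "dnf"]},
    index=[0, 1],
)
df2 = pd.DataFrame(
    {"race_start": [2, 4], "race_end": [9, 9], "status": ["ok", "ok"]},
    index=[10, 11],  # labels differ from df1, which is what triggers the error
)

# Column name -> comparison applied as op(df1[col], df2[col]).
conditions = {
    "race_start": operator.le,  # df1 <= df2
    "race_end": operator.ge,    # df1 >= df2
    "status": operator.eq,      # df1 == df2
}

# Assumption: rows correspond by position, so discard both indexes
# and let each frame get a fresh 0..n-1 index.
left = df1.reset_index(drop=True)
right = df2.reset_index(drop=True)

# One boolean column per condition.
result = pd.DataFrame(
    {f"{col}_final": op(left[col], right[col]) for col, op in conditions.items()}
)

# True only for rows where every condition holds.
result["all_conditions"] = result.all(axis=1)
print(result)
```

The comparisons are kept as real booleans rather than the strings 'True' and 'False', which makes it possible to combine them with `all`. If the rows should instead be matched on a shared key column rather than by position, merging the two frames on that key first would probably be the safer route.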