python - 如何根据 groupby 使 matplotlib 中的 xticks 均匀分布，尽管它们很有价值？

Question

我试图绘制不同公司按年龄划分的平均回报。我希望年龄变量均匀分布，尽管步长增加并且分布不均匀。我的目标是这样的：我的阴谋目标

fig, ax = plt.subplots(figsize=(10,6), sharex=True, sharey=True)

ax.plot(ipo_cut[["IR", "Age", "Timespan"]][(ipo_cut["Timespan"] == "1980-1989") & (ipo_cut["Age"] >= 2)].groupby("Age").mean(), color="r")
ax2 = ax.twinx()
ax2.plot(ipo_cut[["IR", "Age", "Timespan"]][(ipo_cut["Timespan"] == "1990-1998") & (ipo_cut["Age"] >= 2)].groupby("Age").mean(), color = "g")
ax3 = ax.twinx()
ax3.plot(ipo_cut[["IR", "Age", "Timespan"]][(ipo_cut["Timespan"] == "1999-2000") & (ipo_cut["Age"] >= 2)].groupby("Age").mean(), color="grey")
ax4 = ax.twinx()
ax4.plot(ipo_cut[["IR", "Age", "Timespan"]][(ipo_cut["Timespan"] == "2001-2003") & (ipo_cut["Age"] >= 2)].groupby("Age").mean())

ax.set(title ="Average First-day Returns by Age of Firm at Time of IPO",
      xlabel = "Age",
      ylabel = "Average First-Day Return in %")
ax.set_ylim([0,1])
ax2.set_ylim([0,1])
ax3.set_ylim([0,1])
ax4.set_ylim([0,1])
ax.set_xlim([0,70])
ax2.set_xlim([0,70])
ax3.set_xlim([0,70])
ax4.set_xlim([0,70])
ax2.get_yaxis().set_visible(False)
ax3.get_yaxis().set_visible(False)
ax4.get_yaxis().set_visible(False)

plt.xticks([0,1,2,3,4,5,6,7,8,9,10, 11,12,13,15,17,20,30,40,50,60,70])
plt.setp(ax.get_xticklabels(), rotation = 90)

fig.show()

输出如下所示：我的输出图

根据其他帖子，我只能显示所需的“年龄”年份，但不幸的是轴分布不均匀。我觉得我真的很接近解决方案，但我自己找不到答案。

seaborn 中的解决方案对我也很有帮助。

编辑一些附加信息：x 向量的形状和数据不匹配。我仅针对相关年龄过滤了我的数据框，但缺少一些信息。这是一个最小的可重现示例：

import matplotlib.pyplot as py
import numpy as np
    
#create dataframe
x = np.arange(0,11)
df = pd.DataFrame(data = np.random.rand(len(x)), index = x, columns=["IR"])
df.index.name = "Age"
df.loc[[0, 9], "IR"] = np.nan
df.dropna(inplace = True)

# open figure
fig, ax = plt.subplots(figsize=(10,6))
# create x-values
x = [0,1,2,5,7,9,10]
# plot
ax.plot(x, df, color="r")
ax.set(
    title ="Average First-day Returns by Age of Firm at Time of IPO",
    xlabel = "Age",
    ylabel = "Average First-Day Return in %",
    ylim = [0,1],
    xlim = [0,10])
ax.tick_params(axis='x', labelrotation=90) 
fig.show()

我的目标是绘制没有间隙的整个数据框。所以我的想法是我在背景中有线图，x 轴上有所需的 x 值。我希望这个信息帮助。

解决方案：

import matplotlib.pyplot as py
import numpy as np
    
#create dataframe
x = np.arange(0,10)
df = pd.DataFrame(data = np.random.rand(len(x)), index = x, columns=["IR"])
df.index.name = "Age"
#create nan in sample
df.loc[[0, 9], "IR"] = np.nan
# slice data for unbroken line (solution by max)
lg = df["IR"].isna()
# create ticks for even distribution
n = 10
a = np.arange(n)
# open figure
fig, ax = plt.subplots(figsize=(10,6))
# create x-values fir tick labels
x = (0,5,10,13,15,24,30,40,55,70)
# plot
ax.plot(a[~lg], df[~lg], color="r")
ax.set(
    title ="Average First-day Returns by Age of Firm at Time of IPO",
    xlabel = "Age",
    ylabel = "Average First-Day Return in %",
    ylim = [0,1],
    xlim = [0,10])
ax.xaxis.set_ticks(a) #set the ticks to be a
ax.xaxis.set_ticklabels(x) # change the ticks' names to x
ax.tick_params(axis='x', labelrotation=90) 
fig.show()

多亏了 max 和“如何使刻度均匀分布，尽管它们的值？ ”的输入，我可以找到解决问题的方法。如果您的原始数据框的 x 值的步长不是我建议简单reset_index()的 .

score 0 · Accepted Answer

我建议在开始时创建所需的 x 向量并在其上绘制数据。然后你可以让matplotlib做剩下的事情。反过来，您将人为地创建一个均匀分布的图（如果您不提供 x 值，matplotlib假定数据是步长为 1 的点序列），您只是更改 x 的外观-axis 而不是图形本身。

import matplotlib.pyplot as py
import numpy as np

# open figure
fig, ax = plt.subplots(figsize=(10,6))
# create x-values
x = [0,1,2,3,4,5,6,7,8,9,10, 11,12,13,15,17,20,30,40,50,60,70]
# create (random) y-values
y = np.random.rand(len(x))
# plot
ax.plot(x, y, color="r")
ax.set(
    title ="Average First-day Returns by Age of Firm at Time of IPO",
    xlabel = "Age",
    ylabel = "Average First-Day Return in %",
    ylim = [0,1],
    xlim = [0,70])
ax.tick_params(axis='x', labelrotation=90) 
fig.show()

我剥离了您的代码并创建了一些人工虚拟数据。看看：如何创建一个最小的可重现示例。

python - 如何根据 groupby 使 matplotlib 中的 xticks 均匀分布，尽管它们很有价值？

1 回答 1

Related

Reference