如何在一个数字中为特定的年份值范围指定不同的颜色?(Python)

2024-05-19 17:07:32 发布

您现在位置:Python中文网/ 问答频道 /正文

我有一个时间序列数据集,从1992年到2017年。我可以设置整个数据点的颜色,但我要的是为特定的年份范围设置所需的颜色。例如,1992-1995年的“蓝色”,1995-2005年的“红色”等等,我们怎么能做到呢?在

数据集有2列:年份和值。在

import numpy as np
import pandas as pd
from scipy import stats
from sklearn import linear_model
from matplotlib import pyplot as plt
import pylab
import matplotlib.patches as mpatches
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.linear_model import LinearRegression

Atlantic = pd.read_csv('C:\\AtlanticEnd.csv', error_bad_lines=False)

X = Atlantic['year']

y = Atlantic['Poseidon']

plt.figure(figsize=(20,10))
plt.ylabel('Change in mean sea level [mm]', fontsize=20)
plt.xlabel('Years', fontsize=20)
plt.title('Atlantic Ocean - Mean Sea Level', fontsize=20)
colors = ["blue", "red", "green", "purple"]
texts = ["Poseidon", "Jason1", "Jason2", "Jason3"]
patches = [ plt.plot([],[], marker="o", ms=10, ls="", mec=None, color=colors[i], 
            label="{:s}".format(texts[i]) )[0]  for i in range(len(texts)) ]
plt.legend(handles=patches, loc='upper left', ncol=1, facecolor="grey", numpoints=1 )

plt.plot(X, y, 'ro', color='red')

slope, intercept, r_value, p_value, std_err = stats.linregress(X, y)
plt.plot(X, X*slope+intercept, 'b')

plt.axis([1992, 2018, -25, 80])

plt.grid(True)

plt.show()

def trendline(Atlantic, order=1):
    coeffs = np.polyfit(Atlantic.index.values, list(Atlantic), order)
    slope = coeffs[-2]
    return float(slope)

slope = trendline(y)
print(slope)

enter image description here


Tags: 数据fromimportplotmatplotlib颜色asnp
2条回答

我为这个函数创建了自己的随机数据,但是假设您的日期范围不重叠,这应该可以工作。你的约会似乎也不是pd.datetime类型的。这应该适用于pd.datetime类型,但字典中的查找值将类似于("1992-01-01","2000-01-01")等等。在

# Create data
data = np.random.rand(260,1)
dates = np.array(list(range(1992,2018))*10)

df = pd.DataFrame({"y":data[:,0],"date":dates})
df = df.sort(columns="date")

# Dictionary lookup
lookup_dict = {(1992,2000):"r", (2001,2006):"b",(2007,2018):"k"}

# Slice data and plot
fig, ax = plt.subplots()
for lrange in lookup_dict:
    temp = df[(df.date>=lrange[0]) & (df.date<=lrange[1])]
    ax.plot(temp.date,temp.y,color=lookup_dict[lrange], marker="o",ls="none")

这会产生:

enter image description here

我可以想象,使用彩色地图绘制散点图可能是一个简单的解决方案。如果年份是以十进制格式给出的,则散点的颜色将由年份定义。BoundaryNorm将定义值的范围,并且可以从颜色列表轻松创建colormap。在

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import matplotlib.colors

y =  np.random.rand(300)*26+1992
d = (3.075*(y-1992)-17)+np.random.normal(0,5,300)
df = pd.DataFrame({"year" : y, "data" : d})

bounds = [1992,1995,2005,2015,2018]
colors = ["darkorchid", "crimson", "limegreen", "gold"]
cmap = matplotlib.colors.ListedColormap(colors)
norm = matplotlib.colors.BoundaryNorm(bounds, len(colors))

fig, ax = plt.subplots()
sc = ax.scatter(df.year, df.data, c=df.year.values, cmap=cmap, norm=norm)
fig.colorbar(sc, spacing="proportional")

fit = np.polyfit(df.year.values, df.data.values, deg=1)
ax.plot(df.year, np.poly1d(fit)(df.year.values), color="k")

plt.show()

enter image description here

相关问题 更多 >