2017-02-23 1 views
1

J'ai timeseries x avec la structure suivante:pandas jours série numpy se décalés de 1

>>> type(x) 
Out[5]: pandas.core.series.Series 

>>> x.head() 
Out[6]: 
2016-06-01 00:00:00+09:00 110.946 
2016-06-01 00:01:00+09:00 110.887 
2016-06-01 00:02:00+09:00 110.864 
2016-06-01 00:03:00+09:00 110.877 
2016-06-01 00:04:00+09:00 110.904 

>>> x.tail() 
Out[7]: 
2016-07-27 08:55:00+09:00 104.905 
2016-07-27 08:56:00+09:00 104.865 
2016-07-27 08:57:00+09:00 104.875 
2016-07-27 08:58:00+09:00 104.855 
2016-07-27 08:59:00+09:00 104.845 

>>> x.index 
Out[8]: 
DatetimeIndex(['2016-06-01 00:00:00+09:00', '2016-06-01 00:01:00+09:00', '2016-06-01 00:02:00+09:00', '2016-06-01 00:03:00+09:00', '2016-06-01 00:04:00+09:00', '2016-06-01 00:05:00+09:00', '2016-06-01 00:06:00+09:00', '2016-06-01 00:07:00+09:00', '2016-06-01 00:08:00+09:00', '2016-06-01 00:09:00+09:00', 
       ... 
       '2016-07-27 08:50:00+09:00', '2016-07-27 08:51:00+09:00', '2016-07-27 08:52:00+09:00', '2016-07-27 08:53:00+09:00', '2016-07-27 08:54:00+09:00', '2016-07-27 08:55:00+09:00', '2016-07-27 08:56:00+09:00', '2016-07-27 08:57:00+09:00', '2016-07-27 08:58:00+09:00', '2016-07-27 08:59:00+09:00'], dtype='datetime64[ns]', length=55364, freq=None, tz='Asia/Tokyo') 

Maintenant, si je tente d'obtenir seulement les jours uniques dans x en utilisant le code suivant:

unique_days = np.unique(np.array(x.index.values.astype('<M8[D]'))) 

weirdly i get:

>>> unique_days 
Out[9]: array(['2016-05-31', '2016-06-01', '2016-06-02', '2016-06-03', '2016-06-05', '2016-06-06', '2016-06-07', '2016-06-08', '2016-06-09', '2016-06-10', '2016-06-12', '2016-06-13', '2016-06-14', '2016-06-15', '2016-06-16', '2016-06-17', '2016-06-19', '2016-06-20', '2016-06-21', '2016-06-22', '2016-06-23', '2016-06-24', '2016-06-26', '2016-06-27', '2016-06-28', '2016-06-29', '2016-06-30', '2016-07-01', '2016-07-03', '2016-07-04', '2016-07-05', '2016-07-06', '2016-07-07', '2016-07-08', '2016-07-10', '2016-07-11', '2016-07-12', '2016-07-13', '2016-07-14', '2016-07-15', '2016-07-17', '2016-07-18', '2016-07-19', '2016-07-20', '2016-07-21', '2016-07-22', '2016-07-24', '2016-07-25', '2016-07-26'], dtype='datetime64[D]') 

donc, fondamentalement, il déplace les jours de 1 jour. Y a-t-il un moyen de contourner cela pour obtenir les bons jours?

Répondre

0

utilisant unique_days = np.unique(x.index.date)

au lieu de np.unique(np.array(x.index.values.astype('<M8[D]')))

résolu le problème pour moi.

1

Je pense qu'il y a un problème avec les fuseaux horaires en numpy - il convertit en tz-aware DatetimeIndex.

print (x.index.tz_convert(None)) 
DatetimeIndex(['2016-05-31 15:00:00', '2016-05-31 15:01:00', 
       '2016-05-31 15:02:00', '2016-05-31 15:03:00', 
       '2016-05-31 15:04:00'], 
       dtype='datetime64[ns]', name='idx', freq=None) 

Pour moi travaille round à days avec unique:

print (x.index.round('D').unique()) 


print (x.index) 
DatetimeIndex(['2016-06-01 00:00:00+09:00', '2016-06-01 00:01:00+09:00', 
       '2016-06-01 00:02:00+09:00', '2016-06-01 00:03:00+09:00', 
       '2016-06-01 00:04:00+09:00'], 
       dtype='datetime64[ns, Asia/Tokyo]', name='idx', freq=None) 

print (x.index.round('D').unique()) 
DatetimeIndex(['2016-06-01 00:00:00+09:00'], 
       dtype='datetime64[ns, Asia/Tokyo]', name='idx', freq=None) 
+0

merci jezrael, ceci est utile. Cependant, pour l'instant, j'ai une méthode plus simple pour le poster ci-dessous. – dayum