Abstract
Time series feature engineering is time-consuming because scientists and engineers must consider the many algorithms of signal processing and time series analysis when identifying and extracting meaningful features from time series. The Python package tsfresh (Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests) accelerates this process by combining 63 time series characterization methods, which by default compute a total of 794 time series features, with feature selection based on automatically configured hypothesis tests. By identifying statistically significant time series characteristics at an early stage of the data science process, tsfresh closes feedback loops with domain experts and fosters the development of domain-specific features early on. The package implements standard APIs of time series and machine learning libraries (e.g. pandas and scikit-learn) and is designed both for exploratory analyses and for straightforward integration into operational data science applications.
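The two stages described above, feature extraction followed by selection through hypothesis tests, can be illustrated with a minimal standard-library sketch. It does not use tsfresh or reproduce its API: the four features, the permutation test, the Benjamini-Hochberg correction, the synthetic data, and all function names are assumptions chosen only to show the idea of testing each extracted feature against a target.

```python
import random
import statistics

rng = random.Random(42)  # fixed seed so the sketch is reproducible


def extract_features(series):
    """Map one time series to a few scalar characteristics (illustrative set)."""
    changes = [abs(b - a) for a, b in zip(series, series[1:])]
    return {
        "mean": statistics.fmean(series),
        "std": statistics.pstdev(series),
        "max": max(series),
        "mean_abs_change": statistics.fmean(changes),
    }


def permutation_p_value(values, labels, n_permutations=2000):
    """Two-sided permutation test on the gap between class means of one feature."""

    def mean_gap(lbls):
        pos = [v for v, l in zip(values, lbls) if l == 1]
        neg = [v for v, l in zip(values, lbls) if l == 0]
        return abs(statistics.fmean(pos) - statistics.fmean(neg))

    observed = mean_gap(labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)  # break any link between feature and target
        if mean_gap(shuffled) >= observed:
            hits += 1
    return (hits + 1) / (n_permutations + 1)


def benjamini_hochberg(p_values, fdr=0.05):
    """Return feature names kept by the Benjamini-Hochberg step-up procedure."""
    ranked = sorted(p_values.items(), key=lambda kv: kv[1])
    m = len(ranked)
    cutoff = 0
    for rank, (_, p) in enumerate(ranked, start=1):
        if p <= rank / m * fdr:
            cutoff = rank  # largest rank whose p-value passes its threshold
    return [name for name, _ in ranked[:cutoff]]


# Synthetic data (assumption): class 1 series are noisier than class 0 series.
labels = [0] * 20 + [1] * 20
series_list = [
    [rng.gauss(0.0, 1.0 if y == 0 else 3.0) for _ in range(50)] for y in labels
]

# Stage 1: one row of features per time series.
feature_rows = [extract_features(s) for s in series_list]

# Stage 2: one hypothesis test per feature, then multiple-testing control.
p_values = {
    name: permutation_p_value([row[name] for row in feature_rows], labels)
    for name in feature_rows[0]
}
selected = benjamini_hochberg(p_values)

print(p_values)
print(selected)
```

Because the two classes differ in noise level rather than in level, spread-related features such as `std` and `mean_abs_change` pass the tests here. Correcting for multiple testing matters more as the number of candidate features grows, since testing many features individually would otherwise admit many spurious ones.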
Publication Info
- Year: 2018
- Type: article
- Volume: 307
- Pages: 72-77
- Citations: 1100
- Access: Closed
Identifiers
- DOI: 10.1016/j.neucom.2018.03.067