Abstract
In this paper we are concerned with the practical issues of working with data sets common to finance, statistics, and other related fields. pandas is a new library which aims to facilitate working with these data sets and to provide a set of fundamental building blocks for implementing statistical models. We will discuss specific design issues encountered in the course of developing pandas with relevant examples and some comparisons with the R language. We conclude by discussing possible future directions for statistical computing and data analysis using Python.
Keywords
Affiliated Institutions
Related Publications
SCANPY: large-scale single-cell gene expression data analysis
Scanpy is a scalable toolkit for analyzing single-cell gene expression data. It includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inferenc...
Figuring Out Factors: The Use and Misuse of Factor Analysis
Factor analysis is a technique which is designed to reveal whether or not the pattern of responses on a number of tests can be explained by a smaller number of underlying traits...
New tools for automated high-resolution cryo-EM structure determination in RELION-3
Here, we describe the third major release of RELION. CPU-based vector acceleration has been added in addition to GPU support, which provides flexibility in use of resources and ...
fastp: an ultra-fast all-in-one FASTQ preprocessor
Abstract Motivation Quality control and preprocessing of FASTQ files are essential to providing clean data for downstream analysis. Traditionally, a different tool is used for e...
HTSeq—a Python framework to work with high-throughput sequencing data
Abstract Motivation: A large choice of tools exists for many standard tasks in the analysis of high-throughput sequencing (HTS) data. However, once a project deviates from stand...
Publication Info
- Year
- 2010
- Type
- article
- Pages
- 56-61
- Citations
- 10212
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.25080/majora-92bf1922-00a