Skip to content

Commit d6bed01

Browse files
rabernatshoyer
authored andcommitted
revise top-level package description (#2430)
* revise main package description * Update doc/index.rst Co-Authored-By: rabernat <[email protected]> * Update doc/index.rst Co-Authored-By: rabernat <[email protected]> * Update doc/index.rst Co-Authored-By: rabernat <[email protected]> * next draft * add mention of netCDF * eliminate CDM reference * update README and setup.py * Split long paragraph, minor rewordings
1 parent 624d78f commit d6bed01

File tree

3 files changed

+75
-33
lines changed

3 files changed

+75
-33
lines changed

README.rst

Lines changed: 26 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -18,20 +18,34 @@ xarray: N-D labeled arrays and datasets
1818
.. image:: https://img.shields.io/badge/powered%20by-NumFOCUS-orange.svg?style=flat&colorA=E1523D&colorB=007D8A
1919
:target: http://numfocus.org
2020

21-
**xarray** (formerly **xray**) is an open source project and Python package that aims to bring the
22-
labeled data power of pandas_ to the physical sciences, by providing
23-
N-dimensional variants of the core pandas data structures.
24-
25-
Our goal is to provide a pandas-like and pandas-compatible toolkit for
26-
analytics on multi-dimensional arrays, rather than the tabular data for which
27-
pandas excels. Our approach adopts the `Common Data Model`_ for self-
28-
describing scientific data in widespread use in the Earth sciences:
29-
``xarray.Dataset`` is an in-memory representation of a netCDF file.
30-
21+
**xarray** (formerly **xray**) is an open source project and Python package
22+
that makes working with labelled multi-dimensional arrays simple,
23+
efficient, and fun!
24+
25+
Multi-dimensional (a.k.a. N-dimensional, ND) arrays (sometimes called
26+
"tensors") are an essential part of computational science.
27+
They are encountered in a wide range of fields, including physics, astronomy,
28+
geoscience, bioinformatics, engineering, finance, and deep learning.
29+
In Python, NumPy_ provides the fundamental data structure and API for
30+
working with raw ND arrays.
31+
However, real-world datasets are usually more than just raw numbers;
32+
they have labels which encode information about how the array values map
33+
to locations in space, time, etc.
34+
35+
By introducing *dimensions*, *coordinates*, and *attributes* on top of raw
36+
NumPy-like arrays, xarray is able to understand these labels and use them to
37+
provide a more intuitive, more concise, and less error-prone experience.
38+
Xarray also provides a large and growing library of functions for advanced
39+
analytics and visualization with these data structures.
40+
Xarray was inspired by and borrows heavily from pandas_, the popular data
41+
analysis package focused on labelled tabular data.
42+
Xarray can read and write data from most common labeled ND-array storage
43+
formats and is particularly tailored to working with netCDF_ files, which were
44+
the source of xarray's data model.
45+
46+
.. _NumPy: http://www.numpy.org/
3147
.. _pandas: http://pandas.pydata.org
32-
.. _Common Data Model: http://www.unidata.ucar.edu/software/thredds/current/netcdf-java/CDM
3348
.. _netCDF: http://www.unidata.ucar.edu/software/netcdf
34-
.. _OPeNDAP: http://www.opendap.org/
3549

3650
Why xarray?
3751
-----------

doc/index.rst

Lines changed: 26 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -2,19 +2,33 @@ xarray: N-D labeled arrays and datasets in Python
22
=================================================
33

44
**xarray** (formerly **xray**) is an open source project and Python package
5-
that aims to bring the labeled data power of pandas_ to the physical sciences,
6-
by providing N-dimensional variants of the core pandas data structures.
7-
8-
Our goal is to provide a pandas-like and pandas-compatible toolkit for
9-
analytics on multi-dimensional arrays, rather than the tabular data for which
10-
pandas excels. Our approach adopts the `Common Data Model`_ for self-
11-
describing scientific data in widespread use in the Earth sciences:
12-
``xarray.Dataset`` is an in-memory representation of a netCDF file.
13-
5+
that makes working with labelled multi-dimensional arrays simple,
6+
efficient, and fun!
7+
8+
Multi-dimensional (a.k.a. N-dimensional, ND) arrays (sometimes called
9+
"tensors") are an essential part of computational science.
10+
They are encountered in a wide range of fields, including physics, astronomy,
11+
geoscience, bioinformatics, engineering, finance, and deep learning.
12+
In Python, NumPy_ provides the fundamental data structure and API for
13+
working with raw ND arrays.
14+
However, real-world datasets are usually more than just raw numbers;
15+
they have labels which encode information about how the array values map
16+
to locations in space, time, etc.
17+
18+
By introducing *dimensions*, *coordinates*, and *attributes* on top of raw
19+
NumPy-like arrays, xarray is able to understand these labels and use them to
20+
provide a more intuitive, more concise, and less error-prone experience.
21+
Xarray also provides a large and growing library of functions for advanced
22+
analytics and visualization with these data structures.
23+
Xarray was inspired by and borrows heavily from pandas_, the popular data
24+
analysis package focused on labelled tabular data.
25+
Xarray can read and write data from most common labeled ND-array storage
26+
formats and is particularly tailored to working with netCDF_ files, which were
27+
the source of xarray's data model.
28+
29+
.. _NumPy: http://www.numpy.org/
1430
.. _pandas: http://pandas.pydata.org
15-
.. _Common Data Model: http://www.unidata.ucar.edu/software/thredds/current/netcdf-java/CDM
1631
.. _netCDF: http://www.unidata.ucar.edu/software/netcdf
17-
.. _OPeNDAP: http://www.opendap.org/
1832

1933
Documentation
2034
-------------
@@ -106,7 +120,7 @@ See also
106120
.. _2015 Unidata Users Workshop talk: https://www.youtube.com/watch?v=J9ypQOnt5l8
107121
.. _tutorial: https://github.com/Unidata/unidata-users-workshop/blob/master/notebooks/xray-tutorial.ipynb
108122
.. _with answers: https://github.com/Unidata/unidata-users-workshop/blob/master/notebooks/xray-tutorial-with-answers.ipynb
109-
.. _Nicolas Fauchereau's tutorial: http://nbviewer.ipython.org/github/nicolasfauchereau/metocean/blob/master/notebooks/xray.ipynb
123+
.. _Nicolas Fauchereau's tutorial: http://nbviewer.iPython.org/github/nicolasfauchereau/metocean/blob/master/notebooks/xray.ipynb
110124

111125
Get in touch
112126
------------

setup.py

Lines changed: 23 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -34,19 +34,33 @@
3434
DESCRIPTION = "N-D labeled arrays and datasets in Python"
3535
LONG_DESCRIPTION = """
3636
**xarray** (formerly **xray**) is an open source project and Python package
37-
that aims to bring the labeled data power of pandas_ to the physical sciences,
38-
by providing N-dimensional variants of the core pandas data structures.
37+
that makes working with labelled multi-dimensional arrays simple,
38+
efficient, and fun!
3939
40-
Our goal is to provide a pandas-like and pandas-compatible toolkit for
41-
analytics on multi-dimensional arrays, rather than the tabular data for which
42-
pandas excels. Our approach adopts the `Common Data Model`_ for self-
43-
describing scientific data in widespread use in the Earth sciences:
44-
``xarray.Dataset`` is an in-memory representation of a netCDF file.
40+
Multi-dimensional (a.k.a. N-dimensional, ND) arrays (sometimes called
41+
"tensors") are an essential part of computational science.
42+
They are encountered in a wide range of fields, including physics, astronomy,
43+
geoscience, bioinformatics, engineering, finance, and deep learning.
44+
In Python, NumPy_ provides the fundamental data structure and API for
45+
working with raw ND arrays.
46+
However, real-world datasets are usually more than just raw numbers;
47+
they have labels which encode information about how the array values map
48+
to locations in space, time, etc.
4549
50+
By introducing *dimensions*, *coordinates*, and *attributes* on top of raw
51+
NumPy-like arrays, xarray is able to understand these labels and use them to
52+
provide a more intuitive, more concise, and less error-prone experience.
53+
Xarray also provides a large and growing library of functions for advanced
54+
analytics and visualization with these data structures.
55+
Xarray was inspired by and borrows heavily from pandas_, the popular data
56+
analysis package focused on labelled tabular data.
57+
Xarray can read and write data from most common labeled ND-array storage
58+
formats and is particularly tailored to working with netCDF_ files, which were
59+
the source of xarray's data model.
60+
61+
.. _NumPy: http://www.numpy.org/
4662
.. _pandas: http://pandas.pydata.org
47-
.. _Common Data Model: http://www.unidata.ucar.edu/software/thredds/current/netcdf-java/CDM
4863
.. _netCDF: http://www.unidata.ucar.edu/software/netcdf
49-
.. _OPeNDAP: http://www.opendap.org/
5064
5165
Important links
5266
---------------

0 commit comments

Comments
 (0)