Package Release Info

python-pandas-doc-0.22.0-bp150.2.7

Update Info: Base Release
Available in Package Hub : 15

platforms

AArch64
ppc64le
s390x
x86-64

subpackages

python-pandas-doc

Change Logs

Version: 0.22.0-bp150.2.6

* Thu Jan 11 2018 tchvatal@suse.com

- Format with spec-cleaner

* Wed Jan 03 2018 arun@gmx.de

- specfile:
  * update copyright year
- update to version 0.22.0:
  * Pandas 0.22.0 changes the handling of empty and all-NA sums and
    products. The summary is that
    + The sum of an empty or all-NA Series is now 0
    + The product of an empty or all-NA Series is now 1
    + We?ve added a min_count parameter to .sum() and .prod()
    controlling the minimum number of valid values for the result to
    be valid. If fewer than min_count non-NA values are present, the
    result is NA. The default is 0. To return NaN, the 0.21
    behavior, use min_count=1.

* Sat Dec 16 2017 arun@gmx.de

- update to version 0.21.1:
  * Highlights include:
    + Temporarily restore matplotlib datetime plotting
    functionality. This should resolve issues for users who
    implicitly relied on pandas to plot datetimes with
    matplotlib. See here.
    + Improvements to the Parquet IO functions introduced in
    0.21.0. See here.
  * Improvements to the Parquet IO functionality
    + DataFrame.to_parquet() will now write non-default indexes when
    the underlying engine supports it. The indexes will be preserved
    when reading back in with read_parquet() (GH18581).
    + read_parquet() now allows to specify the columns to read from a
    parquet file (GH18154)
    + read_parquet() now allows to specify kwargs which are passed to
    the respective engine (GH18216)
  * Other Enhancements
    + Timestamp.timestamp() is now available in Python 2.7. (GH17329)
    + Grouper and TimeGrouper now have a friendly repr output
    (GH18203).
  * Deprecations
    + pandas.tseries.register has been renamed to
    pandas.plotting.register_matplotlib_converters`() (GH18301)
  * Performance Improvements
    + Improved performance of plotting large series/dataframes
    (GH18236).
  * Conversion
    + Bug in TimedeltaIndex subtraction could incorrectly overflow
    when NaT is present (GH17791)
    + Bug in DatetimeIndex subtracting datetimelike from DatetimeIndex
    could fail to overflow (GH18020)
    + Bug in IntervalIndex.copy() when copying and IntervalIndex with
    non-default closed (GH18339)
    + Bug in DataFrame.to_dict() where columns of datetime that are
    tz-aware were not converted to required arrays when used with
    orient='records', raising"TypeError` (GH18372)
    + Bug in DateTimeIndex and date_range() where mismatching tz-aware
    start and end timezones would not raise an err if end.tzinfo is
    None (GH18431)
    + Bug in Series.fillna() which raised when passed a long integer
    on Python 2 (GH18159).
  * Indexing
    + Bug in a boolean comparison of a datetime.datetime and a
    datetime64[ns] dtype Series (GH17965)
    + Bug where a MultiIndex with more than a million records was not
    raising AttributeError when trying to access a missing attribute
    (GH18165)
    + Bug in IntervalIndex constructor when a list of intervals is
    passed with non-default closed (GH18334)
    + Bug in Index.putmask when an invalid mask passed (GH18368)
    + Bug in masked assignment of a timedelta64[ns] dtype Series,
    incorrectly coerced to float (GH18493)
  * I/O
    + Bug in class:~pandas.io.stata.StataReader not converting
    date/time columns with display formatting addressed
    (GH17990). Previously columns with display formatting were
    normally left as ordinal numbers and not converted to datetime
    objects.
    + Bug in read_csv() when reading a compressed UTF-16 encoded file
    (GH18071)
    + Bug in read_csv() for handling null values in index columns when
    specifying na_filter=False (GH5239)
    + Bug in read_csv() when reading numeric category fields with high
    cardinality (GH18186)
    + Bug in DataFrame.to_csv() when the table had MultiIndex columns,
    and a list of strings was passed in for header (GH5539)
    + Bug in parsing integer datetime-like columns with specified
    format in read_sql (GH17855).
    + Bug in DataFrame.to_msgpack() when serializing data of the
    numpy.bool_ datatype (GH18390)
    + Bug in read_json() not decoding when reading line deliminted
    JSON from S3 (GH17200)
    + Bug in pandas.io.json.json_normalize() to avoid modification of
    meta (GH18610)
    + Bug in to_latex() where repeated multi-index values were not
    printed even though a higher level index differed from the
    previous row (GH14484)
    + Bug when reading NaN-only categorical columns in HDFStore
    (GH18413)
    + Bug in DataFrame.to_latex() with longtable=True where a latex
    multicolumn always spanned over three columns (GH17959)
  * Plotting
    + Bug in DataFrame.plot() and Series.plot() with DatetimeIndex
    where a figure generated by them is not pickleable in Python 3
    (GH18439)
  * Groupby/Resample/Rolling
    + Bug in DataFrame.resample(...).apply(...) when there is a
    callable that returns different columns (GH15169)
    + Bug in DataFrame.resample(...) when there is a time change (DST)
    and resampling frequecy is 12h or higher (GH15549)
    + Bug in pd.DataFrameGroupBy.count() when counting over a
    datetimelike column (GH13393)
    + Bug in rolling.var where calculation is inaccurate with a
    zero-valued array (GH18430)
  * Reshaping
    + Error message in pd.merge_asof() for key datatype mismatch now
    includes datatype of left and right key (GH18068)
    + Bug in pd.concat when empty and non-empty DataFrames or Series
    are concatenated (GH18178 GH18187)
    + Bug in DataFrame.filter(...) when unicode is passed as a
    condition in Python 2 (GH13101)
    + Bug when merging empty DataFrames when np.seterr(divide='raise')
    is set (GH17776)
  * Numeric
    + Bug in pd.Series.rolling.skew() and rolling.kurt() with all
    equal values has floating issue (GH18044)
    + Bug in TimedeltaIndex subtraction could incorrectly overflow
    when NaT is present (GH17791)
    + Bug in DatetimeIndex subtracting datetimelike from DatetimeIndex
    could fail to overflow (GH18020)
  * Categorical
    + Bug in DataFrame.astype() where casting to ?category? on an
    empty DataFrame causes a segmentation fault (GH18004)
    + Error messages in the testing module have been improved when
    items have different CategoricalDtype (GH18069)
    + CategoricalIndex can now correctly take a
    pd.api.types.CategoricalDtype as its dtype (GH18116)
    + Bug in Categorical.unique() returning read-only codes array when
    all categories were NaN (GH18051)
    + Bug in DataFrame.groupby(axis=1) with a CategoricalIndex
    (GH18432)
  * String
    + Series.str.split() will now propogate NaN values across all
    expanded columns instead of None (GH18450)

* Mon Oct 30 2017 arun@gmx.de

- specfile:
  * updated minimum numpy version to 1.9.0 (see setup.py)
- update to version 0.21.0:
  * Highlights include:
    + Integration with Apache Parquet, including a new top-level
    read_parquet() function and DataFrame.to_parquet() method, see
    here.
    + New user-facing pandas.api.types.CategoricalDtype for specifying
    categoricals independent of the data, see here.
    + The behavior of sum and prod on all-NaN Series/DataFrames is now
    consistent and no longer depends on whether bottleneck is
    installed, see here.
    + Compatibility fixes for pypy, see here.
    + Additions to the drop, reindex and rename API to make them more
    consistent, see here.
    + Addition of the new methods DataFrame.infer_objects (see here)
    and GroupBy.pipe (see here).
    + Indexing with a list of labels, where one or more of the labels
    is missing, is deprecated and will raise a KeyError in a future
    version, see here.
  * full list at http://pandas.pydata.org/pandas-docs/stable/whatsnew.html

* Sat Sep 23 2017 arun@gmx.de

- update to version 0.20.3:
  * bug fix release, see http://pandas.pydata.org/pandas-docs/stable/whatsnew.html#v0-20-3-july-7-2017
    for complete changelog
- changes from version 0.20.2:
  * bug fix release, see http://pandas.pydata.org/pandas-docs/stable/whatsnew.html#v0-20-2-june-4-2017
    for complete changelog

* Tue May 30 2017 toddrme2178@gmail.com

- Fix documentation BuildRequires.

* Thu May 18 2017 toddrme2178@gmail.com

- Update to version 0.20.1
  Highlights include:
  * New ``.agg()`` API for Series/DataFrame similar to the
    groupby-rolling-resample API's
  * Integration with the ``feather-format``, including a new
    top-level ``pd.read_feather()`` and ``DataFrame.to_feather()``
    method
  * The ``.ix`` indexer has been deprecated
  * ``Panel`` has been deprecated
  * Addition of an ``IntervalIndex`` and ``Interval`` scalar type
  * Improved user API when grouping by index levels in ``.groupby()``
  * Improved support for ``UInt64`` dtypes
  * A new orient for JSON serialization, ``orient='table'``, that
    uses the Table Schema spec and that gives the possibility for
    a more interactive repr in the Jupyter Notebook
  * Experimental support for exporting styled DataFrames
    (``DataFrame.style``) to Excel
  * Window binary corr/cov operations now return a MultiIndexed
    ``DataFrame`` rather than a ``Panel``, as ``Panel`` is now
    deprecated
  * Support for S3 handling now uses ``s3fs``
  * Google BigQuery support now uses the ``pandas-gbq`` library

* Tue Apr 25 2017 toddrme2178@gmail.com

- Implement single-spec version.

* Thu Mar 30 2017 toddrme2178@gmail.com

- update to version 0.19.2:
  * Enhancements
    The pd.merge_asof(), added in 0.19.0, gained some improvements:
    + pd.merge_asof() gained left_index/right_index and
    left_by/right_by arguments (GH14253)
    + pd.merge_asof() can take multiple columns in by parameter and
    has specialized dtypes for better performace (GH13936)
  * Performance Improvements
    + Performance regression with PeriodIndex (GH14822)
    + Performance regression in indexing with getitem (GH14930)
    + Improved performance of .replace() (GH12745)
    + Improved performance Series creation with a datetime index and
    dictionary data (GH14894)
  * Bug Fixes
    + Compat with python 3.6 for pickling of some offsets (GH14685)
    + Compat with python 3.6 for some indexing exception types
    (GH14684, GH14689)
    + Compat with python 3.6 for deprecation warnings in the test
    suite (GH14681)
    + Compat with python 3.6 for Timestamp pickles (GH14689)
    + Compat with dateutil==2.6.0; segfault reported in the testing
    suite (GH14621)
    + Allow nanoseconds in Timestamp.replace as a kwarg (GH14621)
    + Bug in pd.read_csv in which aliasing was being done for
    na_values when passed in as a dictionary (GH14203)
    + Bug in pd.read_csv in which column indices for a dict-like
    na_values were not being respected (GH14203)
    + Bug in pd.read_csv where reading files fails, if the number of
    headers is equal to the number of lines in the file (GH14515)
    + Bug in pd.read_csv for the Python engine in which an unhelpful
    error message was being raised when multi-char delimiters were
    not being respected with quotes (GH14582)
    + Fix bugs (GH14734, GH13654) in pd.read_sas and
    pandas.io.sas.sas7bdat.SAS7BDATReader that caused problems when
    reading a SAS file incrementally.
    + Bug in pd.read_csv for the Python engine in which an unhelpful
    error message was being raised when skipfooter was not being
    respected by Python?s CSV library (GH13879)
    + Bug in .fillna() in which timezone aware datetime64 values were
    incorrectly rounded (GH14872)
    + Bug in .groupby(..., sort=True) of a non-lexsorted MultiIndex
    when grouping with multiple levels (GH14776)
    + Bug in pd.cut with negative values and a single bin (GH14652)
    + Bug in pd.to_numeric where a 0 was not unsigned on a
    downcast='unsigned' argument (GH14401)
    + Bug in plotting regular and irregular timeseries using shared
    axes (sharex=True or ax.twinx()) (GH13341, GH14322).
    + Bug in not propogating exceptions in parsing invalid datetimes,
    noted in python 3.6 (GH14561)
    + Bug in resampling a DatetimeIndex in local TZ, covering a DST
    change, which would raise AmbiguousTimeError (GH14682)
    + Bug in indexing that transformed RecursionError into KeyError or
    IndexingError (GH14554)
    + Bug in HDFStore when writing a MultiIndex when using
    data_columns=True (GH14435)
    + Bug in HDFStore.append() when writing a Series and passing a
    min_itemsize argument containing a value for the index (GH11412)
    + Bug when writing to a HDFStore in table format with a
    min_itemsize value for the index and without asking to append
    (GH10381)
    + Bug in Series.groupby.nunique() raising an IndexError for an
    empty Series (GH12553)
    + Bug in DataFrame.nlargest and DataFrame.nsmallest when the index
    had duplicate values (GH13412)
    + Bug in clipboard functions on linux with python2 with unicode
    and separators (GH13747)
    + Bug in clipboard functions on Windows 10 and python 3 (GH14362,
    GH12807)
    + Bug in .to_clipboard() and Excel compat (GH12529)
    + Bug in DataFrame.combine_first() for integer columns (GH14687).
    + Bug in pd.read_csv() in which the dtype parameter was not being
    respected for empty data (GH14712)
    + Bug in pd.read_csv() in which the nrows parameter was not being
    respected for large input when using the C engine for parsing
    (GH7626)
    + Bug in pd.merge_asof() could not handle timezone-aware
    DatetimeIndex when a tolerance was specified (GH14844)
    + Explicit check in to_stata and StataWriter for out-of-range
    values when writing doubles (GH14618)
    + Bug in .plot(kind='kde') which did not drop missing values to
    generate the KDE Plot, instead generating an empty
    plot. (GH14821)
    + Bug in unstack() if called with a list of column(s) as an
    argument, regardless of the dtypes of all columns, they get
    coerced to object (GH11847)
- update to version 0.19.1:
  * Performance Improvements
    + Fixed performance regression in factorization of Period data
    (GH14338)
    + Fixed performance regression in Series.asof(where) when where is
    a scalar (GH14461)
    + Improved performance in DataFrame.asof(where) when where is a
    scalar (GH14461)
    + Improved performance in .to_json() when lines=True (GH14408)
    + Improved performance in certain types of loc indexing with a
    MultiIndex (GH14551).
  * Bug Fixes
    + Source installs from PyPI will now again work without cython
    installed, as in previous versions (GH14204)
    + Compat with Cython 0.25 for building (GH14496)
    + Fixed regression where user-provided file handles were closed in
    read_csv (c engine) (GH14418).
    + Fixed regression in DataFrame.quantile when missing values where
    present in some columns (GH14357).
    + Fixed regression in Index.difference where the freq of a
    DatetimeIndex was incorrectly set (GH14323)
    + Added back pandas.core.common.array_equivalent with a
    deprecation warning (GH14555).
    + Bug in pd.read_csv for the C engine in which quotation marks
    were improperly parsed in skipped rows (GH14459)
    + Bug in pd.read_csv for Python 2.x in which Unicode quote
    characters were no longer being respected (GH14477)
    + Fixed regression in Index.append when categorical indices were
    appended (GH14545).
    + Fixed regression in pd.DataFrame where constructor fails when
    given dict with None value (GH14381)
    + Fixed regression in DatetimeIndex._maybe_cast_slice_bound when
    index is empty (GH14354).
    + Bug in localizing an ambiguous timezone when a boolean is passed
    (GH14402)
    + Bug in TimedeltaIndex addition with a Datetime-like object where
    addition overflow in the negative direction was not being caught
    (GH14068, GH14453)
    + Bug in string indexing against data with object Index may raise
    AttributeError (GH14424)
    + Corrrecly raise ValueError on empty input to pd.eval() and
    df.query() (GH13139)
    + Bug in RangeIndex.intersection when result is a empty set
    (GH14364).
    + Bug in groupby-transform broadcasting that could cause incorrect
    dtype coercion (GH14457)
    + Bug in Series.__setitem__ which allowed mutating read-only
    arrays (GH14359).
    + Bug in DataFrame.insert where multiple calls with duplicate
    columns can fail (GH14291)
    + pd.merge() will raise ValueError with non-boolean parameters in
    passed boolean type arguments (GH14434)
    + Bug in Timestamp where dates very near the minimum (1677-09)
    could underflow on creation (GH14415)
    + Bug in pd.concat where names of the keys were not propagated to
    the resulting MultiIndex (GH14252)
    + Bug in pd.concat where axis cannot take string parameters 'rows'
    or 'columns' (GH14369)
    + Bug in pd.concat with dataframes heterogeneous in length and
    tuple keys (GH14438)
    + Bug in MultiIndex.set_levels where illegal level values were
    still set after raising an error (GH13754)
    + Bug in DataFrame.to_json where lines=True and a value contained
    a } character (GH14391)
    + Bug in df.groupby causing an AttributeError when grouping a
    single index frame by a column and the index level
    (:issue`14327`)
    + Bug in df.groupby where TypeError raised when
    pd.Grouper(key=...) is passed in a list (GH14334)
    + Bug in pd.pivot_table may raise TypeError or ValueError when
    index or columns is not scalar and values is not specified
    (GH14380)

* Mon Mar 27 2017 toddrme2178@gmail.com

- Fix documentation building

* Sun Oct 23 2016 toddrme2178@gmail.com

- update to version 0.19.0:
  (long changelog, see http://pandas.pydata.org/pandas-docs/stable/whatsnew.html#v0-19-0-october-2-2016)
  * Highlights include:
    + merge_asof() for asof-style time-series joining
    + .rolling() is now time-series aware
    + read_csv() now supports parsing Categorical data
    + A function union_categorical() has been added for combining
    categoricals
    + PeriodIndex now has its own period dtype, and changed to be more
    consistent with other Index classes
    + Sparse data structures gained enhanced support of int and bool
    dtypes
    + Comparison operations with Series no longer ignores the index,
    see here for an overview of the API changes.
    + Introduction of a pandas development API for utility functions
    + Deprecation of Panel4D and PanelND. We recommend to represent
    these types of n-dimensional data with the xarray package.
    + Removal of the previously deprecated modules pandas.io.data,
    pandas.io.wb, pandas.tools.rplot.
- specfile:
  * require python3-Cython
  * Split documentation into own subpackage to speed up build.
  * Remove buildrequires for optional dependencies to speed up build.
- Remove unneeded patches:
  * 0001_disable_experimental_msgpack_big_endian.patch ^
  * 0001_respect_byteorder_in_statareader.patch