openSUSE Build Service

buildservice-autocommit accepted request 1154995 from

Dirk Mueller (dirkmueller) about 2 months ago (revision 60)

baserev update by copy to link target

Dirk Mueller (dirkmueller) accepted request 1154922 from

Benjamin Greiner (bnavigator) about 2 months ago (revision 59)

- Update to 2024.2.0
  * allow loading categoricals even if not so in the pandas
    metadata, when a column is dict-encoded and we only have one
    row-group (#863)
  * apply dtype to the columns names series, even when selecting no
    columns (#861, 859)
  * don’t make strings while estimating bye column size (#858)
  * handle upstream depr (#857, 856)

buildservice-autocommit accepted request 1144842 from

Daniel Garcia (dgarcia) 3 months ago (revision 58)

baserev update by copy to link target

Daniel Garcia (dgarcia) committed 3 months ago (revision 57)

- Do not run tests in s390x, bsc#1218603

buildservice-autocommit accepted request 1130976 from

Dirk Mueller (dirkmueller) 5 months ago (revision 56)

baserev update by copy to link target

Dirk Mueller (dirkmueller) committed 5 months ago (revision 55)

Dirk Mueller (dirkmueller) committed 5 months ago (revision 54)

- update to 2023.10.0:
  * Datetime units in empty() with tz (#893)
  * Fewer inplace decompressions for V2 pages (#890
  * Allow writing categorical column with no categories (#888)
  * Fixes for new numpy (#886)
  * RLE bools and DELTA for v1 pages (#885, 883)

buildservice-autocommit accepted request 1130498 from

Dirk Mueller (dirkmueller) 5 months ago (revision 53)

baserev update by copy to link target

Dirk Mueller (dirkmueller) committed 5 months ago (revision 52)

- update to 2023.8.0:
  * More general timestamp units (#874)
  * ReadTheDocs V2 (#871)
  * Better roundtrip dtypes (#861, 859)
  * No convert when computing bytes-per-item for str (#858)
- Add patch to fox the test test_delta_from_def_2 on
  * row-level filtering of the data. Whereas previously, only full
    row-groups could be excluded on the basis of their parquet
    metadata statistics (if present), filtering can now be done
    within row-groups too. The syntax is the same as before,
    allowing for multiple column expressions to be combined with
    AND|OR, depending on the list structure. This mechanism
    requires two passes: one to load the columns needed to create
    the boolean mask, and another to load the columns actually
    needed in the output. This will not be faster, and may be
    slower, but in some cases can save significant memory
    footprint, if a small fraction of rows are considered good and
    the columns for the filter expression are not in the output.
  * DELTA integer encoding (read-only): experimentally working,
    but we only have one test file to verify against, since it is
    not trivial to persuade Spark to produce files encoded this
    way. DELTA can be extremely compact a representation for
  * nanosecond resolution times: the new extended "logical" types
    system supports nanoseconds alongside the previous millis and
    micros. We now emit these for the default pandas time type,
    and produce full parquet schema including both "converted" and
    "logical" type information. Note that all output has
    isAdjustedToUTC=True, i.e., these are timestamps rather than
    local time. The time-zone is stored in the metadata, as
    before, and will be successfully recreated only in fastparquet

buildservice-autocommit accepted request 1096412 from

Markéta Machová (mcalabkova) 10 months ago (revision 51)

baserev update by copy to link target

Markéta Machová (mcalabkova) accepted request 1096315 from

Arun Persaud (apersaud) 10 months ago (revision 50)

update to latest version

buildservice-autocommit accepted request 1083474 from

Dirk Mueller (dirkmueller) about 1 year ago (revision 49)

baserev update by copy to link target

Dirk Mueller (dirkmueller) committed about 1 year ago (revision 48)

- update to 2023.4.0:
  * allow loading categoricals even if not so in the pandas metadata,
    when a column is dict-encodedand we only have one row-group (#863)
 * apply dtype to the columns names series, even when selecting no
    columns (#861, 859)
 * don't make strings while estimating bye column size (#858)
 * handle upstream depr (#857, 856)

buildservice-autocommit accepted request 1064743 from

Matej Cepl (mcepl) about 1 year ago (revision 47)

baserev update by copy to link target

Matej Cepl (mcepl) accepted request 1064736 from

Arun Persaud (apersaud) about 1 year ago (revision 46)

update to latest version

buildservice-autocommit accepted request 1046424 from

Dirk Mueller (dirkmueller) over 1 year ago (revision 45)

baserev update by copy to link target

Dirk Mueller (dirkmueller) accepted request 1046313 from

Benjamin Greiner (bnavigator) over 1 year ago (revision 44)

- Update to 2022.12.0
  * check all int32 values before passing to thrift writer
  * fix type of num_rows to i64 for big single file
- Release 2022.11.0
  * Switch to calver
  * Speed up loading of nullable types
  * Allow schema evolution by addition of columns
  * Allow specifying dtypes of output
  * update to scm versioning
  * fixes to row filter, statistics and tests
  * support pathlib.Paths
  * JSON encoder options
- Drop fastparquet-pr813-updatefixes.patch

buildservice-autocommit accepted request 1045047 from

Dirk Mueller (dirkmueller) over 1 year ago (revision 43)

baserev update by copy to link target

Dirk Mueller (dirkmueller) accepted request 1044387 from

Guillaume GARDET (Guillaume_G) over 1 year ago (revision 42)

- Add patch to fox the test test_delta_from_def_2 on 
  aarch64, armv7 and ppc64le:
  * fastparquet-pr835.patch

buildservice-autocommit accepted request 1032429 from

Markéta Machová (mcalabkova) over 1 year ago (revision 41)

baserev update by copy to link target

Places

Revisions of python-fastparquet

Places