* Wed Mar 24 2021 Luigi Baldoni <aloisio@gmx.com>
- Update to version 5.10.2
Fix:
* Restore mlr manpage to distro file
* Mon Mar 22 2021 Luigi Baldoni <aloisio@gmx.com>
- Update to version 5.10.1
Fix:
* NR is stale after apply remove-empty-columns
* Segfault with --tsv --implicit-csv-header
* Default value of uninitialized variables for multiplication:
docs vs awk
* Bug with uniq + sort
* Mon Nov 30 2020 aloisio@gmx.com
- Update to version 5.10.0
Features:
* The unsparsify -f feature fulfills #387 from @sjackman .
* The new sort-within-records verb is an old ask, underway
from the Go port, backported to C.
* Likewise the truncate DSL function.
Bugfixes:
* The count -n feature was not implemented as intended.
* Pretty-print format now works correctly with
- -headerless-csv-output.
* The seqgen verb now correctly tracks NR and FNR in the
records it emits.
* An intermittent JSON-parsing bug has been fixed.
* Thu Sep 03 2020 Luigi Baldoni <aloisio@gmx.com>
- Update to version 5.9.1
* Disallow prepipe within .mlrrc (fixes CVE-2020-15167)
* Wed Aug 19 2020 aloisio@gmx.com
- Update to version 5.9.0
* You can now save common defaults in a ~/.mlrrc. For example,
if you normally process CSV files, you can say that in your
~/.mlrrc and you can leave off the --csv flag from your mlr
commands.
* Tue Aug 04 2020 aloisio@gmx.com
- Update to version 5.8.0
Features:
* The new count verb is a keystroke-saver for stats -a count
- f {some field name}`.
* --jsonx and --ojsonx are keystroke-savers for --json
- -jvstack and --ojson --jvstack, which is to say, multi-line
pretty-printed JSON format.
* The new -s name=value feature for mlr put and mlr filter
gives you simpler access to environment variables in your
Miller script, as requested in #315.
Bugfixes:
* mlr format-values is no longer SEGVing on CSV/TSV input.
This was reported on #330.
* #313 fixes a corner case when field names within
command-line arguments have embedded newlines.
* Line/column indicators for JSON-formatting error messages
are now correct (previously they were showing up as 0).
* end {print NF} no longer SEGVs. This was reported in #330.
* Several broken doc links were fixed up as reported on #329.
- Drop miller-5.3.0-gcc43.patch (no longer necessary)
- Spec cleanup
* Tue Mar 17 2020 aloisio@gmx.com
- Update to version 5.7.0
Features:
* The new remove-empty-columns and skip-trivial-records are
keystroke-savers for things which would other require DSL
syntax, as tracked in #274.
Bugfixes:
* A bug regarding optional regex-pattern groups was fixed in
[#277].
* As of #294 you can now specify --implicit-csv-header for the
join-file in mlr join.
* A bug with spaces in XTAB-file values was fixed on #296.
* A bug with missing final newline for XTAB-formatted files
using MMAP files was fixed on #301.
- Drop group tag
* Sun Sep 22 2019 Luigi Baldoni <aloisio@gmx.com>
- Update to version 5.6.2
* #271 fixes a corner-case bug with more than 100 CSV/TSV files
with headers of varying lengths.
* Fri Sep 13 2019 Luigi Baldoni <aloisio@gmx.com>
- Update to version 5.6.0
Features:
* The new system DSL function allows you to run arbitrary
shell commands and store them in field values. Some example
usages are documented here. This is in response to issues #246
and #209.
* There is now support for ASV and USV file formats. This is
in response to issue #245.
* The new format-values verb allows you to apply numerical
formatting across all record values. This is in response to
issue #252.
Documentation:
* The new DKVP I/O in Python sample code now works for Python
2 as well as Python 3.
* There is a new cookbook entry on doing multiple joins. This
is in response to issue #235.
Bugfixes:
* The toupper, tolower, and capitalize DSL functions are now
UTF-8 aware, thanks to @sheredom's marvelous
https://github.com/sheredom/utf8.h. The internationalization
page has also been expanded. This is in response to issue #254.
* #250 fixes a bug using in-place mode in conjunction with
verbs (such as rename or sort) which take field-name lists as
arguments.
* #253 fixes a bug in the label when one or more names are
common between old and new.
* #251 fixes a corner-case bug when (a) input is CSV; (b) the
last field ends with a comma and no newline; (c) input is from
standard input and/or --no-mmap is supplied.
* Sun Sep 01 2019 Luigi Baldoni <aloisio@gmx.com>
- Update to version 5.5.0
* Positional indexing and other data-cleaning features
Features:
* The new positional-indexing feature resolves #236 from
@aborruso. You can now get the name of the 3rd field of each
record via $[[3]], and its value by $[[[3]]]. These are both
usable on either the left-hand or right-hand side of assignment
statements, so you can more easily do things like renaming
fields progrmatically within the DSL.
* There is a new capitalize DSL function, complementing the
already-existing toupper. This stems from #236.
* There is a new skip-trivial-records verb, resolving #197.
Similarly, there is a new remove-empty-columns verb, resolving
[#206]. Both are useful for data-cleaning use-cases.
* Another pair is #181 and #256. While Miller uses mmap
internally (and invisibily) to get approximately a 20%
performance boost over not using it, this can cause
out-of-memory issues with reading either large files, or too
many small ones. Now, Miller automatically avoids mmap in these
cases. You can still use --mmap or --no-mmap if you want manual
control of this.
* There is a new --ivar option for the nest verb which
complements the already-existing --evar. This is from #260
thanks to @jgreely.
* There is a new keystroke-saving urandrange DSL function:
urandrange(low, high) is the same as low + (high - low) *
urand().
* There is a new -v option for the cat verb which writes a
low-level record-structure dump to standard error.
* There is a new -N option for mlr which is a keystroke-saver
for --implicit-csv-header --headerless-csv-output.
Documentation:
* The new FAQ entry
http://johnkerl.org/miller/doc/faq.html#How_to_escape_'%3F'_in_re
gexes%3F resolves #203.
* The new FAQ entry
http://johnkerl.org/miller/doc/faq.html#How_can_I_filter_by_date%
3F resolves #208.
* #244 fixes a documentation issue while highlighting the need
for #241.
Bugfixes:
* There was a SEGV using nest within then-chains, fixed in
response to #220.
* Quotes and backslashes weren't being escaped in JSON output
with --jvquoteall; reported on #222.
* Mon Oct 15 2018 Luigi Baldoni <aloisio@gmx.com>
- Update to version 5.4.0
Features:
* The new clean-whitespace verb resolves #190 from @aborruso.
Along with the new functions strip, lstrip, rstrip,
collapse_whitespace, and clean_whitespace, there is now both
coarse-grained and fine-grained control over whitespace
within field names and/or values. See the linked-to
documentation for examples.
* The new altkv verb resolves #184 which was originally opened
via an email request. This supports mapping value-lists such
as a,b,c,d to alternating key-value pairs such as a=b,c=d.
* The new fill-down verb resolves #189 by @aborruso. See the
linked-to documentation for examples.
* The uniq verb now has a uniq -a which resolves #168 from
@sjackman.
* The new regextract and regextract_or_else functions resolve
[#183] by @aborruso.
* The new ssub function arises from #171 by @dohse, as a
simplified way to avoid escaping characters which are special
to regular-expression parsers.
* There are new localtime functions in response to #170 by
@sitaramc. However note that as discussed on #170 these do
not undo one another in all circumstances. This is a
non-issue for timezones which do not do DST. Otherwise, please
use with disclaimers: localdate, localtime2sec, sec2localdate,
sec2localtime, strftime_local, and strptime_local.
* Travis builds at
https://travis-ci.org/johnkerl/miller/builds now run on OSX as
well as Linux.
* An Ubuntu 17 build issue was fixed by @singalen on #164.
Documentation:
* put/filter documentation was confusing as reported by
@NikosAlexandris on #169.
* The new FAQ entry
http://johnkerl.org/miller-releases/miller-head/doc/faq.html#How_
to_rectangularize_after_joins_with_unpaired? resolves #193 by
@aborruso.
* The new cookbook entry
http://johnkerl.org/miller/doc/cookbook.html#Options_for_dealing_
with_duplicate_rows arises from #168 from @sjackman.
* The unsparsify documentation had some words missing as
reported by @tst2005 on #194.
* There was a typo in the cookpage page
http://johnkerl.org/miller/doc/cookbook.html#Full_field_renames_a
nd_reassigns as fixed by @tst2005 in #192.
Bugfixes:
* There was a memory leak for TSV-format files only as
reported by @treynr on #181.
* Dollar sign in regular expressions were not being escaped
properly as reported by @dohse on #171.