FSCrawler 2.6
release-drafter
released this
09 Jan 18:34
·
2181 commits
to master
since this release
What's Changed
- Update Jackson to 2.9.8 (#657) @dadoonet
- Update to Tika 1.20 (#655) @dadoonet
- Update to Elasticsearch 6.5.3 (#649) @dadoonet
- Add a warning when using both silent and debug/trace (#647) @dadoonet
- Add documentation on how to run as a Windows service (#648) @dadoonet
- Check Elasticsearch 6 minor version (#642) @dadoonet
- Force the default number of shards to be 1 (#644) @dadoonet
- Update Guava transitive dependency to 27.0.1-jre (#645) @dadoonet
- Revisit Elasticsearch.Node and Rest settings (#638) @dadoonet
- Update to elasticsearch 6.5.1 (#637) @dadoonet
- Ignore dirs when
.fscrawlerignore
file is detected (#633) @dadoonet - Update issue templates (#632) @dadoonet
- Support multiple OCR languages (#631) @dadoonet
- Update Tika to 1.19.1 (#624) @dadoonet
- Create specific elasticsearch clients (#616) @dadoonet
- Add Release Drafter to automatically generate the release notes (#611) @dadoonet
- Add a Noop Parser (#610) @dadoonet
- Dump stack when not able to close FSCrawler (#609) @dadoonet
- Make default root dir Windows compatible (#595) @dadoonet
- Update to Tika 1.19 (#603) @dadoonet
- Update ossindex-maven-plugin to 3.0.1 (#604) @dadoonet
- Update to Jackson 2.9.7 (#602) @dadoonet
- Update to Elasticsearch 6.4.1 (#594) @dadoonet
- Add LGTM code quality badges (#597) @xcorail
- Support XML reoccurring structures (#593) @dadoonet
- Add a filter by content option (#585) @dadoonet
- Exclude dirs depending on dir full name (relative to root) (#561) @dadoonet
- Ignore files bigger than X (#584) @dadoonet
- Add
hocr
option for Tesseract-based OCR (#583) @dadoonet - Allow path partial matching (#582) @dadoonet
- Add support for Last Accessed date and Created date (#580) @dadoonet
- Use _doc doc type instead of doc (#581) @dadoonet
- Fix wrong detection of removed settings (#579) @dadoonet
- Add support for cloud id (#577) @dadoonet
- Update maven-compiler-plugin to 3.8.0 (#576) @dadoonet
- Add ossindex Maven plugin (#572) @dadoonet
- Close bulk processors with awaitClose instead of close (#570) @dadoonet
- Update to elasticsearch 6.3.2 (#569) @dadoonet
- Add File Permissions to generated documents (#567) @dadoonet
- Skip sonar build for external PRs (#568) @dadoonet
- Add a developer guide (#565) @dadoonet
- Add support for bulk size in bytes with unit (#563) @dadoonet
- Update to Elasticsearch 6.3.1 (#557) @dadoonet
- Revert "Use _doc doc type instead of doc" (#558) @dadoonet
- Use _doc doc type instead of doc (#554) @dadoonet
- Fix Sonar Critical issues (#551) @dadoonet
- Fix SonarQube hook (#550) @dadoonet
- Move documentation to https://readthedocs.org (#543) @dadoonet
- Allow using
store_source
without indexing content (#544) @dadoonet - Update to Tika 1.18 (#542) @dadoonet
- Update to Elasticsearch 6.3.0 (#541) @dadoonet
- Add a version check in tests (#527) @dadoonet
- Raw fields should be considered as text/keyword (#526) @dadoonet
- Add tests on OSS image as well (#525) @dadoonet
- Update elasticsearch to 6.2.2 (#524) @dadoonet
- Check that pipeline actually exists when starting (#522) @dadoonet
- Allow setting Tesseract path to executable and data (#520) @dadoonet
- Reduce Time to run tests from the IDE (#518) @dadoonet
- Update to elasticsearch 6.2.1 (#517) @dadoonet
- Split IT into different classes (#514) @dadoonet
- Start elasticsearch with docker-maven-plugin when running from the CLI (#513) @dadoonet
- Autodetect if a local node is running before starting docker (#512) @dadoonet
- Start removal of
core
module (#508) @dadoonet - Create fscrawler-rest module (#506) @dadoonet
- Create fscrawler-crawler-fs and fscrawler-crawler-ssh modules (#505) @dadoonet
- Clean package names (#504) @dadoonet
- Create fscrawler-tika and fscrawler-beans modules (#503) @dadoonet
- Create the fscrawler-cli module (#502) @dadoonet
- Move to Docker based integration tests (#500) @dadoonet
- Modify announcement email (#501) @dadoonet
- readme: add note that fs settings also affect rest (#492) @shadiakiki1986
- Fix ignore folders documentation (#488) @dadoonet
- Add more tests about moving files (#487) @dadoonet
- Includes and Excludes should not be case sensitive (#486) @dadoonet
- Split project into modules (#435) @dadoonet
- add setPipeline call when using REST (#475) @shadiakiki1986
- Add more info in case of bulk failures (#457) @dadoonet
- Don't rely on disk space for tests (#456) @dadoonet
- Update to Lucene 7.0.1 (#452) @dadoonet
- Update to maven-versions-plugin 2.5 (#453) @dadoonet
- Update to Log4J 2.9.1 (#451) @dadoonet
- Update to SQLite 3.20.1 (#450) @dadoonet
- Update to Jackson 2.9.2 (#449) @dadoonet
- Update to elasticsearch 6.0.0-beta2 (#434) @dadoonet
- Update dependencies (Jackson, Log4J, Jansi, SQLite, JSch, JCommander, Randomized Testing) (#430) @dadoonet
- use StringBuilder in a loop (#361) @ctamisier
- Add continue_on_error option to continue on error while crawling (#330) @kneubi
- Fix links typo (#326) @soruly
- Patch Log4J 2.8 to display messages on Windows (#323) @dadoonet
- Missing documentation for some local FS settings (#287) @shadiakiki1986
- add link to repo with dockerfile usage of fscrawler (#278) @shadiakiki1986
- documentation for loop moved to under --loop instead of under --rest (#277) @shadiakiki1986
- Use path analyzer for directory fields (#272) @dadoonet
- Prevent customised mappings from being overwritten (#231) @edjeavons
- Elasticsearch Client must use search size if set (#240) @babadofar
- Add OCR integration documentation (#224) @Jdecaudin
- Default REST elasticsearch port should be 9200 and not 9300 (#142) @FredDut
Thanks to
@FredDut, @Jdecaudin, @Quix0r, @babadofar, @barts2108, @coder-sa, @ctamisier, @dadoonet, @edjeavons, @fgaujous, @gpcmol, @it20one, @kneubi, @shadiakiki1986, @soruly, @vakopian, @xcorail, Ajitpal Singh and Julien Decaudin