aleph v3.14.1 releases: find the people and companies you look for
Aleph is a tool for indexing large amounts of both documents (PDF, Word, HTML) and structured (CSV, XLS, SQL) data for easy browsing and search. It is built with investigative reporting as a primary use case. Aleph allows cross-referencing mentions of well-known entities (such as people and companies) against watchlists, e.g. from prior research or public datasets.
Here are some key features:
- Web-based search across large document and data sets.
- Imports many file formats, including popular office formats, spreadsheets, email and zipped archives. Processing includes optical character recognition, language and encoding detection and named entity extraction.
- Load structured entity graph data from databases and CSV files. This allows navigation of complex datasets like companies registries, sanctions lists or procurement data. Import tools for OpenSanctions. are included.
- Receive notifications for new search matches with a personal watchlist.
- OAuth authorization and access control on a per-source and per-watchlist basis.
Changelog v3.14.1
What’s Changed
- Sentry support
This release adds support for sending error tracebacks to sentry.io (or a self-hosted instance). This is controlled by two environment variables:
SENTRY_DSN
andSENTRY_ENVIRONMENT
. - Fixed a flaky UI test (#3011)
ingest-file
version bumped to 3.18.4- Use
bump2version
for the docker-compose files incontrib/
to automatically keep them up to date.
Dependency upgrades
- Bump loader-utils from 2.0.2 to 2.0.4 in /ui by @dependabot in #2699
- Bump decode-uri-component from 0.2.0 to 0.2.2 in /ui by @dependabot in #2762
- Bump json5 from 1.0.1 to 1.0.2 in /ui by @dependabot in #2803
- Bump webpack from 5.74.0 to 5.76.1 in /ui by @dependabot in #2944
Download && Tutorial
Copyright (c) 2014-2015 Friedrich Lindenberg
Copyright (c) 2016-2017 Journalism Development Network, Inc.
Source: https://github.com/alephdata/