diff options
author | Vincent Ambo <mail@tazj.in> | 2023-01-18T15·35+0300 |
---|---|---|
committer | tazjin <tazjin@tvl.su> | 2023-01-18T15·44+0000 |
commit | c8918334142be3cf79e3555467a00545ea5fea07 (patch) | |
tree | 682b7465c8482076e5d3fd0d999283780b5b604a /tvix/eval/README.md | |
parent | 0ed6583edcdaac73f4ed7fb30b09d8fb95c25f9d (diff) |
feat(corp/data-import): build morphology database in derivation r/5693
This makes the actual imported database of the ~whole Russian language (all lemmas, grammemes, forms etc.) a Nix build target which is built in CI. This still needs schema normalisation (it's fairly directly mapped to the raw data), but it's already starting to be a useful data set. This also happens to be a pretty cool demonstration of the power of Nix. You can do `nix-build -A corp.russian.data-import.database` and out comes a perfectly valid SQLite database with a valid external data import! Change-Id: I5d6d15e67d0e4a7ff590fad06252be34f5d561fd Reviewed-on: https://cl.tvl.fyi/c/depot/+/7866 Reviewed-by: tazjin <tazjin@tvl.su> Tested-by: BuildkiteCI
Diffstat (limited to 'tvix/eval/README.md')
0 files changed, 0 insertions, 0 deletions