Commit Graph

312 Commits

Author SHA1 Message Date
guillaume-be
3af309cbac Merge pull request #29 from proycon/tokclasspipeline: More generic token classification pipeline supporting multiple models 2020-05-07 14:30:22 +00:00
Maarten van Gompel
b840cbf57a Removed word_index and continuation for now (and the decoding part responsible for it), will be reintroduced once the tokenizers provide the offset information (guillaume-be/rust-tokenizers#14), issue #29 2020-05-07 14:21:22 +02:00
Maarten van Gompel
e044a9cbd9 Implemented a leaner NERModel that simply delegates to the new token_classification pipeline, but retains all backward-compatibility #29 (+ fixed tests and added docs) 2020-05-07 11:44:55 +02:00
Maarten van Gompel
bb23326ad0 Removed character offset computation to align with the original text post-prediction (also removes the 'space' attribute for now), should be solved at tokenisation time instead #29 2020-05-07 11:44:32 +02:00
Maarten van Gompel
e023420014 Removed unneeded LabelMapping abstraction and adapted DistilBert's maps to use i64 instead of i32 #29 2020-05-07 11:44:32 +02:00
Maarten van Gompel
ff0e8e2504 Implementing a more generic token classification pipeline with support for multiple model-types and less NER-centric naming #29 2020-05-07 11:44:32 +02:00
guillaume-be
18b43c1093 Merge pull request #28 from guillaume-be/updated_gpt2_decoding: Updated dependencies to use latest tokenization crate version 2020-05-03 19:25:09 +00:00
Guillaume B
1ddd1d9f51 Updated dependencies to use latest tokenization crate version 2020-05-03 20:52:24 +02:00
guillaume-be
d3a6a204dc Merge pull request #26 from guillaume-be/electra_implementation: Electra implementation 2020-05-03 13:42:17 +00:00
Guillaume B
83e43ffcd5 Updated resource list 2020-05-03 14:59:13 +02:00
Guillaume B
139eecace7 updated documentation 2020-05-03 13:44:49 +02:00
Guillaume B
c6f5cdd859 Updated documentation 2020-05-03 13:37:18 +02:00
Guillaume B
9f608f4374 Addition of tests for Electra 2020-05-03 11:47:16 +02:00
Guillaume B
029d4bd47c Addition of Electra resources 2020-05-03 10:05:32 +02:00
Guillaume B
5a1c1ae7a0 Addition of ElectraDiscriminator 2020-05-03 09:46:59 +02:00
Guillaume B
1c1f91bcdf Merge remote-tracking branch 'remotes/origin/master' into electra_implementation 2020-05-02 13:58:27 +02:00
guillaume-be
9f4afc62ac Merge pull request #25 from guillaume-be/additional_model_downloads: Additional model downloads 2020-05-02 11:52:35 +00:00
Guillaume B
cc8e02f03d Updated README 2020-05-02 11:25:19 +02:00
Guillaume B
b00bd8f97f Added additional GPT2 resources and license information 2020-05-02 10:43:53 +02:00
Guillaume B
75ed1f864b Updated resources to use cloudfront endpoints 2020-05-02 10:04:02 +02:00
Guillaume B
1cfef470d7 Updated version number 2020-05-02 09:57:37 +02:00
Guillaume B
e30f8a6b11 Updated cache directory setting 2020-05-02 09:12:26 +02:00
Guillaume B
259d30f58d Added possibility to define cache directory 2020-05-02 09:07:43 +02:00
Guillaume B
2c25dc1650 Addition of ElectraForTokenClassification 2020-05-01 10:00:47 +02:00
Guillaume B
4334fa1758 Addition of ElectraForMaskedLM 2020-05-01 08:47:52 +02:00
Guillaume B
5bec2548c1 Addition of Electra generator and discriminator heads 2020-04-29 18:59:37 +02:00
Guillaume B
45eeb7ae5b ElectraModel implementation, weights loaded, forward pass 2020-04-29 18:30:36 +02:00
Guillaume B
b2308d741d Addition of Electra Config and Embeddings 2020-04-28 18:51:04 +02:00
guillaume-be
33b586e450 Merge pull request #21 from julien-c/patch-1: tiny typo 2020-04-26 12:44:49 +02:00
guillaume-be
cdf0f860bc Merge pull request #22 from guillaume-be/native_model_weights: Native model weights 2020-04-26 12:29:43 +02:00
Guillaume B
5f2765bc1e Updated doctests 2020-04-26 12:16:07 +02:00
Guillaume B
194a447b06 Updated documentation 2020-04-26 11:49:40 +02:00
Guillaume B
da3da86281 Removed cache from CI 2020-04-26 10:55:19 +02:00
Guillaume B
5d10fd4157 Updated CI script 2020-04-26 10:54:56 +02:00
Guillaume B
cb254995fd Addition of dependencies download example 2020-04-26 10:51:29 +02:00
Guillaume B
0a85d5ba21 Updated NER Pipeline 2020-04-26 09:13:13 +02:00
Guillaume B
2aeace4a6e Updated Question Answering Pipeline 2020-04-26 09:08:04 +02:00
Guillaume B
69b819ce5e Updated Summarization and Sentiment pipelines 2020-04-26 08:57:20 +02:00
Guillaume B
43091c8e24 Updated BART Base model var store path 2020-04-25 20:35:37 +02:00
Guillaume B
f7aa61f131 Updated Roberta models with new configs and dependency loaders 2020-04-25 17:53:04 +02:00
Guillaume B
ce4041e1d2 Updated GPT models with new configs and dependency loaders 2020-04-25 17:43:44 +02:00
Guillaume B
b5b772c488 Updated GPT2 models with new configs and dependency loaders 2020-04-25 17:34:59 +02:00
Guillaume B
9168e898a9 Updated BART models with new configs and dependency loaders 2020-04-25 17:27:33 +02:00
Guillaume B
5f4a8f8bdf Updated BERT models with new configs and dependency loaders 2020-04-25 17:01:46 +02:00
Guillaume B
3d5a3669ed Updated DistilBert models with new configs and dependency loaders 2020-04-25 16:42:56 +02:00
Guillaume B
cdfe14a3a4 Update of dependencies 2020-04-25 15:41:53 +02:00
Guillaume B
eaccc23be4 Download utilities 2020-04-25 11:31:02 +02:00
Julien Chaumond
1422affca9 tiny typo 2020-04-24 19:37:51 -04:00
Guillaume B
c65943ab2f Updated model weight download 2020-04-24 17:04:28 +02:00
guillaume-be
1451a64b89 Merge pull request #20 from guillaume-be/bart_optimizations: Bart optimizations 2020-04-07 10:43:28 +02:00