Commit Graph

312 Commits

Author SHA1 Message Date
guillaume-be
96d867b992
Merge pull request #48 from guillaume-be/sequence_classification_update
Sequence classification update
2020-06-05 09:02:59 +02:00
Guillaume B
b5c62b5d84 Updated version 2020-06-05 08:44:00 +02:00
Guillaume B
b42a14a396 Updated sentence classification to work with single input 2020-06-05 08:43:20 +02:00
Guillaume B
40a6a148a0 Migration of translation pipeline to be non mutable 2020-06-04 23:05:35 +02:00
Guillaume B
59687b5e82 Migration of BART to non mutable generator (3) 2020-06-04 22:19:42 +02:00
Guillaume B
3781040055 Migration of BART to non mutable generator (2) 2020-06-04 22:12:11 +02:00
Guillaume B
69a424d09a Migration of BART to non mutable generator (1) 2020-06-04 21:38:10 +02:00
Guillaume B
6d14e51018 Updated doctests 2020-06-04 13:12:04 +02:00
Guillaume B
69e7000aeb Definition of Cache enum and implementation for GPT2 2020-06-04 13:00:10 +02:00
Guillaume B
3c010a077b Updated generation code for GPT and GPT2 to allow non mutable models 2020-06-03 19:08:40 +02:00
guillaume-be
fa2182431c
Merge pull request #46 from guillaume-be/translation_update
Translation update
2020-05-27 16:00:14 +00:00
Guillaume B
f7c078abeb Updated documentation 2020-05-27 17:33:28 +02:00
Guillaume B
1dd522b238 Addition to English <-> Russian 2020-05-27 17:11:34 +02:00
guillaume-be
9f96c95a6a
Creation of local runtime for async resource download (#43)
* Creation of local runtime for async resource download

* CI trigger
2020-05-26 19:43:43 +02:00
Guillaume B
897833885e updated version 2020-05-25 22:34:33 +02:00
guillaume-be
d360b5cfdf
Merge pull request #44 from guillaume-be/marian_implementation
Marian implementation
2020-05-25 20:19:24 +00:00
Guillaume B
829965d68b Model resources update, documentation 2020-05-25 22:01:53 +02:00
Guillaume B
ff312a7b9e Allow for models requiring a prefix 2020-05-25 20:17:02 +02:00
Guillaume B
c315354340 Licenses update 2020-05-25 16:58:40 +02:00
Guillaume B
ff0277925e Creation of translation pipeline, added documentation, created unit tests 2020-05-25 16:20:23 +02:00
Guillaume B
67ab4fc772 Updated copyright notices 2020-05-24 21:20:21 +02:00
Guillaume B
ff2e9f2581 Update scores for padding index 2020-05-24 20:29:44 +02:00
Guillaume B
d6b5abe947 Split MarianForConditionalGeneration to own module 2020-05-24 20:15:07 +02:00
Guillaume B
cccad54194 Updates to BART for Marian compatibility 2020-05-24 17:00:04 +02:00
Guillaume B
2b498bb10a Creation of MarianGenerator 2020-05-24 15:39:17 +02:00
guillaume-be
60bffe6e8b
Merge pull request #41 from guillaume-be/multilabel_classification
Addition of multi-label classification prediction method for sequence…
2020-05-20 16:41:11 +00:00
Guillaume B
d0fc3ff40d Addition of multi-label classification prediction method for sequence classification pipeline
Version update
2020-05-20 17:51:29 +02:00
guillaume-be
749ceeecd0
Merge pull request #39 from guillaume-be/tokenizer_version_update
Support of rust_tokenizers 3.1.0
2020-05-19 16:59:39 +00:00
Guillaume B
33a623e54d Support of rust_tokenizers 3.1.0 2020-05-19 18:44:11 +02:00
guillaume-be
6ccf92fedb
Merge pull request #37 from guillaume-be/qa_optimization
Qa optimization
2020-05-16 07:58:00 +00:00
Guillaume B
b37ab31c66 Constant batch size for question answering, answers respect original input order 2020-05-16 09:42:45 +02:00
Guillaume B
2775ab1c1e handling of answers shorter than top_k 2020-05-16 09:29:25 +02:00
guillaume-be
75aeefde65 Merge pull request #36 from guillaume-be/question_answering_long_contexts
Updated QA allowing for long contexts
2020-05-15 20:25:28 +00:00
Guillaume B
32d0f7e221 Updated batching 2020-05-15 22:10:32 +02:00
Guillaume B
eee7ed4d42 Updated QA allowing for long contexts 2020-05-15 22:01:52 +02:00
guillaume-be
16753ea8fb
Merge pull request #33 from guillaume-be/token_entity_parsing
Token entity parsing
2020-05-13 15:11:09 +00:00
Guillaume B
3945416192 Updated token consolidation avoiding copy 2020-05-11 23:24:33 +02:00
Guillaume B
b31a569e50 Added documentation 2020-05-11 22:05:11 +02:00
Guillaume B
0bbb47d1db Added options for label consolidation for sub tokens 2020-05-11 16:40:25 +02:00
Guillaume B
6d61074f7f Updated consolidation and documentation 2020-05-10 12:27:42 +02:00
Guillaume B
1a9e315edf Added sub-token consolidation 2020-05-10 12:05:13 +02:00
Guillaume B
9d3a944051 Updated token classification pipeline to use next tokenization features, reference to original text 2020-05-10 11:38:28 +02:00
guillaume-be
705489169f
Merge pull request #32 from proycon/offsets
Adapted to API changes in offset-aware rust-tokenizer
2020-05-10 08:30:47 +00:00
Guillaume B
a95dd15e2d Updated dependencies
Bumped version number

Bumped version number
2020-05-10 10:15:59 +02:00
Guillaume B
bb94510075 Updated integration tests and formatting 2020-05-09 22:21:58 +02:00
Maarten van Gompel
e9c55e29b7 Adapted to API changes in offset-aware rust-tokenizer (guillaume-be/rust-tokenizers#14 , guillaume-be/rust-tokenizers#19) 2020-05-09 20:16:42 +02:00
guillaume-be
ddb90c7199
Merge pull request #31 from guillaume-be/generic_sequence_classification
Generic sequence classification
2020-05-07 17:57:20 +00:00
Guillaume B
1de940932f Addition to Electra to generic pipeline options and to token classification 2020-05-07 18:11:15 +02:00
Guillaume B
0489adafbc Shared generic pipeline components (config, tokenziers) moved to common module 2020-05-07 18:03:25 +02:00
Guillaume B
053413fcbc Addition of generic classification model (following generic token classification pattern) 2020-05-07 17:26:20 +02:00