Commit Graph

4894 Commits

Author SHA1 Message Date
Martin Junczys-Dowmunt
1b908a82a9 Merged PR 16162: ChrF validation metric
This PR adds ChrF as a validation metric. This follows the implementation from SacreBLEU.
2020-11-02 15:47:46 +00:00
Martin Junczys-Dowmunt
3a028f215c Merged PR 16144: Merge Cross-Entropy with Label-Smoothing operation
* Compute label-smoothing within the cross-entropy node, should result in faster training.
2020-11-01 18:54:24 +00:00
Martin Junczys-Dowmunt
160b36cec8 Merged PR 15896: Add --after N option to supersede --after-batches and --after-epochs
Replace `--after-batches N` and `--after-epochs N` with `--after Nu/Ne` which allows to specify updates, epochs, target labels with units, e.g.:
* `--after 30Gt` or `--after 50ku` or `--after 10e`
* Can also combine multiple criteria: `--after 30Gt,50ku,10e` and will stop when whichever hits first

Changes default `cost-type` from `ce-mean` to `ce-sum` and turns `display-label-counts` on by default.
2020-10-29 20:16:19 +00:00
Young Jin Kim
fe815e6c29
Move batched GEMM back (#710) 2020-10-26 13:45:59 -07:00
Nikolay Bogoychev
f3c99bf391
Fix MPI on GCC8+ (#742) 2020-10-26 12:48:59 -07:00
rhenry-nv
595fba4145
Fixes bug for certain reductions (#746)
* Fixes reductions into scalars for <= 32 input elements. Only affects reductions where 0 is not the identity
* Update CHANGELOG.md
* Adds space before "?"
* Adds comment explaining increase in margin for reduction tests. Adds axis comment to argument to reduce functions. Adds more tests for small reduction operators
2020-10-26 12:26:41 -07:00
Roman Grundkiewicz
ae866af035 Merged PR 15561: Properly compile FBGEMM in CMake MSVC build
This fixes compilation of FBGEMM on Windows using CMake:
1. Compiling FBGEMM and cpuinfo statically
2. Forcing USE_STATIC_LIBS if USE_FBGEMM is set
2020-09-25 15:50:56 +00:00
Roman Grundkiewicz
244cfaa07e Merged PR 15332: Add CPU-only and GPU-only builds to Azure Pipelines
A few updates to Azure Pipelines:
* Adding CPU-only and GPU-only builds on Ubuntu
* Compiling Marian statically in some of the Ubuntu builds
* Ubuntu build with minimum supported versions of CMake (3.5.1), gcc (5.5), CUDA (10.0 due to GCC 5.5), no MKL
* Compiling marian-server with Boost 1.72 on Windows builds
* Minor clean up
2020-09-25 14:57:39 +00:00
Martin Junczys-Dowmunt
91ad534c65 Merged PR 15320: Sync internal and external master
Updates internal master to external master.

Changes:
* Correct behavior for Pre-Norm transformer
* Small changes to CMake files
2020-09-19 16:22:08 +00:00
Roman Grundkiewicz
467b15e2b9
Updates to GitHub CI workflows (#730)
* Update GitHub workflows
* Disable marian-server on Windows checks
* Rename GitHub workflows
* Remove CMakeSettings.json files used for CI only
2020-09-16 10:27:38 +01:00
Roman Grundkiewicz
97f094b444
Release workflows (#731)
* Add GitHub workflow for generating releases
* Update pattern and job names
2020-09-15 09:42:37 -07:00
Roman Grundkiewicz
b32144c055 Clean up Ubuntu builds 2020-09-14 14:36:04 +01:00
Roman Grundkiewicz
621be16954 Build marian-server on Windows with Boost 1.72 2020-09-14 14:23:10 +01:00
Roman Grundkiewicz
26175e8286 Set timeout for Ubuntu jobs 2020-09-14 09:59:05 +01:00
Roman Grundkiewicz
fbdff66ee0 Remove call to marian-server.exe 2020-09-11 17:53:27 +01:00
Roman Grundkiewicz
3348390f27 Use CUDA 10 in UbuntuBuildMinimal 2020-09-11 17:36:55 +01:00
Roman Grundkiewicz
9e0d6e5f3a Set CUDA_TOOLKIT_ROOT_DIR 2020-09-11 16:58:11 +01:00
Roman Grundkiewicz
20338fd0d5 Fix path to custom cmake 2020-09-11 16:50:41 +01:00
Roman Grundkiewicz
1931bbaa8c Add Ubuntu build with minimum GCC and CMake supported 2020-09-11 14:28:29 +01:00
Roman Grundkiewicz
c192be1660 Disable building marian-server in Windows CI 2020-09-11 13:43:23 +01:00
Roman Grundkiewicz
f9a069c0eb Add matrix.static to Ubuntu builds 2020-09-11 13:31:02 +01:00
Roman Grundkiewicz
976f892917 Add comment on Boost on Azure-hosted runners 2020-09-11 13:25:32 +01:00
Roman Grundkiewicz
02d15861a6 Windows builds with marian-server 2020-09-11 13:03:26 +01:00
Roman Grundkiewicz
29d805a101 Try variables.var syntax for matrix variables 2020-09-11 12:20:31 +01:00
Roman Grundkiewicz
b71c66b8a3 Add matrix.gpu to Ubuntu builds 2020-09-11 11:49:55 +01:00
Roman Grundkiewicz
96291157d1 Move 'variables' to job level 2020-09-11 11:30:27 +01:00
Roman Grundkiewicz
0f1cab6c03 Add CPU-only and GPU-only builds to Azure DevOps 2020-09-11 11:11:34 +01:00
Roman Grundkiewicz
c1e823a6ca
Update VERSION 2020-09-11 10:09:00 +01:00
Roman Grundkiewicz
a28cbf123e
Remove duplicated EXT_LIBS (#728) 2020-09-10 14:04:42 -07:00
Marcin Junczys-Dowmunt
c904b05471 Old CMAKE does not understand VERSION_EQUAL_GREATER 2020-09-09 21:27:24 -07:00
Marcin Junczys-Dowmunt
4937f59747 suppress sentencepiece warnings for g++ > 8.0 2020-09-09 21:20:37 -07:00
Marcin Junczys-Dowmunt
2bd1f1c3c7 Merge branch 'master' into pmaster 2020-09-09 18:35:06 -07:00
Aaron Burke
5c45a37fcc Merged PR 14474: CMake build fixes for QuickSAND
- Add installation targets (enabled by GENERATE_MARIAN_INSTALL_TARGETS; default: OFF to preserve CMake 3.5.1 compatibility)
- Add COMPILE_LIBRARY_ONLY option (default: OFF) to exclude in-source executables from the build
- Compiler warning flags are no longer exported as part of the public link interface, only when building privately
- Always set CPUINFO_BUILD_TOOLS=OFF when building fbgemm, not just for MSVC builds

Related work items: #108034
2020-09-10 01:33:44 +00:00
Nikolay Bogoychev
9c6ad012a9
Compute75 for NCCL (#711)
* Propagate SM75 and SM80 to NCCL
2020-09-09 08:28:54 -07:00
Marcin Junczys-Dowmunt
951ecfe932
Enable final stack post-processing for transformer for correct prenorm behavior (#719)
This PR enables final post-processing of a full transformer stack for correct prenorm behavior.
See issues: #715 and #699,

List of changes:

Add final post-processing in encoder and decoder if requested with --transformer-postprocess-top. Can take combinations of d, n, a. Using a will add a skip connection from the bottom of the stack.
Add --task transformer-base-prenorm and --task transformer-big-prenorm which correspond to --task transformer-base --transformer-preprocess n --transformer-postprocess da --transformer-postprocess-top n.
2020-09-09 08:06:20 -07:00
Marcin Junczys-Dowmunt
660719cd27 Merge branch 'master' into pmaster 2020-09-08 11:50:47 -07:00
Marcin Junczys-Dowmunt
e71e7e2469 fix compilation with -DCOMPILE_CPU=off 2020-09-08 11:41:37 -07:00
Roman Grundkiewicz
4f8c2ed6a6 Merged PR 15232: Update Azure Pipelines
A few improvements to Azure Pipelines:
- Disabling build on Ubuntu 20.04 due to [issues with FBGEMM and GCC 9+](https://github.com/marian-nmt/marian-dev/issues/709)
- Replacing Invoke-WebRequest with wget.exe
- Cleaning environmental variables
2020-09-08 16:58:03 +00:00
Marcin Junczys-Dowmunt
794d6f7a55 Merge branch 'master' into pmaster 2020-09-07 12:39:21 -07:00
Martin Junczys-Dowmunt
e3916b3d08 Merged PR 15233: Sync internal master with public master
Regular sync of public and internal master.
2020-09-07 19:37:41 +00:00
Marcin Junczys-Dowmunt
82b5f3feb2 Merge branch 'master' into pmaster 2020-09-07 12:30:12 -07:00
Roman Grundkiewicz
d20172c406
Disable GitHub check with Ubuntu 20.04 and GCC 9 (#717)
* Disable GitHub check with Ubuntu 20.04 and GCC 9
* Use wget instead of Invoke-WebRequest
2020-09-07 20:04:00 +01:00
Roman Grundkiewicz
2457b49356 Update submodule regression-tests 2020-09-06 09:29:59 +01:00
Marcin Junczys-Dowmunt
f9f4155357 move pointer to NCCL version with reduced warnings on > g++-7 2020-09-05 18:58:53 -07:00
Martin Junczys-Dowmunt
ea3ac624c6 Merged PR 15155: Enable output layer without output bias
Adds `--output-omit-bias` option which allows to train an output layer without a bias vector. This is expected to be useful for `--output-approx-knn` during decoding, as the LSH-based k-NN search is then exactly approximating the correct top-K values for decoding. The bias adds a shift otherwise. In first experiments the lack of the output bias does not seem to result in any performance loss.
2020-09-04 17:58:28 +00:00
Roman Grundkiewicz
a4eb201632 Update submodule regression-tests 2020-09-04 09:36:02 -07:00
Roman Grundkiewicz
452f9c79e6
Print message that marian-server is listening on port X after it is accepting connections (#705)
* Print 'server is listening' after it is accepting connections; fix #701
* Minor code formatting
2020-09-03 11:47:10 +01:00
Roman Grundkiewicz
9336746f89
Add GitHub workflow with CUDA 11 on Ubuntu 20.04 (#704)
* Add GitHub workflow with CUDA 11 on Ubuntu 20.04
* Make build job names unique
* Make build with CUDA 11 experimental
2020-09-02 17:20:34 +01:00
Nikolay Bogoychev
4d9d15649e
Enable compute75 when using cuda10 (#698)
* Enable compute75 when using cuda10 or newer and disable compute <50 when using CUDA11
* Re-enable deprecated architectures with CUDA11
2020-09-01 08:56:24 -07:00
Roman Grundkiewicz
cd04725639 Update VERSION 2020-09-01 02:45:02 -07:00