Commit Graph

74 Commits

Author SHA1 Message Date
Frank Seide
4c8c1b820f Merged PR 14437: minor fixes to make VS happy after latest changes
minor fixes to make VS happy after latest changes
2020-07-29 18:00:59 +00:00
Frank Seide
f9029638ff Merged PR 13039: FAISS builds with Windows
FAISS builds with Windows
2020-05-22 02:49:04 +00:00
Frank Seide
77a420740c Merged PR 12958: ONNX support
This branch adds functionality to export ONNX models (with limitations).
2020-05-21 05:51:18 +00:00
Frank Seide
1d2a137e14 Merged PR 12957: fixes to VS Solution
fixes to VS Solution
2020-05-16 20:46:23 +00:00
Young Jin Kim
d06018d98c Merged PR 12505: Fix windows compile errors
Fix windows compile errors
- Add new files to the vs project files
- Fix a warning
- Add namespace specifiers for ambiguous types
2020-04-17 08:38:37 +00:00
Young Jin Kim
d2b4f3803e Merged PR 11831: Change the weight matrix quantization to use 7-bit min/max quantization to avoid overflow
1. Change the weight matrix quantization to use 7-bit min/max quantization
-> This resolves all the overflow issue, because weight and activations are quantized by min/max range.
2. Clip fp16 quantization to avoid overflow
3. Fix windows build errors (cmake options, vcproj file)
4. int8 pack model (encoder -> fp16)
2020-03-25 02:52:17 +00:00
Young Jin Kim
9c9a240354 Merged PR 10266: FBGEMM based Int8 model
FBGEMM based Int8 model - working with the master
1. Added int8 implementation into packed_gemm.h/cpp with FBGEMM
2. Update FBGEMM library to make it work on windows
3. Split 'ispacked' into packed8 and packed16
4. Change all names for PackFp32 to PackFp16 which is more accurate
2019-12-03 19:14:18 +00:00
Young Jin Kim
5fb31b28d2 Merged PR 10415: Fix windows build errors
1. Added template definition for 'uint64_t', Windows has different definition of 'long' type.
2. Fixed warnings on windows.
2019-11-13 07:37:46 +00:00
Young Jin Kim
5ed441f843 Merged PR 9284: Packed model support in production
This PR enables loading and serving of offline packed models.
* Enable support for FBGEMM offline-packed models
* Added different parameters sets for different parameter element types (currently inference only)
* New types (packed*) are used for the packed memory format.
* Added CPU instruction query into the quicksand API.

Related work items: #102572
2019-11-01 17:24:49 +00:00
Hieu Hoang
bd0118cc66 const NormalizeTempPrefix(). Retain all streamBuf so we can delete them afterwards 2019-10-25 07:30:49 -07:00
Hieu Hoang
9d31fb58dd merge with previous pull request 2019-10-24 14:57:45 -07:00
Hieu Hoang
c5c1a4c449 replace all OutputFileStream with OutputFileStreamNew 2019-10-22 17:50:32 -07:00
Hieu Hoang
c33074ce54 move code .h -> .cpp 2019-10-21 12:17:35 -07:00
Frank Seide
c5eeca2490 removed custom stdint.h, as it is not necessary 2019-09-12 10:09:59 -07:00
Frank Seide
eea9473786 Visual Studio Project builds again 2019-09-12 10:08:12 -07:00
Frank Seide
e40491f44a sorted out AVX2 build problems on Windows 2019-09-11 14:38:22 -07:00
Young Jin Kim
c0a0f4add9 Fix windows build error 2019-09-04 15:01:05 -07:00
Young Jin Kim
4c41fec0ae QuickSAND env fix, Remove binary dependecy of FBGEMM and compile FBGEMM together, Update FBGEMM to the latest. 2019-06-21 09:59:52 -07:00
Young Jin Kim
d2d8ec041d Enable FBGEMM based packed GEMM on windows 2019-06-18 16:59:11 -07:00
Frank Seide
865f863485 VS Project now builds with CUDA 2019-05-31 15:33:51 -07:00
Frank Seide
eacfd2de11 weird mode change back 2019-04-29 19:01:29 -07:00
Frank Seide
cd329be400 weird mode change 2019-04-29 17:52:23 -07:00
Frank Seide
b570638533 weirdo change of access permissions 2019-04-26 14:59:40 -07:00
Frank Seide
4351229899 (updated the VS project) 2019-04-12 09:41:11 -07:00
Frank Seide
ac04bbfa16 merged from master 2019-03-14 17:30:44 -07:00
Frank Seide
f6304a43e8 merged from fseide/criterionfunction 2019-03-13 09:45:47 -07:00
Frank Seide
ff59ebd12b merged from master 2019-03-13 09:14:49 -07:00
Frank Seide
918a06be26 merged with master 2019-03-13 09:00:29 -07:00
Frank Seide
e7dbe93691 merged with fseide/criterionfunction 2019-02-22 15:23:02 -08:00
Frank Seide
23ece0040a ModelBase::build() now returns an Expr, not a RationalLoss 2019-02-22 13:21:45 -08:00
Frank Seide
f88eb0d368 commenting and minor refactoring of beam search 2019-02-06 20:25:43 -08:00
Frank Seide
f74055c2c9 merged from master 2019-02-06 11:21:33 -08:00
Frank Seide
b7d245945f added factored_vocab stubs 2019-02-05 11:45:55 -08:00
Frank Seide
de69efea79 fix after merge 2019-02-04 17:32:53 -08:00
Frank Seide
1b9dad403d renamed VocabBase to IVocab, since it is a pure interface 2019-02-04 17:09:34 -08:00
Frank Seide
ff4db737bf updated after merge from master 2019-02-04 17:05:01 -08:00
Frank Seide
41117fb432 abstracted Expr State::targetIndices_ to Words targetWords_, in prep for factored word representations 2019-02-04 16:34:26 -08:00
Frank Seide
bcbe247a57 auto-merged pieces of last merge 2019-02-04 12:45:25 -08:00
Frank Seide
f9e63da391 removed auto-conversion between Word and WordIndex, and dealt with fallout 2019-01-31 22:24:26 -08:00
Frank Seide
acee183116 towards fixing the bug that bert.h includes transformer.h 2019-01-31 14:55:45 -08:00
Frank Seide
7b649c00e0 In VS Project, changed transformer.h from CPP file to a true header, and added the stub, to fix a compilation warning 2019-01-31 14:45:51 -08:00
Frank Seide
93763c30e6 now always normalizes factors 2019-01-28 10:02:53 -08:00
Marcin Junczys-Dowmunt
50d64de62c
Merge pull request #415 from marian-nmt/chmod-x
C++ source should not have execute bits
2019-01-25 08:52:30 -08:00
Kenneth Heafield
f100a8649a C++ source is should not have execute bits 2019-01-23 23:20:46 +00:00
Frank Seide
2e331b254d merged dot_csr() change 2019-01-15 16:45:49 -08:00
Frank Seide
31f4f721b4 scaffolding for new operation dot_csr(); renamed prod.cu to prod.cpp, since it no longer has CUDA code 2019-01-15 13:25:36 -08:00
Frank Seide
c5839f07e0 towards factored embeddings; towards csr_dot() operation 2019-01-10 21:14:55 -08:00
Frank Seide
9023bf8b90 fixed build break due to MPI change in examples 2019-01-08 17:15:39 -08:00
Frank Seide
f6de599980 generalized step() to narrow() and sliceView(), new class Slice;
bug fix: SliceViewNodeOp should use correct size for memory piece;
new operation stopGradient()
2018-12-26 16:35:56 -08:00
Frank Seide
efc20adacc added elementwise comparison operators (lt(), eq(), );
bug fix: Maximum and Minimum got it backwards;
streamlined the operator tests a little
2018-12-26 14:21:23 -08:00