You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: docs/source/embed.rst
+25-7
Original file line number
Diff line number
Diff line change
@@ -76,6 +76,8 @@ Prepare data
76
76
Pair wise
77
77
~~~~~~~~~~~~~
78
78
79
+
If the positive and negative examples have some noise in label, the directly point-wise cross-entropy maybe not the best. The pair wise just compare relatively, or the hinge loss with margin could be better.
If the positive and negative examples have some noise in label, the directly point-wise cross-entropy maybe not the best. The pair wise just compare relatively, or the hinge loss with margin could be better.
193
+
We can use point-wise train, similar to use `tfidf` in information retrieval.
192
194
193
-
arcface
195
+
**arcface**
194
196
195
197
- layer wise learning rate
196
198
- batch size is important
@@ -202,7 +204,6 @@ List wise
202
204
~~~~~~~~~~~~~~~~~~
203
205
204
206
205
-
206
207
3. Training skills to enhance the performance
207
208
----------------------------------------------
208
209
@@ -225,14 +226,31 @@ tuning the important parameters:
225
226
226
227
227
228
Hard negative mining
228
-
~~~~~~~~~~~~~~~~~~~~~~~~
229
-
offline hard mining
229
+
~~~~~~~~~~~~~~~~~~~~~~~~~
230
+
231
+
- offline hard mining or online hard mining
232
+
233
+
If we only have query and positive, we can use it to generate more negative samples to enhance the retrieval performance.
230
234
231
-
online hard mining
235
+
The data format of `input_file` to generate hard negative is `(query, positive)` or `(query, positive, negative)`
236
+
The format of `candidate_pool` of corpus is jsonl of `{text}`
0 commit comments