Running on:
Interface:
First we added our historical data (about 150 million items) to the index, and trained the index on a random sample of it.
Since then, we receive newly updated data every day (about 400k items per day).
How should I retrain the index every day? Should I draw the random training sample from {the daily updated data} only, or from {the daily updated data and all historical data} together?
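
To make the two options concrete, here is a minimal sketch of the two sampling strategies. It only picks which item ids would go into the training sample and does not call any index library. The function names, the id layout, and the training-sample size `n_train` are hypothetical and chosen purely for illustration.

```python
import random


def sample_daily_only(daily_ids, n_train, seed=0):
    """Option 1: draw the training sample from today's new items only."""
    rng = random.Random(seed)
    return rng.sample(daily_ids, min(n_train, len(daily_ids)))


def sample_history_and_daily(history_ids, daily_ids, n_train, seed=0):
    """Option 2: draw the training sample uniformly from history plus today's items."""
    rng = random.Random(seed)
    n_hist = len(history_ids)
    pool_size = n_hist + len(daily_ids)
    # Sample positions in the combined pool without materializing it.
    picks = rng.sample(range(pool_size), min(n_train, pool_size))
    # Map each pooled position back to a history id or a daily id.
    return [history_ids[p] if p < n_hist else daily_ids[p - n_hist] for p in picks]


# Sizes taken from the question; n_train is an assumed, illustrative value.
history_ids = range(150_000_000)             # ~150 million historical items
daily_ids = range(150_000_000, 150_400_000)  # ~400k new items for one day
n_train = 100_000

opt1 = sample_daily_only(daily_ids, n_train)
opt2 = sample_history_and_daily(history_ids, daily_ids, n_train)
```

With option 2, the daily data makes up only about 0.3% of the pool (400k out of roughly 150.4 million), so a uniform sample will consist almost entirely of historical items.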