Hello,
A tensor assertion error is raised when you try to train the model. The output starts with the following:
0%| | 0/10 [00:00<?, ?it/s]../aten/src/ATen/native/cuda/Indexing.cu:1289: indexSelectLargeIndex: block: [56,0,0], thread: [64,0,0] Assertion `srcIndex < srcSelectDimSize` failed.
It's possible to avoid this error by resizing the embeddings:
model.resize_token_embeddings(len(tokenizer))
Unfortunately, this might affect performance. Could you please review the finetune.py script and fix this problem?
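For reference, the assertion `srcIndex < srcSelectDimSize` usually means that an index passed to a lookup is larger than the table it indexes. Here that would most likely be a token id that is greater than or equal to the number of rows in the embedding matrix. Below is a minimal sketch of how this could be checked before training. The helper name `out_of_range_ids` and the numbers are hypothetical and only for illustration; they are not taken from finetune.py.

```python
from typing import Iterable, List


def out_of_range_ids(input_ids: Iterable[int], num_embeddings: int) -> List[int]:
    """Return the distinct token ids that cannot be looked up in an embedding
    table with `num_embeddings` rows (valid indices are 0 .. num_embeddings - 1)."""
    return sorted({i for i in input_ids if i < 0 or i >= num_embeddings})


# Hypothetical values: an embedding table with 32000 rows and a batch that
# contains the id of a token that was added to the tokenizer afterwards.
batch = [1, 15043, 3186, 32000, 2]
bad = out_of_range_ids(batch, num_embeddings=32000)
print(bad)  # [32000]
```

With the real objects, the table size could probably be read from `model.get_input_embeddings().num_embeddings` and compared with `len(tokenizer)`. If the tokenizer is larger, that would explain why resizing the embeddings makes the error go away.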
Thank you in advance.
BR,
Mehti