"Iter 10: Train loss 1.275, It/sec 0.144, Tokens/sec 115.780\n",
"[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.\n",
"Iter 20: Train loss 1.052, It/sec 0.087, Tokens/sec 92.686\n",
"[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.\n",
"Iter 30: Train loss 1.230, It/sec 0.110, Tokens/sec 91.892\n",
"[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.\n",
"Iter 40: Train loss 1.032, It/sec 0.109, Tokens/sec 91.080\n",
"Iter 50: Train loss 0.977, It/sec 0.128, Tokens/sec 95.607\n",
"Iter 60: Train loss 1.021, It/sec 0.166, Tokens/sec 94.361\n",
"[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.\n",
"Iter 70: Train loss 1.077, It/sec 0.097, Tokens/sec 87.647\n",
"[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.\n",
"libc++abi: terminating due to uncaught exception of type std::runtime_error: [METAL] Command buffer execution failed: Insufficient Memory (00000008:kIOGPUCommandBufferCallbackErrorOutOfMemory)\n"
],
"metadata": {
"collapsed": false
},
"id": "51b420d949a23c54"
"id": "7e10d007a2d411f0"
}
],
"metadata": {
...
...
%% Cell type:markdown id:d6264ff5d5024ba1 tags:
# Finetuning experiments
Based on the LoRA example in mlx-examples: https://github.com/ml-explore/mlx-examples/tree/main/lora
New content is downloaded only if the list of journals has changed or previously downloaded files have been deleted. To overwrite existing files, pass `overwrite=True`.
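The skip-unless-missing behaviour described above can be sketched roughly as follows. This is only an illustration: the function name `files_to_download` and its parameters are hypothetical and not taken from the notebook's own download helper.

```python
from pathlib import Path


def files_to_download(targets, overwrite=False):
    """Return the target paths that still need to be fetched.

    Hypothetical helper: with overwrite=False only missing files are
    returned; with overwrite=True every target is returned.
    """
    return [p for p in targets if overwrite or not Path(p).exists()]
```

With this shape, deleting a downloaded file is enough to have it fetched again on the next run, while `overwrite=True` forces a refresh of everything.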
instruction="Below is the content of a website of a German law journal. For each member of the editorial board or the advisory board, extract the following information: lastname, firstname, title, position, affiliation, role. Return as a YAML list of dictionaries. Omit keys that you cannot find information for."
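As a minimal sketch of how such an instruction might be combined with a page and its expected answer into one training record: the prompt layout and the function name `make_record` are assumptions for illustration, and the `{"text": ...}` JSONL shape is assumed to match what the linked LoRA example reads, which should be checked against that repository.

```python
import json


def make_record(instruction, page_text, answer_yaml):
    """Build one JSONL line for fine-tuning (hypothetical layout).

    The instruction, the website content, and the expected YAML answer
    are joined with blank lines into a single "text" field.
    """
    text = f"{instruction}\n\n{page_text}\n\n{answer_yaml}"
    # ensure_ascii=False keeps umlauts in German names readable in the file.
    return json.dumps({"text": text}, ensure_ascii=False)
```

Each returned string would be written as one line of the training file.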
Iter 10: Train loss 1.275, It/sec 0.144, Tokens/sec 115.780
[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.
Iter 20: Train loss 1.052, It/sec 0.087, Tokens/sec 92.686
[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.
Iter 30: Train loss 1.230, It/sec 0.110, Tokens/sec 91.892
[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.
Iter 40: Train loss 1.032, It/sec 0.109, Tokens/sec 91.080
Iter 50: Train loss 0.977, It/sec 0.128, Tokens/sec 95.607
Iter 60: Train loss 1.021, It/sec 0.166, Tokens/sec 94.361
[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.
Iter 70: Train loss 1.077, It/sec 0.097, Tokens/sec 87.647
[WARNING] Some sequences are longer than 2048 tokens. Consider pre-splitting your data to save memory.
libc++abi: terminating due to uncaught exception of type std::runtime_error: [METAL] Command buffer execution failed: Insufficient Memory (00000008:kIOGPUCommandBufferCallbackErrorOutOfMemory)
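The warnings in the log above suggest pre-splitting long sequences, and the run ends with a Metal out-of-memory error. One way to act on that hint before training is sketched below. It is a rough illustration only: it counts whitespace-separated words as a stand-in for the model's real tokenizer (so the counts will differ from actual token counts), and all function names are hypothetical.

```python
def whitespace_token_count(text):
    # Stand-in for a real tokenizer: counts whitespace-separated words only.
    return len(text.split())


def partition_by_length(texts, max_tokens=2048, count_tokens=whitespace_token_count):
    """Split texts into (within_limit, too_long) according to count_tokens."""
    within_limit, too_long = [], []
    for text in texts:
        if count_tokens(text) <= max_tokens:
            within_limit.append(text)
        else:
            too_long.append(text)
    return within_limit, too_long


def chunk_words(text, max_tokens=2048):
    """Cut one text into consecutive pieces of at most max_tokens words."""
    words = text.split()
    return [" ".join(words[i:i + max_tokens]) for i in range(0, len(words), max_tokens)]
```

Note that chunking an instruction example can separate the website content from its YAML answer, so for this dataset it may be more appropriate to drop or shorten the over-long examples than to split them.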