Last active
July 20, 2023 14:16
-
-
Save y-lan/1d7574ba89f158fc9df36f25117671e8 to your computer and use it in GitHub Desktop.
Revisions
-
y-lan revised this gist
Jul 20, 2023 . 1 changed file with 1 addition and 1 deletion.There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode charactersOriginal file line number Diff line number Diff line change @@ -8,4 +8,4 @@ def count(text): def parallel_count(texts): from joblib import Parallel, delayed results = Parallel(n_jobs=-1)(delayed(count)(text) for text in texts) return sum(results) -
y-lan revised this gist
Jul 20, 2023 . 1 changed file with 2 additions and 2 deletions.There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode charactersOriginal file line number Diff line number Diff line change @@ -1,11 +1,11 @@ from transformers import LlamaTokenizer tokenizer = LlamaTokenizer.from_pretrained('decapoda-research/llama-7b-hf') def count(text): return len(tokenizer(text)['input_ids']) def parallel_count(texts): from joblib import Parallel, delayed results = Parallel(n_jobs=-1)(delayed(count)(text) for text in texts) return sum([results]) -
y-lan renamed this gist
Jul 20, 2023 . 1 changed file with 0 additions and 0 deletions.There are no files selected for viewing
File renamed without changes. -
y-lan created this gist
Jul 20, 2023 .There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode charactersOriginal file line number Diff line number Diff line change @@ -0,0 +1,11 @@ from transformers import LlamaTokenizer tokenizer = LlamaTokenizer.from_pretrained('meta-llama/Llama-2-7b') def count(text): return len(tokenizer(text)['input_ids']) def parallel_count(texts): from joblib import Parallel, delayed results = Parallel(n_jobs=-1)(delayed(count)(text) for text in texts)) return sum([results])