@varrek
Created January 16, 2022 14:51

Revisions

  1. varrek created this gist Jan 16, 2022.
    10 changes: 10 additions & 0 deletions project_pipeline.py
    @@ -0,0 +1,10 @@
    logreg = Pipeline(steps=[("vectorizer", TfidfVectorizer(max_df=MAX_DF,
                                                            min_df=MIN_DF,
                                                            stop_words=russian_stopwords,
                                                            token_pattern=u'(?ui)\\b\\w*[а-я]+\\w*\\b')),
                             ("log_reg", LogisticRegression(n_jobs=N_JOBS,
                                                            solver="saga",
                                                            multi_class="multinomial",
                                                            random_state=100500))
                             ],
                      verbose=True)
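
The `token_pattern` above is the least obvious part of the pipeline, so here is a minimal stdlib-only sketch of what it appears to select: tokens that contain at least one Cyrillic letter in the range `а-я`. The `tokenize` helper and the sample sentence are hypothetical and not part of the gist. The sketch lowercases the text first on the assumption that `TfidfVectorizer` keeps its default `lowercase=True`. It does not reproduce stop-word filtering or TF-IDF weighting.

```python
import re

# Same regular expression as the gist's token_pattern, written as a raw string.
# (?u) is redundant for str patterns in Python 3; (?i) makes matching case-insensitive.
TOKEN_PATTERN = re.compile(r"(?ui)\b\w*[а-я]+\w*\b")


def tokenize(text):
    """Hypothetical helper: lowercase the text, then extract tokens with the pattern."""
    return TOKEN_PATTERN.findall(text.lower())


# Illustrative sample, not taken from the gist's data.
sample = "Привет, world! Covid19 и ковид19 в 2022 году"
tokens = tokenize(sample)
print(tokens)
```

Under these assumptions, purely Latin words and bare numbers are dropped, while mixed tokens such as `ковид19` are kept because they contain a Cyrillic letter. Short function words like `и` and `в` still pass the pattern, so removing them is left to the `stop_words` argument.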