Skip to content

Instantly share code, notes, and snippets.

@Balry
Balry / tf.py
Created February 16, 2017 05:08 — forked from satomacoto/tf.py
create a document-term frequency matrix
# create candidate sentense set
docs = [['a', 'b', 'c'],
['b', 'd']]
terms = ['a', 'b', 'c', 'd']
vlist = []
n = len(docs)
d = len(terms)
for doc in docs:
tmp = []
for term in doc:
@Balry
Balry / tfidf java
Created February 16, 2017 05:00 — forked from johnconroy/tfidf java
Term frequency Inverse Document Frequency Java
package tfidf;
import java.io.BufferedReader;
import java.io.File;
import java.io.FileNotFoundException;
import java.io.FileReader;
import java.io.FileWriter;
import java.io.IOException;
import java.text.DecimalFormat;
import java.util.ArrayList;