Skip to content

Instantly share code, notes, and snippets.

@iceout
iceout / 词性标记.md
Created October 18, 2017 07:19 — forked from luw2007/词性标记.md
词性标记: 包含 ICTPOS3.0词性标记集、ICTCLAS 汉语词性标注集、jieba 字典中出现的词性、simhash 中可以忽略的部分词性

词的分类

  • 实词:名词、动词、形容词、状态词、区别词、数词、量词、代词
  • 虚词:副词、介词、连词、助词、拟声词、叹词。

ICTPOS3.0词性标记集

n 名词

nr 人名

@iceout
iceout / gist:03a6334be8d7fcfaf44c
Created December 8, 2014 13:09
test scorePayload of PythonDefaultSimilarity in pylucene
#!/usr/bin/env python
# -*- coding: utf-8 -*-
import lucene
lucene.initVM(vmargs=['-Djava.awt.headless=true'])
from org.apache.lucene.util import Version
from org.apache.lucene.index import DirectoryReader
from org.apache.lucene.index import IndexWriterConfig
from org.apache.lucene.search import IndexSearcher
from org.apache.lucene.index import Term
@iceout
iceout / about.md
Created November 21, 2011 11:26 — forked from jasonrudolph/about.md
Programming Achievements: How to Level Up as a Developer
@iceout
iceout / about.md
Created November 21, 2011 11:26 — forked from blaix/about.md
Programming Achievements: How to Level Up as a Developer

Programming Achievements: How to Level Up as a Developer

  1. Select a particular experience to pursue.
  2. Pursue that experience to completion. (Achievement unlocked!)
  3. Reflect on that experience. Really soak it in. Maybe a blog post would be in order?
  4. Return to Step 1, this time selecting a new experience.

This gist is a fork of the gist from this blog post.