🐬
focus

Organizations

@NLPchina
shibing624/README.md

Pinned

  1. pycorrector (Public)

    pycorrector is a toolkit for text error correction. It applies KenLM, T5, MacBERT, ChatGLM3, LLaMA, and other models to error-correction scenarios, and is ready to use out of the box.

    Python · 5.4k stars · 1.1k forks

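  2. text2vec (Public)

    text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and is ready to use out of the box.

    Python · 4.3k stars · 387 forks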

  3. pytextclassifier (Public)

    pytextclassifier is a toolkit for text classification. It implements classification models including LR, XGBoost, TextCNN, FastText, TextRNN, and BERT, and is ready to use out of the box.

    Python · 469 stars · 72 forks

  4. MedicalGPT (Public)

    MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. It trains medical large language models, implementing incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.

    Python · 3k stars · 458 forks

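  5. similarities (Public)

    Similarities: a toolkit for similarity calculation and semantic search. It supports text-to-text, text-to-image, and image-to-image search over datasets on the scale of hundreds of millions of records. Developed in Python 3 and ready to use out of the box.

    Python · 696 stars · 68 forks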

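  6. ChatPilot (Public)

    ChatPilot: implements AgentChat conversations, with support for Google search, chat over files and URLs (RAG), and a code interpreter. It reproduces Kimi Chat (drag in a file, send a URL).

    Svelte · 409 stars · 40 forks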