Learning
  • Initial page
  • Natural Language Processing (NLP) (10%)
  • Chinese text segmentation (10%)
  • Vector Space Modeling (20%)
  • Chatbot (20%)
  • Machine Comprehension
  • Machine Learning (0%)
  • Deep Learning (0%)
  • Environment Setup
  • Git
  • Nivdia
  • Code Editor
  • Juypter Notebook (1%)
  • Reference
  • AWS
  • Azure
  • Azure WebApp PHP+Laravel
  • Competition
  • Python
  • Raspberry Pi
  • Mathematic
  • MS Bot Framework
  • 3rd API
  • Facebook Messenger
  • eBook
  • PHP
  • Tools
  • Image Recognition
  • URL
  • NLP Tools
  • Data Processing
  • sklearn
  • Stock Prediction
  • Seq2seq
  • Titanic
  • Open Data Source
  • Stopwords
  • Transfer Learning
  • Mac Tips
  • Markdown
  • AI Algorithms
  • Scrapping 爬蟲
  • Knowledge Graph (知識圖譜)
  • Web / iOS
  • Live2D
  • test
  • Voice (Speech)
  • VMWare
  • Statistics
  • Docker
Powered by GitBook
On this page
  • 通過全文相似度來尋找相同或相似的代碼
  • simhash與重複信息識別
  • Stock Prediction in Python

Reference

通過全文相似度來尋找相同或相似的代碼

http://blog.startry.com/2016/12/14/find-same-code-by-simhash-and-hamming-distance/blog.startry.com
LogoGitHub - startry/SameCodeFinder: A Text Scanner which can find same or similar sourcecodeGitHub

simhash與重複信息識別

Logo我的数学之美系列二 —— simhash与重复信息识别 - 让机器理解图像 - ITeye博客

按照Charikar在論文中闡述的,64位simhash,海明距離在3以內的文本都可以認為是近重複文本。當然,具體數值需要結合具體業務以及經驗值來確定

Stock Prediction in Python

https://towardsdatascience.com/stock-prediction-in-python-b66555171a2towardsdatascience.com

你的首個 Progressive Web App

Logo漸進式網路應用程式:離線  |  Google DevelopersGoogle Developers

PreviousJuypter Notebook (1%)NextAWS

Last updated 7 years ago