Web Scraping Dictionary
Learn about Web Scraping terms and what they mean.
Definition updated in July 2023
Simhash
What is Simhash?
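Simhash is a method for creating a fixed-length "hash" or "fingerprint" of a variable-length input, such as a text or document. Although it resembles an ordinary hash function, it is a type of locality-sensitive hashing: rather than scattering similar inputs across unrelated hash values, it is designed so that similar inputs produce similar fingerprints that differ in only a few bits. Two documents can therefore be compared by counting the bits in which their fingerprints differ (the Hamming distance), which makes Simhash useful for spotting near-duplicate pages. To compute it, Simhash first divides the input into smaller pieces, referred to as "features" (for example, words or word sequences), and creates a hash of each one. The final hash for the input is created by combining these hashes bit by bit: each bit of the result takes the value held by the majority of the feature hashes at that position, optionally weighted by how important each feature is.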
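The steps described above can be sketched in a few lines of Python. This is a minimal illustration and not a reference implementation: the 64-bit fingerprint size, the use of lowercase word tokens as features, the truncated MD5 as the per-feature hash, and the function names (features, feature_hash, simhash, hamming_distance) are all assumptions made for this example. Every feature gets equal weight here.

```python
import hashlib
import re

HASH_BITS = 64  # assumed fingerprint size for this sketch


def features(text):
    """Split text into lowercase word tokens (one possible choice of features)."""
    return re.findall(r"\w+", text.lower())


def feature_hash(feature):
    """Return a 64-bit hash of one feature: the first 8 bytes of its MD5 digest."""
    digest = hashlib.md5(feature.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def simhash(text):
    """Combine the feature hashes into one fingerprint by per-bit majority vote."""
    counts = [0] * HASH_BITS
    for feature in features(text):
        h = feature_hash(feature)
        for bit in range(HASH_BITS):
            # A set bit votes +1 for that position, a cleared bit votes -1.
            if (h >> bit) & 1:
                counts[bit] += 1
            else:
                counts[bit] -= 1

    fingerprint = 0
    for bit in range(HASH_BITS):
        # The output bit is 1 only where the set bits won the vote.
        if counts[bit] > 0:
            fingerprint |= 1 << bit
    return fingerprint


def hamming_distance(a, b):
    """Count the bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


if __name__ == "__main__":
    page_a = "Simhash creates a compact fingerprint of a web page"
    page_b = "Simhash creates a compact fingerprint of a web document"
    print(hamming_distance(simhash(page_a), simhash(page_b)))
```

A small Hamming distance between two fingerprints suggests the inputs share most of their features. The cutoff for treating two pages as near-duplicates depends on the fingerprint size and the data, so it is best chosen by experiment.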