Specialty Technologies

Special field specific technologies and tools

blockchainsearchsearch-enginecrawlertime-series
elastic
elasticsearch
elastic
73.1k

Elasticsearch - a distributed RESTful search engine

unionlabs
union
unionlabs
69.9k

Union is a hyper-efficient zero-knowledge infrastructure layer designed for general message passing, asset transfers, NFTs, and DeFi. It operates without dependencies on trusted third parties, oracles, multi-signatures, or MPC, leveraging Consensus Verification for security. Union is compatible with Cosmos chains via IBC and connects to EVM chains like Ethereum, Berachain, and Arbitrum. Its decentralized governance controls contract upgrades, connections, token configurations, and protocol evolution. Key components include `uniond` (node implementation), `galoisd` (ZK prover), `voyager` (cross-ecosystem relayer), and `hubble` (chain indexer). Built with Go, Rust, and Solidity, Union supports reproducible builds via Nix and offers a TypeScript SDK for interaction. It aims to align priorities among users, validators, and operators through decentralized governance.

redis
redis
redis
69.8k

A cache database, it is the indispensable "dessert" in your application! This open source project provides a high-performance and flexible data storage solution, and supports various data structures and complex operations.

TheAlgorithms
Java
TheAlgorithms
62.2k

A Java algorithm list, which provides a detailed demonstration of the built-in algorithm implementations in Java. It offers Java developers a convenient reference, showcasing the application of Java's built-in algorithms in handling various tasks. This project helps developers better understand and use Java's algorithms through clear code examples and illustrations.

prometheus
prometheus
prometheus
59.2k

Prometheus - CNCF project, used to monitor other systems or services. It collects metrics from the target at a given time interval, evaluates them according to rules, displays the results, and can also trigger alarms if certain monitoring conditions are met

meilisearch
meilisearch
meilisearch
52.1k

An open-source, free search engine known for its excellent performance, ease of use, and simple deployment. It offers instant search experiences, supports multiple languages, and is suitable for projects of various scales. Whether it's a small website or a large enterprise-level application, Meilisearch can provide fast and reliable search functionality.

mendableai
firecrawl
mendableai
41.3k

NaiboWang
EasySpider
NaiboWang
39.2k

An easy-to-use web crawler software that provides a graphical interface for users to easily design and execute crawling tasks without writing complex codes. EasySpider offers simple and user-friendly tools to help users quickly scrape the data they need, and supports customized data and exportation, suitable for various crawling applications and data collection needs.

jaywcjlove
linux-command
jaywcjlove
33.6k

A comprehensive Linux command search tool that provides detailed command manuals, explanations, and learning resources. Suitable for users of all levels, from beginners to advanced users, it offers useful commands and tips, making it an ideal choice for mastering the Linux command line.

influxdata
influxdb
influxdata
30.2k

InfluxData - a scalable data storage written in go, used for metrics, events and real-time analysis

iawia002
lux
iawia002
29.8k

A cross-platform video download command line tool written in Go, supporting almost all video platforms such as TikTok, Bilibili, YouTube, etc., and can control the format, clarity and subtitles of the downloaded videos.

eugeneyan
applied-ml
eugeneyan
28.1k

A selection of papers, technical articles and well-known blogs related to data science and machine learning, covering 24 technical directions such as data engineering, natural language processing, computer vision, reinforcement learning, etc. Most of the articles come from world-renowned universities and enterprises.

qdrant
qdrant
qdrant
24.4k

A vector database for next-generation AI applications. It provides efficient vector indexing and retrieval functions, supporting fast similarity search and relevance calculation, suitable for various AI application fields.

taosdata
TDengine
taosdata
24.1k

A big data platform specifically designed and optimized for industries such as the Internet of Things (IoT) and application monitoring. Its database insertion and query operations are 10 times faster than other databases! It also consumes very low costs compared to other typical solutions in this category. TDengine only requires less than 1/5 of computing resources, and it provides interfaces for development in Java, C/C++, Python, Go, RESTful API, etc. Are you still worried about the performance of data writing, reading, and computing? With it, your hair survival rate will definitely increase significantly.

ItzCrazyKns
Perplexica
ItzCrazyKns
22.9k

Perplexica is an open-source, AI-powered search engine designed to deliver precise, up-to-date answers by leveraging advanced machine learning techniques like similarity searching and embeddings. It integrates with SearxNG to ensure privacy and real-time information retrieval. Key features include support for local LLMs (e.g., Llama3, Mixtral via Ollama), two search modes (Normal and Copilot), and six focus modes tailored for specific needs like academic research, YouTube, Reddit, and Wolfram Alpha queries. Perplexica offers an API for seamless integration into applications and supports Docker for easy installation. It prioritizes user privacy and provides cited sources for transparency, making it a versatile tool for enhanced web searching.

assafelovic
gpt-researcher
assafelovic
22.1k

The GPT Researcher is an autonomous agent designed to conduct comprehensive online research on a variety of tasks. The agent can generate detailed, factual, and unbiased research reports, offering customized options that focus on relevant resources, outlines, and curricula. Inspired by AutoGPT and recent Plan-and-Solve papers, the GPT Researcher addresses issues of speed and determinacy by employing parallel agent operations (as opposed to synchronous operations), thereby delivering more stable performance and increased speed.

TheAlgorithms
C
TheAlgorithms
20.4k

An open-source organization that provides C language implementations of various fundamental algorithms and data structures. The project includes sample code for basic algorithms, covering multiple programming languages, offering valuable resources for learning and understanding algorithms.

dailydotdev
daily
dailydotdev
19.5k

A developer-centric information aggregation platform that provides more than 350+ developer information sources and aggregates more than 10,000 technical tags, making it a good channel to get the latest development information.

zincsearch
zincsearch
zincsearch
17.5k

projectdiscovery
katana
projectdiscovery
13.9k

A new generation of crawler and spider framework designed to provide efficient and flexible web crawling and data extraction capabilities. Katana supports various crawling strategies and data processing methods, enabling it to adapt to complex web structures and dynamic content. It is suitable for data collection, information retrieval, cybersecurity, and other fields, providing users with powerful crawling and analysis tools.

weaviate
weaviate
weaviate
13.8k

An open-source vector database that stores objects and vectors, allowing the combination of vector search with structured filtering, with the fault tolerance and scalability of cloud-native databases, all accessible via GraphQL, REST, and various language clients.

smartcontractkit
full-blockchain-solidity-course-js
smartcontractkit
13.4k

Through this tutorial, you will learn about the principles and applications of various technology stacks such as blockchain, web development, smart contracts, cryptography, NFTs, etc.

AmazingAng
WTF-Solidity
AmazingAng
12.8k

An educational program that is updated every week, mainly explaining the basic development skills of Web3, contract security, digital signature, time lock, etc. It aims to provide developers with practical knowledge about Solidity smart contract development.

UFund-Me
Qbot
UFund-Me
12.0k

Qbot is an AI-oriented quantitative investment platform designed to harness the potential of quantitative investing through AI technology. The platform offers a suite of tools and features that enable investors to utilize AI for data analysis, model construction, and trade execution, bringing greater intelligence and efficiency to the field of quantitative investment.

neuml
txtai
neuml
11.2k

An open-source platform that integrates an embedded database, supports semantic search, LLM (large language model) orchestration, and language model workflows. By using embedding technology, txtai provides powerful text search and analysis capabilities, enabling developers to easily automate and optimize natural language processing tasks.

ccfos
nightingale
ccfos
11.0k

An integrated observability platform that combines the strengths of Prometheus and Grafana. The platform provides a comprehensive solution for alert management, metric visualization, logs, and tracing. Its powerful web UI design makes monitoring and data analysis more intuitive and efficient.

benbusby
whoogle-search
benbusby
10.8k

An open source meta search engine that provides users with a clean, ad-free Google meta search engine, focusing on privacy security and supporting hosting on private servers.

ssssssss-team
spider-flow
ssssssss-team
10.3k

A highly flexible and configurable crawler platform that allows users to implement crawlers without writing code, using a flowchart.

open-falcon
falcon-plus
open-falcon
7.3k

Falcon - Xiaomi's enterprise-level, highly available and scalable open source monitoring solution

MystenLabs
sui
MystenLabs
7.2k

The Modern Web Application Development Kit is designed to help developers build modern web applications faster. It provides a complete set of component libraries, build tools, and best practices that enable developers to build user-friendly web applications in an efficient and maintainable manner. SUI offers rich features and flexible design options to help developers quickly start projects and provide excellent user experiences.

alirezamika
autoscraper
alirezamika
6.8k

A smart web crawler script, its main function is to quickly and intelligently obtain data from a specified website. These data can be web page text, URL addresses or other HTML elements.

vespa-engine
vespa
vespa-engine
6.2k

A platform for applications that need low-latency computation on large data sets. It stores and indexes your structured, text, and vector data so that queries, selections, and processing as well as machine learning model inference can be executed quickly at any scale within service time. Functionality can be customized and extended using application components hosted in Vespa.

graphite-project
graphite-web
graphite-project
6.0k

Graphite - Graphite is a highly scalable real-time graphics system that runs well on inexpensive hardware

RediSearch
RediSearch
RediSearch
5.9k

A full-text search engine based on Redis. It provides high-performance full-text search capabilities, supports complex queries and filters, and can seamlessly integrate into the existing Redis environment.

xushengfeng
eSearch
xushengfeng
5.7k

Screenshot offline OCR search translation picture search paste picture on screen screen recorder

houbb
sensitive-word
houbb
5.2k

A high-performance Java sensitive word filtering tool framework based on DFA algorithm, with 60,000+ sensitive word library, supporting sensitive word judgment, return, desensitization and other operations, with excellent performance, rich functions, simple use and other characteristics.

facontidavide
PlotJuggler
facontidavide
5.1k

A tool for visualizing time series, which allows you to perform data analysis more intuitively by visualizing the data in the editor.

ffffffff0x
Digital-Privacy
ffffffff0x
4.9k

To protect personal data security, GitHub user ffffffff0x has compiled a set of solutions that integrate digital privacy collection, protection, and cleaning, as well as open source intelligence (OSINT) countermeasures.

myreader-io
myGPTReader
myreader-io
4.4k

coinpride
CryptoList
coinpride
4.3k

A collection of some information about blockchain and cryptocurrency

slowmist
Knowledge-Base
slowmist
4.3k

Solidity Security: A Comprehensive List of Known Attack Methods and Common Defense Patterns, translated by the SlowMist Security Team

crate
crate
crate
4.3k

CrateDB - CrateDB is a distributed SQL database that makes it easy to store and analyze large amounts of machine data in real time

bigchaindb
bigchaindb
bigchaindb
4.0k

BigchainDB - BigchainDB is a blockchain database

Boris-code
feapder
Boris-code
3.3k

A simple and powerful Python crawling framework. The usage is similar to Scrapy, with 3 built-in crawlers, supporting distributed, batch collection, data loss prevention, breakpoint resume, monitoring alarm, browser rendering download, etc.

kizniche
Mycodo
kizniche
3.1k

An environment monitoring and regulation system that can run on a Raspberry Pi, supporting applications such as planting plants, cultivating microorganisms, maintaining the homeostasis of honey bee hives, incubating animals and eggs, and maintaining aquatic systems. The collected data can be monitored and visualized in a web interface.

jumper2014
lianjia-beike-spider
jumper2014
3.0k

lianjia-beike-spider - Real estate price crawler in major cities

curiousily
Getting-Things-Done-with-Pytorch
curiousily
2.4k

"Getting Things Done with Pytorch" will teach you the basics of PyTorch, neural networks, image classification, face detection, sentiment analysis, and more.

Fabsqrt
BitTiger
Fabsqrt
2.1k

amirgamil
apollo
amirgamil
1.4k

An open-source personal search engine and web crawler that automatically crawls website text content, video subtitles and stores them after indexing website URLs. Then, users can quickly view the crawled content or access the source page of the website through search.

lorey
mlscraper
lorey
1.4k

An open-source Python crawler script that can automatically extract data from HTML pages based on machine learning. After providing the crawler with output results as examples, it will automatically extract rules and crawl page data without specifying CSS selectors.

infinispan
infinispan
infinispan
1.3k

Infinispan - An open source data grid platform and highly scalable NoSQL cloud data store

MLNLP-World
AI-Paper-Collector
MLNLP-World
1.2k

wangschang
web3.0
wangschang
1.1k

A comprehensive collection of learning materials about Web3, covering the basics of Web3, applications and projects, related sharing bloggers, video tutorials, related books, development resources and tools, and also some job opportunities related to Web3.

district0x
ethlance
district0x
707

A meaningful Dapp ethlance.com a blockchain-based freelancing platform

3nock
SpiderSuite
3nock
651

An open source, multi-functional GUI network security crawler tool designed for cybersecurity professionals. Currently supports Windows and Linux operating systems.

lmmentel
awesome-time-series
lmmentel
608

Time series related learning resources organization. It covers multiple common developer tools, visualization open source libraries, technical papers, open source tutorials and other contents.

hylinux1024
awesome-blockchain-articles
hylinux1024
569

janreges
siteone-crawler
janreges
508

A simple and powerful website analysis tool that can complete website analysis, performance testing, SEO optimization suggestions with one click, and export complete offline HTML analysis results for website analysis optimization.

centreon
centreon
centreon
131

Centreon - One of the most flexible and powerful monitoring software on the market

© 2025 GitHub Fun. All rights reserved.