Top Trending Github Projects on 04/02/2023: Discovering the Latest and Greatest in the Open-Source Community
Projects that have already appeared in previous Github ranking lists will not display details, please refer to previous lists for information.
Rank #1 twitter/the-algorithm
https://github.com/twitter/the-algorithm
Source code for Twitter’s Recommendation Algorithm
Language: Scala
Stars: 37,060(7,075 stars today) Forks:6,796
The Twitter Recommendation Algorithm is a set of services and jobs responsible for constructing and serving the Home Timeline on Twitter. This algorithm is designed to provide users with personalized content based on their interests and behaviors. The algorithm takes into account various factors such as user interactions, tweet content, and user preferences to generate a timeline that is tailored to each user’s interests.
The project includes multiple components such as community detection and sparse embeddings, dense knowledge graph embeddings for users and tweets, models for detecting NSFW or abusive content, a model to predict the likelihood of a Twitter user interacting with another user, and a Page-Rank algorithm for calculating Twitter user reputation.
The Twitter Recommendation Algorithm can be applied in various fields such as social media, marketing, and advertising. The algorithm’s ability to provide personalized content to users can help businesses target their audience more effectively and increase engagement.
Overall, the Twitter Recommendation Algorithm has significant commercial applications, particularly in the social media and advertising industries, where personalized content is essential for driving user engagement and increasing revenue.
— — — — — — — — — — — — — — — —
Rank #2 twitter/the-algorithm-ml
https://github.com/twitter/the-algorithm-ml
Source code for Twitter’s Recommendation Algorithm
Language: Python
Stars: 5,403(1,276 stars today) Forks:1,240
This project open sources some of the ML models used at Twitter, specifically the “For You” Heavy Ranker and TwHIN embeddings. The “For You” Heavy Ranker is a neural network model used to rank tweets for the Home Timeline, while TwHIN embeddings are dense knowledge graph embeddings for users and tweets.
This project can be run inside a Python virtualenv and has been tested on Linux machines with Nvidia GPUs. The project’s READMEs contain instructions on how to run each project.
The applications of this project are diverse, and the models can be applied in various fields such as social media, marketing, and advertising. The “For You” Heavy Ranker can help businesses target their audience more effectively by providing personalized content to users, while TwHIN embeddings can be used to analyze user behavior and preferences.
Overall, this project has significant commercial applications, particularly in the social media and advertising industries, where personalized content and user behavior analysis are essential for driving user engagement and increasing revenue.
— — — — — — — — — — — — — — — —
Rank #3 lm-sys/FastChat
https://github.com/lm-sys/FastChat
The release repo for “Vicuna: An Open Chatbot Impressing GPT-4”
Language: Python
Stars: 3,087(657 stars today) Forks:272
FastChat is an open platform for training, serving, and evaluating large language model-based chatbots. The project includes a distributed serving system, a fine-tuning module, an evaluation pipeline, and a web UI for interacting with chatbots.
The latest release of FastChat features Vicuna, an open-source chatbot that impresses with 90% ChatGPT quality. The project’s Discord server and Twitter account provide the latest updates on the project. The model weights will be released once all legal concerns are addressed and a low-resource version of the inference code is ready.
FastChat can be applied in various fields such as customer service, healthcare, and education, where chatbots can provide personalized assistance and improve user experience. The platform’s AI-enhanced evaluation pipeline can help businesses assess the performance of their chatbots and improve their quality.
Overall, FastChat has significant commercial applications, particularly in industries where chatbots are used to improve user experience and reduce costs. The platform’s distributed serving system and evaluation pipeline can help businesses develop and deploy high-quality chatbots with ease.
— — — — — — — — — — — — — — — —
Rank #4 microsoft/JARVIS
https://github.com/microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community
Language: Python
Stars: 1,107(335 stars today) Forks:37
JARVIS is a collaborative system that connects numerous AI models for solving complicated AI tasks. The system consists of a large language model (LLM) as the controller and numerous expert models as collaborative executors. JARVIS uses ChatGPT to analyze user requests and disassemble them into possible solvable sub-tasks. Based on the sub-tasks, ChatGPT invokes the corresponding models hosted on HuggingFace Hub for task execution. Finally, ChatGPT integrates the prediction of all models and generates a response.
JARVIS can be applied in various fields such as healthcare, finance, and customer service, where complex tasks require collaboration between multiple AI models. The system can help businesses improve their efficiency and reduce costs by automating complex tasks that would otherwise require human intervention.
Overall, JARVIS has significant commercial applications, particularly in industries where complex tasks can be automated using AI models. The system’s ability to connect numerous AI models and integrate their predictions can help businesses improve their productivity and reduce costs.
— — — — — — — — — — — — — — — —
Rank #5 ggerganov/llama.cpp
https://github.com/ggerganov/llama.cpp
Port of Facebook’s LLaMA model in C/C++
Language: C
Stars: 17,880(661 stars today) Forks:2,441
— — — — — — — — — — — — — — — —
Rank #6 mckaywrigley/ai-code-translator
https://github.com/mckaywrigley/ai-code-translator
Use AI to translate code from one language to another.
Language: TypeScript
Stars: 538(245 stars today) Forks:65
AI Code Translator is a web application that uses AI to translate code from one programming language to another. The project includes a user interface that allows users to input code in one language and receive a translation in another language.
AI Code Translator can be applied in various fields such as software development and education, where code translation is essential for cross-platform compatibility and learning different programming languages. The application can help developers and students save time and effort by automating the code translation process.
Overall, AI Code Translator has significant commercial applications, particularly in the software development and education industries, where code translation is a common task. The application’s ability to automate code translation can help businesses and individuals improve their productivity and reduce costs.
— — — — — — — — — — — — — — — —
Rank #7 hahahumble/speechgpt
https://github.com/hahahumble/speechgpt
SpeechGPT is a web application that enables you to converse with ChatGPT.
Language: TypeScript
Stars: 1,128(271 stars today) Forks:149
— — — — — — — — — — — — — — — —
Rank #8 helix-editor/helix
https://github.com/helix-editor/helix
A post-modern modal text editor.
Language: Rust
Stars: 20,492(137 stars today) Forks:1,419
— — — — — — — — — — — — — — — —
Rank #9 twitter/opensource-website
https://github.com/twitter/opensource-website
Twitter’s open source website, identifying projects we’ve released, organizations we support, and the work we do to support open source.
Language: SCSS
Stars: 2,159(45 stars today) Forks:665
The opensource.twitter.dev project is a website built by Twitter that showcases their open source projects. The website is built using the Hugo static site generator and a custom-built theme. Python scripts are used to periodically update repository data. This project can be applied in various fields such as web development, data analysis, and social media management. The commercial applications of this project include providing developers with access to Twitter’s open source projects, allowing them to incorporate Twitter’s technology into their own projects, and promoting collaboration within the developer community.
— — — — — — — — — — — — — — — —
Rank #10 Synaptrix/ChatGPT-Desktop
https://github.com/Synaptrix/ChatGPT-Desktop
ChatGPT 跨平台客户端,快捷键快速唤醒窗口,问答快人一步!
Language: Vue
Stars: 838(248 stars today) Forks:86
— — — — — — — — — — — — — — — —
Rank #11 ZrrSkywalker/LLaMA-Adapter
https://github.com/ZrrSkywalker/LLaMA-Adapter
Fine-tuning LLaMA to follow instructions within 1 Hour and 1.2M Parameters
Language: Python
Stars: 1,591(134 stars today) Forks:102
— — — — — — — — — — — — — — — —
Rank #12 ggerganov/kbd-audio
https://github.com/ggerganov/kbd-audio
🎤⌨️ Acoustic keyboard eavesdropping
Language: C++
Stars: 7,046(177 stars today) Forks:504
kbd-audio is a collection of command-line and GUI tools for capturing and analyzing audio data. The most interesting tool in the collection is called keytap, which can guess pressed keyboard keys by analyzing audio captured from the computer’s microphone. Another tool, keytap2, can recover text from audio using statistical information about the frequencies of letters and n-grams in the English language. Keytap3 is the latest version, which introduces significant algorithm improvements and better n-gram statistics compared to keytap2.
kbd-audio can be applied in various fields such as cybersecurity, where it can be used to test the security of keyboards and other input devices. The tool can help businesses and individuals improve their security by identifying vulnerabilities in their input devices.
Overall, kbd-audio has significant commercial applications, particularly in the cybersecurity industry, where identifying vulnerabilities in input devices is essential for protecting sensitive information. The tool’s ability to analyze audio data can help businesses and individuals improve their security and reduce the risk of data breaches.
— — — — — — — — — — — — — — — —
Rank #13 moeakwak/chatgpt-web-share
https://github.com/moeakwak/chatgpt-web-share
A web application that allows multiple users to share one ChatGPT account at the same time, developed using unofficial ChatGPT API & fastapi & vue3. Supports GPT-4! 共享 ChatGPT 账号给多用户同时使用,可用于朋友之间共享或合租 ChatGPT 账号。使用 FastAPI + Vue3 开发。支持 ChatGPT Plus / 使用 GPT-4 / 用户请求限制等功能。
Language: Vue
Stars: 401(52 stars today) Forks:66
The ChatGPT Web Share project is a web application developed using FastAPI and Vue3 that allows multiple users to share a ChatGPT account. This project can be applied in various fields such as natural language processing, chatbot development, and artificial intelligence. The commercial applications of this project include providing developers and businesses with a cost-effective way to access the ChatGPT API, allowing them to develop chatbots and other natural language processing applications. Additionally, this project allows multiple users to share a single ChatGPT account, making it ideal for collaborative projects or small businesses with limited resources. The project also supports advanced features such as the ability to export conversations as Markdown or PDF files, GPT-4 support, and user request limits.
— — — — — — — — — — — — — — — —
Rank #14 donnemartin/system-design-primer
https://github.com/donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Language: Python
Stars: 215,843(137 stars today) Forks:38,446
The System Design Primer is an organized collection of resources to help engineers learn how to design large-scale systems. It is a continually updated, open-source project that provides a vast amount of resources scattered throughout the web on system design principles. The project aims to help engineers become better at designing scalable systems and prepare for system design interviews, which are a required component of the technical interview process at many tech companies. The System Design Primer offers various topics for interview prep, including a study guide, tips on how to approach a system design interview question, system design interview questions with solutions, object-oriented design interview questions with solutions, and additional system design interview questions. The project also provides Anki flashcard decks to help users memorize key concepts and terminology. The System Design Primer can be applied in various fields, including software engineering, computer science, and information technology. It has commercial applications in the tech industry, where system design skills are essential for building scalable and reliable systems that can handle large amounts of data and traffic.
— — — — — — — — — — — — — — — —
Rank #15 Bin-Huang/chatbox
https://github.com/Bin-Huang/chatbox
A desktop app for ChatGPT API (OpenAI API) that supports Windows, Mac & Linux. 开源的ChatGPT桌面应用,prompt 开发神器,全平台支持,下载安装包就能用
Language: TypeScript
Stars: 5,194(305 stars today) Forks:496
— — — — — — — — — — — — — — — —
Rank #16 Torantulino/Auto-GPT
https://github.com/Torantulino/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Language: Python
Stars: 737(590 stars today) Forks:68
Auto-GPT is an open-source experimental application that showcases the capabilities of the GPT-4 language model. This program is driven by GPT-4 and autonomously develops and manages businesses to increase net worth. Auto-GPT is one of the first examples of GPT-4 running fully autonomously, pushing the boundaries of what is possible with AI. The project can be applied in various fields, including artificial intelligence, machine learning, and natural language processing. The commercial applications of Auto-GPT are vast, as it can potentially be used to automate various business processes, such as customer service, content creation, and data analysis. However, it is important to note that Auto-GPT is an experimental project and should be used with caution. The project requires Python 3.7 or later and an OpenAI API key to run. Auto-GPT features include internet access for searches and information gathering, long-term and short-term memory management, GPT-4 instances for text generation, access to popular websites and platforms, file storage and summarization with GPT-3.5. The project is still in development, and users can contribute to its development by funding the API costs or becoming a sponsor.
— — — — — — — — — — — — — — — —
Rank #17 twitter/twemoji
https://github.com/twitter/twemoji
Emoji for everyone. https://twemoji.twitter.com/
Language: HTML
Stars: 15,681(104 stars today) Forks:1,852
— — — — — — — — — — — — — — — —
Rank #18 ymcui/Chinese-LLaMA-Alpaca
https://github.com/ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地部署 (Chinese LLaMA & Alpaca LLMs)
Language: Python
Stars: 1,670(268 stars today) Forks:136
— — — — — — — — — — — — — — — —
Rank #19 twitter/rezolus
https://github.com/twitter/rezolus
Systems performance telemetry
Language: Rust
Stars: 1,346(55 stars today) Forks:96
Rezolus is a tool designed for collecting systems performance telemetry and exposing burst patterns through high-resolution telemetry. It provides instrumentation of basic systems metrics, performance counters, and support for eBPF (extended Berkeley Packet Filter) telemetry. This project can be applied in various fields such as systems performance analysis, performance optimization, and performance diagnostics. The commercial applications of this project include providing businesses with a sophisticated tool for capturing performance anomalies, profiling systems performance, and conducting performance diagnostics. Rezolus collects metrics from traditional sources such as procfs and sysfs to provide basic telemetry for CPU, disk, and network. It exports CPU utilization, disk bandwidth, disk IOPs, network bandwidth, network packet rate, network errors, as well as TCP and UDP protocol counters. Additionally, Rezolus supports Perf Events for hardware and software events and BPF to instrument kernel and user space activities. With its support for BPF as well as more common telemetry sources, Rezolus is a powerful tool for capturing performance data and analyzing performance issues.
— — — — — — — — — — — — — — — —
Rank #20 PathOfBuildingCommunity/PathOfBuilding
https://github.com/PathOfBuildingCommunity/PathOfBuilding
Offline build planner for Path of Exile.
Language: Lua
Stars: 2,744(24 stars today) Forks:1,733
Path of Building Community Fork is a fork of Openarl’s Path of Building, an offline build planner for Path of Exile. The project includes many extra features not found in the base version of PoB, such as support for new and updated uniques from league 3.8 onwards, pantheon support, impale DPS support, enemy armor and physical damage reduction calculations, catalysts, dual influence crafting, and more. The project can be applied in the field of gaming and can be used to plan and optimize builds in Path of Exile. The commercial applications of Path of Building Community Fork are limited as it is an open-source project designed for personal use. However, the project can potentially be used by gaming companies to improve their game’s planning and optimization features. The project is available for download from the releases page, and users can contribute to its development by providing code and bug reports. The project features a comprehensive offense and defense calculation system, passive skill tree planner, support for jewels, and skill planning.
— — — — — — — — — — — — — — — —
Rank #21 twitter/GraphJet
https://github.com/twitter/GraphJet
GraphJet is a real-time graph processing library.
Language: Java
Stars: 567(31 stars today) Forks:93
GraphJet is a real-time graph processing library written in Java that maintains a full graph index over a sliding time window in memory on a single server. This index supports various graph algorithms, including personalized recommendation algorithms based on collaborative filtering. The project powers a variety of real-time recommendation services within Twitter, notably content (tweets/URLs) recommendations that require collaborative filtering over a heterogeneous, rapidly evolving graph. GraphJet is designed to support rapid ingestion of edges in an evolving graph while concurrently serving lookup queries through compact edge encoding and a dynamic memory allocation scheme. Each GraphJet server can ingest up to one million graph edges per second and compute up to 500 recommendations per second, translating into several million edge read operations per second. The project can be applied in the field of social media and can be used by companies to improve their recommendation services. The commercial applications of GraphJet are vast, as it can potentially be used to improve recommendation services in various fields, including e-commerce, content recommendation, and social media. The project is available on GitHub and includes a demo that reads from the Twitter public sample stream. Users can interact with the graph via a REST API, running on port 8888 by default.
— — — — — — — — — — — — — — — —
Rank #22 new-ai-company/ide
https://github.com/new-ai-company/ide
$NAME is an IDE powered by AI agents. Developers describe what they want to build by writing documentation. Then let AI agents with access to tools do the coding work.
Language: Python
Stars: 732(171 stars today) Forks:37
— — — — — — — — — — — — — — — —
Rank #23 twitter/finagle
https://github.com/twitter/finagle
A fault tolerant, protocol-agnostic RPC system
Language: Scala
Stars: 8,538(19 stars today) Forks:1,453
Finagle is an extensible RPC system for the JVM used to construct high-concurrency servers. The project implements uniform client and server APIs for several protocols and is designed for high performance and concurrency. Finagle’s code is protocol agnostic, simplifying the implementation of new protocols. The project is used in production at Twitter and many other organizations and is actively developed and maintained. Finagle can be applied in the field of server development and can be used to construct high-concurrency servers. The commercial applications of Finagle are vast, as it can potentially be used to improve server performance and concurrency in various fields, including e-commerce, social media, and content delivery networks. The project is available on GitHub and includes extensive documentation, a user guide, and API documentation. The project is used by several companies, including Foursquare, ING Bank, Pinterest, SoundCloud, Tumblr, and Twitter. The project welcomes contributions, and users are encouraged to follow Twitter’s Open Source Code of Conduct in all interactions with the community.
— — — — — — — — — — — — — — — —
Rank #24 curl/urler
https://github.com/curl/urler
command line tool for URL parsing and manipulation
Language: C
Stars: 213(63 stars today) Forks:7
Urler is a command-line tool for URL parsing and manipulation. The project allows users to parse and manipulate URLs through various command-line options. The project’s original idea came from a post on the curl mailing list. Urler can be applied in the field of web development and can be used to parse and manipulate URLs in various web applications. The commercial applications of Urler are limited, as it is an open-source project designed for personal use. However, the project can potentially be used by web development companies to improve their web application’s URL parsing and manipulation features. The project is available on GitHub and can be installed on Linux by compiling the C source with GCC. Users need to install the development files of libcurl (e.g., libcurl4-openssl-dev or libcurl4-gnutls-dev) for compilation. The project includes several example command lines demonstrating how to use Urler for URL parsing and manipulation.
— — — — — — — — — — — — — — — —
Rank #25 microsoft/ML-For-Beginners
https://github.com/microsoft/ML-For-Beginners
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Language: Jupyter Notebook
Stars: 45,906(223 stars today) Forks:9,522
Machine Learning for Beginners — A Curriculum is a 12-week, 26-lesson curriculum offered by Azure Cloud Advocates at Microsoft. The curriculum is designed to teach classic machine learning techniques using primarily Scikit-learn as a library and avoiding deep learning, which is covered in their forthcoming ‘AI for Beginners’ curriculum. The project-based pedagogy allows learners to learn while building, a proven way for new skills to ‘stick’. The curriculum includes pre- and post-lesson quizzes, written instructions to complete the lesson, a solution, an assignment, and more. The project can be applied in the field of education and can be used by students to learn machine learning techniques. The commercial applications of the project are vast, as machine learning is a rapidly growing field, and the demand for skilled professionals is high. The project is available on GitHub and includes contributions from several authors, illustrators, and Microsoft Student Ambassador authors, reviewers, and content contributors. Students can use the curriculum by forking the entire repo to their own GitHub account and completing the exercises on their own.
— — — — — — — — — — — — — — — —