A high-throughput and memory-efficient inference and serving engine for LLMs. (Fork for contributing; all changes are intended for upstream PRs.)
A universal sandbox platform for AI application scenarios, providing multi-language SDKs, unified sandbox protocols, and sandbox runtimes for LLM-related capabilities.
An easy-to-use dynamic service discovery, configuration, and service management platform for building cloud-native applications.
A Go web framework for quickly building online recommendation services based on JSON configuration.
Official repository for the paper "LaTo: Landmark-tokenized Diffusion Transformer for Fine-grained Human Face Editing".