Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems Paper • 2604.04936 • Published Jan 8 • 26
view post Post 6062 Native tensor parallel has landed in transformers!!! https://github.com/huggingface/transformers/pull/34184 thanks a lot to the torch team for their support! Contributions are welcome to support more models! 🔥 🔥 15 15 ❤️ 5 5 🤯 3 3 🤝 3 3 + Reply
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval Paper • 2408.00441 • Published Aug 1, 2024 • 1
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation Paper • 2303.12343 • Published Mar 22, 2023 • 2
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published Apr 3, 2025 • 88
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper • 2506.16035 • Published Jun 19, 2025 • 89