
x/LocalLLaMA

Local LLMs, Self-Hosting, and Hardware

8.1K members
Topic: Artificial Intelligence
Restricted: join requests require moderator approval
8 tweets
Columns:
# Tweet User Followers Views Ratio Engagement Posted
1
[text] 1/ 🧵 I just cracked open the Claude Code source — and what I found isn’t “just a smarter terminal chat.” It’s a full-blown behavioral observatory running in your machine. 1. Keyword sniffers. 2. Hesitation trackers. 3. Hidden trigger words. 4. Telemetry that fingerprints
@UsmanReads 357 53.3K 149.3x 541 Mar 31
2
[image] Just some numbers so you don’t get misled RTX 3090 (7 years old) > 24GB VRAM > Bandwidth: 936.2 GB/s > Bi-directional NVLink 112GB/s RTX PRO 4000 > 24GB VRAM > Bandwidth: 672 GB/s > No Bi-directional NVLink, > need 32 Gen. 5 PCIe Lanes to pool 2 at 64GB/s
@TheAhmadOsman 52.2K 42.1K 0.8x 286 Apr 6
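A quick sanity check of the numbers in tweet 2, as a rough sketch. The memory-bandwidth and NVLink figures are copied from the tweet. The PCIe constants (32 GT/s per lane, 128b/130b encoding) are standard PCIe 5.0 values. Reading "32 lanes" as two cards on x16 links each is an assumption, as are the helper names.

```python
# Figures quoted in the tweet (GB/s).
RTX_3090_MEM_BW = 936.2
RTX_PRO_4000_MEM_BW = 672.0
RTX_3090_NVLINK_BW = 112.0

# PCIe 5.0 link parameters (standard values, not from the tweet).
PCIE5_GT_PER_LANE = 32.0      # gigatransfers per second per lane
PCIE5_ENCODING = 128 / 130    # 128b/130b line-encoding efficiency


def pcie5_bandwidth_gbs(lanes: int) -> float:
    """Approximate one-direction bandwidth in GB/s for a PCIe 5.0 link."""
    return PCIE5_GT_PER_LANE * PCIE5_ENCODING / 8 * lanes


def lanes_needed(cards: int, lanes_per_card: int = 16) -> int:
    """Total lanes if every card gets its own link (assumed x16 each)."""
    return cards * lanes_per_card


mem_bw_ratio = RTX_3090_MEM_BW / RTX_PRO_4000_MEM_BW
x16_bw = pcie5_bandwidth_gbs(16)

print(f"3090 vs PRO 4000 memory bandwidth: {mem_bw_ratio:.2f}x")
print(f"PCIe 5.0 x16, one direction: {x16_bw:.1f} GB/s")
print(f"Lanes for two cards at x16: {lanes_needed(2)}")
print(f"3090 NVLink vs PCIe 5.0 x16: {RTX_3090_NVLINK_BW / x16_bw:.2f}x")
```

Under these assumptions, an x16 Gen 5 link comes out just under the 64 GB/s quoted in the tweet, since the encoding overhead takes about 1.5% off the raw rate.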
3
[image] How am I connecting the DGX Spark cluster > Mikrotik CRS804-4DDQ 1.6Tbps switch > (4) 400G QSFP-DD to 2x 200G QSFP56 Each DGX Spark has a ConnectX-7 supporting 200Gbps Each cable out of the switch goes into 2 DGX Sparks This allows 8x DGX Sparks cluster at the full 1.6Tbps
@TheAhmadOsman 43.3K 33.0K 0.8x 214 Feb 19
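The port arithmetic in tweet 3 can be checked with a small sketch. All four figures come from the tweet (4 switch ports at 400G, each broken out into 2x 200G, one 200Gbps ConnectX-7 per DGX Spark). The variable names are illustrative only.

```python
# Topology figures quoted in the tweet.
SWITCH_PORTS = 4          # 400G QSFP-DD ports in use
PORT_GBPS = 400           # per switch port
BREAKOUT_PER_PORT = 2     # each port splits into 2x 200G QSFP56
NIC_GBPS = 200            # ConnectX-7 link speed per DGX Spark

nodes = SWITCH_PORTS * BREAKOUT_PER_PORT        # DGX Sparks attached
switch_capacity = SWITCH_PORTS * PORT_GBPS      # Gbps available at the switch
node_demand = nodes * NIC_GBPS                  # Gbps if every NIC runs at line rate
oversubscription = node_demand / switch_capacity

print(f"nodes: {nodes}")
print(f"switch capacity: {switch_capacity / 1000} Tbps")
print(f"aggregate NIC bandwidth: {node_demand / 1000} Tbps")
print(f"oversubscription ratio: {oversubscription}")
```

A ratio of 1.0 means the aggregate NIC bandwidth matches the switch capacity exactly, which is consistent with the tweet's claim that 8 nodes fit at the full 1.6Tbps.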
4
[image] okay let me say this out loud again. if you want to run local models on a single RTX 3090, your best option right now is qwen 3.5 27B dense Q4_K_M. 35 tok/s, flat from 4K to 300K+ context, zero speed degradation. thinking mode works. 262K native context on 24GB. slower than MoE
@sudoingX 20.2K 27.9K 1.4x 650 Mar 28
5
[text] Most people think VRAM = model size and that’s why their runs crash GPU memory math is complex and so are the implications Here’s how it actually works in a nutshell ↓
@TheAhmadOsman 51.8K 23.7K 0.5x 239 Apr 3
6
[text] Help me spread this, I am on a roll and need to squeeze every last bit out of it.
@0xSero 32.3K 16.7K 0.5x 333 Mar 20
7
[image] which one of you is this?
@TheAhmadOsman 51.5K 12.1K 0.2x 235 Apr 2
8
[image] RTX PRO 6000 (96GB VRAM, ~$15K) GIVEAWAY FAQ Q: Cost to enter? A: $0. Free. Q: Do I have to register for GTC? A: Yes, virtual attendance is COMPLETELY FREE Q: Where do I enter? A: Tap the link in my bio, there’s a clear button on the page Q: How do I increase my chances? A:
@TheAhmadOsman 46.0K 9.5K 0.2x 89 Mar 7
9
[text] I meant to post this in here, I will be posting weekly models meant for limited hardware budgets, RN I am learning how to deal with the 30b~ class
@0xSero 37.1K 6.0K 0.2x 94 Mar 28