Keeps: post title, subreddit, score
<!-- SC_OFF --><div class="md"><p>Has anyone here found a good middle ground between fully local inference and using the big closed AI platforms?</p> <p>I’m asking because I’ve been experimenting with running Qwen3.6 through a hosted ChatGPT-style interface, mostly for people who
reddit:LocalLLaMA<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tvekng/microsoft_aion_10_instruct_and_aion_10_plan_models/"> <img alt="Microsoft Aion 1.0 Instruct and Aion 1.0 Plan models!" src="https://preview.redd.it/nuy17exhvz4h1.png?width=640&crop=smart&auto=w
reddit:LocalLLaMA- 2026-06-03Nous Research — Hermes Desktop
<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tve7qu/nous_research_hermes_desktop/"> <img alt="Nous Research — Hermes Desktop" src="https://external-preview.redd.it/YsrcFIGvS74zgZKjcNaDioCdkpUsGbcauVBqSrCHsvk.png?width=640&crop=smart&auto=webp&am
reddit:LocalLLaMA - 2026-06-03DCT2 interview
<!-- SC_OFF --><div class="md"><p>Hey all,</p> <p>So a couple of months ago, I started applying for roles at Google. I got set up with a recruiter and was put into the DCT1 round of interviews, which all went great! However, I later found out there were no open positions left at
reddit:datacenter - 2026-06-03Senior level interview questions
<!-- SC_OFF --><div class="md"><p>I want to know what are those questions that should be asked for senior level positions in process engineer and that interviewers expect you to ask to show seniority.</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.c
reddit:semiconductors <!-- SC_OFF --><div class="md"><p>This post contains content not supported on old Reddit. <a href="https://sh.reddit.com/r/NVDA_Stock/comments/1tve34f">Click here to view the full post</a></p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/dail
reddit:NVDA_Stock- 2026-06-03Daily Discussion Wednesday 2026-06-03
  submitted by   <a href="https://www.reddit.com/user/AutoModerator"> /u/AutoModerator </a> <br /> <span><a href="https://www.reddit.com/r/AMD_Stock/comments/1tve2x3/daily_discussion_wednesday_20260603/">[link]</a></span>   <span><a href="https://www.reddit.com/r/AMD_
reddit:AMD_Stock <table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tvdqt8/discussions_about_the_tiananmen_square_incident/"> <img alt="Discussions about the Tiananmen Square incident on LocalLLaMA" src="https://preview.redd.it/4hxaqki0vz4h1.png?width=140&height=95&au
reddit:LocalLLaMA<table> <tr><td> <a href="https://www.reddit.com/r/hardware/comments/1tvdaym/inference_agentic_ai_race_groq_lpu_vs_sambanova/"> <img alt="Inference + Agentic AI race (groq LPU vs SambaNova RDU) vs alternatives for Decode" src="https://preview.redd.it/s24fyll0hz4h1.png?width=140&a
reddit:hardware<!-- SC_OFF --><div class="md"><p>This is a lesson from a great business book I read many years ago called The Goal by Eliyahu M. Goldratt. The book discusses this idea in the context of manufacturing but I think it widely applies to any production system including software.</p>
reddit:ExperiencedDevs<!-- SC_OFF --><div class="md"><p>Advanced Micro Devices, Inc. Bank of America 2026 Global Technology Conference June 2, 2026 2:20 PM EDT</p> <p><strong>Company Participants</strong></p> <p>Jean Hu - Executive VP, CFO & Treasurer<br /> Matthew Ramsay - Vice President of Finan
reddit:AMD_Stock<!-- SC_OFF --><div class="md"><p>Huawei Technologies announced on May 25 that it will produce industry-leading semiconductors using a new technology in five years, according to news coverage.</p> <p>Huawei Rotating Chairman and Deputy Chairman Xu Zhijun suggested in a recent int
reddit:semiconductors<!-- SC_OFF --><div class="md"><p>Trying to run MiniMax M2.7 NVFP4 via llama.cpp but not seeing any GGUFs anywhere on huggingface. So I’m guessing I would need to quantize to NVFP4.GGUF myself. Is this possible with llama.cpp, and if so, what commands need to be run to make this
reddit:LocalLLaMA<!-- SC_OFF --><div class="md"><p>The mixed precision quant discussion here lately, MoE aware stuff that keeps shared experts and the edge layers at higher precision is great, but it's almost all measured against perplexity and general output quality. What I never see is structur
reddit:LocalLLaMA- 2026-06-03Are GPUs getting cheaper?
<!-- SC_OFF --><div class="md"><p>I've noticed that GPUs on the <strong>lower end</strong> such as the 5060 TIs and even the Radeon 9700s are getting cheaper or having discounts online.</p> <p>It seemed to be in direct contrast to the trends that we see of more and more GPU manuf
reddit:LocalLLaMA <table> <tr><td> <a href="https://www.reddit.com/r/MachineLearning/comments/1tvameq/minimax_dropped_a_new_attention_architecture_n/"> <img alt="MiniMax dropped a new attention architecture. [N]" src="https://preview.redd.it/gvokff4l0z4h1.png?width=140&height=80&auto=webp&
reddit:MachineLearning<!-- SC_OFF --><div class="md"><p>Hiya!</p> <p>** I have read the rules**</p> <p>I am a few weeks out from beta launching a mobile app (iOS and Android) that tracks skincare, haircare, makeup, and wellness all in one place.</p> <p>The core problem I am trying to solve is that the
reddit:startups<!-- SC_OFF --><div class="md"><p>- Good pay compared to many other professions</p> <p>- Remote/hybrid work options</p> <p>- Opportunities to travel or work in different countries</p> <p>- Working with global teams</p> <p>- Good work-life balance</p> <p>- Problem-solving and cont
reddit:ExperiencedDevs- 2026-06-03Storing user accounts in Supabase?
<!-- SC_OFF --><div class="md"><p>Is it safe?</p> <p>I know nothing.. help me</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/Free-Ant-463"> /u/Free-Ant-463 </a> <br /> <span><a href="https://www.reddit.com/r/webdev/comments/1tvadi6/storing_
reddit:webdev <!-- SC_OFF --><div class="md"><p>I got a direct `llama-bench` row over 100 t/s on AMD Strix Halo / Ryzen AI MAX+ 395.</p> <p>Broader framing: I’m trying to document what a ~$4k unified-memory local AI PC can actually do for LLMs, with raw data instead of scattered anecdotes.</p>
reddit:LocalLLaMA<!-- SC_OFF --><div class="md"><p>Pretty much the title. Interested in knowing what it is working for them and what salary should I negotiate if I get the offer or any signon bonus. TIA</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/MotorOw
reddit:datacenter<table> <tr><td> <a href="https://www.reddit.com/r/hardware/comments/1tv9s4o/intel_is_struggling_to_supply_laptop_chips_built/"> <img alt="Intel is Struggling to Supply Laptop Chips Built Around its New 18A Node" src="https://external-preview.redd.it/88Y0A3ITAVJjmT3j7cy1KAgUJGZ2J
reddit:hardware- 2026-06-03Remember the real enemy
<!-- SC_OFF --><div class="md"><p>Last week I was part of an abrupt layoff that resulted in our whole team's contract not being renewed.</p> <p>Our VP of recruitment has been shopping around for companies that will pick up our team. One of the devs on the team had just escaped to
reddit:ExperiencedDevs - 2026-06-03Are my skills transferable?
<!-- SC_OFF --><div class="md"><p>18 years in HVAC, and with data centers going up near where I live eventually, I’m wondering if a strong mechanical background would help finding a job at one. Are there on site facility maintenance personnel? I’m not necessarily looking to leave
reddit:datacenter <!-- SC_OFF --><div class="md"><p>AI agents have made it much easier and efficient to deploy features quickly but I’m wondering how DevOps teams are thinking about the long-term consequences.</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/M
reddit:devops- 2026-06-03[Machines & More] Noctua's next-gen low profile cooler is coming... (new NH-L12 prototype)
<table> <tr><td> <a href="https://www.reddit.com/r/hardware/comments/1tv8rko/machines_more_noctuas_nextgen_low_profile_cooler/"> <img alt="[Machines & More] Noctua's next-gen low profile cooler is coming... (new NH-L12 prototype)" src="https://external-preview.redd.it/UdPL2Pl
reddit:hardware - 2026-06-02Morality questions
<!-- SC_OFF --><div class="md"><p>I’ve been working in data centers for almost a decade now and I’m wondering if anyone else is starting to feel this unease with all the public scrutiny and hatred surrounding our industry. </p> <p>I am a generally a politically neutral person who
reddit:datacenter <!-- SC_OFF --><div class="md"><p>Building data center in hot climate is the dumbest thing i have heard yet. Building in drought areas can only be 1. That’s the cheapest land and they have inside knowledge on changing tech that won’t need water. Or 2. They plan on building water
reddit:datacenter- 2026-06-02Role now involves only reviewing code from more senior developers: have you experienced this?
<!-- SC_OFF --><div class="md"><p>I have been five years at my current job (my first developer job). Our team has recently experienced a lot of changes due to layoffs, financial headwinds, and, of course, AI mandates. </p> <p>I’m afraid, however, it’s taken a turn for the worse f
reddit:ExperiencedDevs <!-- SC_OFF --><div class="md"><p>I've been researching how founders are handling cybersecurity, especially with the current speed of development with AI.</p> <p>For those of you building companies, I'm curious:</p> <ul> <li>What are you using for cloud infrastructure and data st
reddit:startups- 2026-06-02Top 3 schools that can get you in
<!-- SC_OFF --><div class="md"><p>I'm curious to know from DC people where they got there degrees from. And how fast they got through it etc. I don't want to go to any school that has DC like programs, but typically won't get you in.</p> </div><!-- SC_ON -->   submitted by &#
reddit:datacenter - 2026-06-02Best database provider?
<!-- SC_OFF --><div class="md"><p>I'm trying to pick a stack to use for all my freelance web dev work.</p> <p>I plan on building scalable ecommerce websites.</p> <p>I am currently using Node, React, and Docker/Cloudify for deployment on VPS.</p> <p>What is the best option for dat
reddit:webdev <!-- SC_OFF --><div class="md"><p>I'm cases where ram is limited I've seen a preference for increasing kvcache precision instead of the weight precision.</p> <p>I.e. 8bit kvcache but only 4bit weights. </p> <p>But I can't seem to find a solid explanation as to why?</p> </div><!--
reddit:LocalLLaMA- 2026-06-02Weird issue with OpenCode and Qwen3.6
<!-- SC_OFF --><div class="md"><p>I’m using Qwen3.6-27B running on my server with llama-server for AI coding with OpenCode. Sometimes for some reason, the response stops when its reasoning like if it has finished outputting the full response. I have to type “continue” and it cont
reddit:LocalLLaMA <!-- SC_OFF --><div class="md"><p>Diamond is 5x more thermally conductive than copper. The demo shows it melting through ice in seconds. That same speed is what pulls heat off a GPU before it throttles. <a href="https://youtu.be/2D0MmRoEffg?si=LzkMigOnTCgx4Bt2">https://youtu.be/2
reddit:datacenter<!-- SC_OFF --><div class="md"><p>Can it be done? Thank you.</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/ComfortablePost3664"> /u/ComfortablePost3664 </a> <br /> <span><a href="https://www.reddit.com/r/devops/comments/1tv5b9o/my_friend_t
reddit:devops<!-- SC_OFF --><div class="md"><h1>Potential Fiber Optic Career on Weekdays (5PM-12PM) and Weekends All day? Is it possible ?</h1> <p>Hi guys I was interested in starting a fiber optic tech career in the chicago land area and was wondering if working a traditional office job from
reddit:datacenter- 2026-06-02$15K for a Wix site?
<!-- SC_OFF --><div class="md"><p>I work for a nonprofit that’s had an outdated website for decades at this point. Upper management is kinda desperate and is getting quoted left and right.</p> <p>$15K for a Wix site which includes: event management, volunteer management, shop, do
reddit:webdev <!-- SC_OFF --><div class="md"><p>Are you using a specific third party memory system for your agents, like claude code but also Hermes and OpenClaw? Or are you using the memory system that ships with it? Curious to see if people here have made good experiences with third party me
reddit:LocalLLaMA<!-- SC_OFF --><div class="md"><p>I found many old LTO and DLT tapes during an office cleanup operation. They seem to date back to the early 2000s from the looks of it. Very few markings on these tapes, and absolutely nothing else. </p> <p>The IT department doesn't know what's on
reddit:datacenter<!-- SC_OFF --><div class="md"><p>Just found out that Eazzy, a home services and appliance lifecycle management platform, just got funded.<br /> I dont get it that if Urban Company exists and is dominating, and while a platform like this doesn't has a moat, why would VCs back the
reddit:startups<!-- SC_OFF --><div class="md"><p>I pulled some salary data to see what hardware engineer compensation at the mid-career level looked like across different industries. Most of the time I'm looking at software engineers in tech, so I thought it'd be interesting to dig into the har
reddit:semiconductors<!-- SC_OFF --><div class="md"><p>Thank you so much for your work. We love you!</p> <p>Also, PSA to the 5 people here who build llama.cpp on Nixos. Its working!</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/Xyklone"> /u/Xyklone </a> <br />
reddit:LocalLLaMA- 2026-06-02In Q8_0 weight quantization, why can't we just skip blocks of 32 that have very large outliers?
<!-- SC_OFF --><div class="md"><p>Looking for someone with an expert-level understanding.</p> <p>I understand that we can skip layers and sub-layers when doing quantization, but why can't we skip blocks? I am using Q8_0 as it's a simple example. Every block of 32 values has a sca
reddit:LocalLLaMA <!-- SC_OFF --><div class="md"><p>Just found out that Eazzy, a home services and appliance lifecycle management platform, just got funded.<br /> I dont get it that if Urban Company exists and is dominating, and while a platform like this doesn't has a moat, why would VCs back the
reddit:startups<!-- SC_OFF --><div class="md"><p>Hey guys,</p> <p>I’ve been a PM in the Tech sector for >10 years and have just been made redundant. </p> <p>I’d love to be able to transition into Data Centre construction.</p> <p>I do not have a technical or construction background. </p> <p>A
reddit:datacenter- 2026-06-02Mellum2-12B-A2.5B-Thinking-GGUF at Q8
<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tv2q7r/mellum212ba25bthinkinggguf_at_q8/"> <img alt="Mellum2-12B-A2.5B-Thinking-GGUF at Q8" src="https://preview.redd.it/mc6qwqaufx4h1.png?width=320&crop=smart&auto=webp&s=17d5aa0368531c22844fefa0
reddit:LocalLLaMA - 2026-06-02Downlevel at AWS Loop
<!-- SC_OFF --><div class="md"><p>I am currently a manager at Accenture strategy, did loop for L6 (Senior category manager), however, have been offered L5. I believe my stories were solid with multibillion impact and I have led multiple procurement transformation engagements.<br
reddit:datacenter <table> <tr><td> <a href="https://www.reddit.com/r/NVDA_Stock/comments/1tv2hya/nvidia_showcases_aipowered_humanoid_robot_platform/"> <img alt="Nvidia showcases AI-powered humanoid robot platform" src="https://external-preview.redd.it/PcUxrZQMTde-uprJzgXaYRy4OO5K2Ttryv_EuYUkGko.jp
reddit:NVDA_Stock<!-- SC_OFF --><div class="md"><p>The AI Alliance just published the report from its first Project Tapestry workshop (30 partners in Paris, May 7–8). The core idea is an "N+1" architecture: one consortium-trained base model, plus many sovereign derivatives. Nodes keep t
reddit:LocalLLaMA