最新

Where's the Human Touch?

1 分•作者: Quizzical4230•大约 1 个月前•原帖

Mobileye acquires humanoid robot startup Mentee Robotics for $900M

1 分•作者: mhb•大约 1 个月前•原帖

Logitech Blames 'Inexcusable Mistake' After Certificate Expiry Breaks macOS Apps

1 分•作者: thm•大约 1 个月前•原帖

Save your OKLCH color palettes

1 分•作者: hnhsh•大约 1 个月前•原帖

Researchers poison stolen data to make AI systems return wrong result

1 分•作者: pseudolus•大约 1 个月前•原帖

Sora2

1 分•作者: xbaicai•大约 1 个月前•原帖

Show HN: YoloForge – Create object detection datasets using Gemini 3 Pro

3 分•作者: Olibier•大约 1 个月前•原帖

Hi HN, I’m the creator of YoloForge. I built this because I hit a wall with a hobby computer vision project: I needed a custom dataset, and zero-shot tools like Grounding DINO just weren't accurate enough for my specific classes. I decided I’d rather write code for a couple of weeks than draw another box by hand.I previously experimented with Grounding DINO and SAM3. While they are amazing for generic objects, I found they struggle with specific semantic requests (e.g. specific manufacturing parts, game characters or distinguishing "a worker" from "a worker without a helmet").I discovered that Gemini 3 Pro is surprisingly underrated for bounding box tasks if you prompt it with detailed visual descriptions. It handles semantic understanding significantly better than standard zero-shot detectors.url: yoloforge.comThe Workflow:Upload a zip of raw images (stored in Cloudflare R2). Describe class/classes in plain English. The system generates a .jsonl batch file and sends it to the Gemini Batch API. This allows us to process thousands of images in parallel at 50% of the standard cost. You review/correct boxes in the UI and export the YOLO train/val/test dataset.Technical Challenges:One hard part was getting valid JSON out of the LLM consistently. I ended up writing a robust parser that uses regex fallback strategies to literally "salvage" valid bounding boxes from malformed responses.The Stack:- Frontend: Next.js - Backend: FastAPI, Celery (for async zip processing and polling the batch API), Redis. - Storage: Supabase (Auth/DB), Cloudflare R2 (Image Storage). - Model: Google Gemini 3 Pro via Batch API.There is a live demo on the landing page (no sign-up required) where you can upload a single image to test the detection logic. But of course the tool really shines with datasets that have thousands of images with multiple classes.If you have any technical questions please ask!

Smartphone use cuts into school hours, with social media leading the way

1 分•作者: pseudolus•大约 1 个月前•原帖

SpotEdit: Selective Region Editing in Diffusion Transformers

1 分•作者: gessha•大约 1 个月前•原帖

LaTeX Coffee Stains [pdf]

3 分•作者: zahrevsky•大约 1 个月前•原帖

Show HN: Can you hit replacement? A fertility SIM with cited sources

3 分•作者: joshuafkon•大约 1 个月前•原帖

America's TFR is 1.67. I wanted to understand what it would actually take to get back to replacement (2.1), so I built a simulator where you can stack policies and see the projected effects. Every policy has cited effect sizes (Cohen, Milligan, Raute, etc.) with confidence levels. You can click any policy title to see the methodology and sources. The model includes:Fiscal tracking (policy costs, deficit impact, GDP effects) Diminishing returns when stacking similar interventions Immigration with selection mechanisms and generational convergence Tax increases and entitlement reform as funding options (with growth drag) A few "illiberal" policies for analytical completenessThe honest answer seems to be: it's really hard. Most realistic packages get you to ~1.9-2.0 at enormous cost, and that's assuming the effect estimates transfer to the US context (they might not). Built with vanilla JS. Feedback welcome - especially on the methodology or effect estimates I got wrong.

J.R.R. 托尔金朗读《霍比特人》30分钟（1952年）

31 分•作者: bookofjoe•大约 1 个月前•原帖

展示HN：30,000个宜家商品的平面文本（CommerceTXT）。比JSON小24%。

16 分•作者: tsazan•大约 1 个月前•原帖

这里是原帖作者。我使用了非官方的IKEA美国数据集（最初由jeffreyszhou抓取）并将所有30,511个产品转换成了一种扁平化的、类似于Markdown的协议，称为CommerceTXT。目标：看看扁平结构是否对大型语言模型（LLM）的上下文窗口更有效。结果： - 规模：30,000个产品，涵盖632个类别。 - 效率：文本版本相比于等效的压缩JSON，使用的token减少了约24%（总共节省了360万token）。 - 结构：文件按文件夹组织（例如 /products/category/），这有助于测试层次检索路由器。链接指向Hugging Face上的数据集，其中包含完整的基准测试。解析器代码在这里： [https://github.com/commercetxt/commercetxt](https://github.com/commercetxt/commercetxt) 欢迎提问关于转换逻辑的任何问题！

Sugar industry influenced researchers and blamed fat for CVD

84 分•作者: aldarion•大约 1 个月前•原帖

Show HN: RepoReaper – AST-aware, JIT-loading code audit agent (Python/AsyncIO)

2 分•作者: realdexter•大约 1 个月前•原帖

OP here. I built RepoReaper to solve code context fragmentation in RAG.Unlike standard chat-with-repo tools, it simulates a senior engineer's workflow: it parses Python AST for logic-aware chunking, uses a ReAct loop to JIT-fetch missing file dependencies from GitHub, and employs hybrid search (BM25+Vector). It also generates Mermaid diagrams for architecture visualization. The backend is fully async and persists state via ChromaDB.Link: <a href="https://github.com/tzzp1224/RepoReaper" rel="nofollow">https://github.com/tzzp1224/RepoReaper</a>

Show HN: Arabic Calligraphy Generator – 11 styles, free, no signup

2 分•作者: zaochen1224•大约 1 个月前•原帖

I built a small web tool that lets people create Arabic calligraphy without needing design software. Most existing tools are either too complex or very limited, so I wanted something simple and accessible.Features: • Write Arabic directly or translate from English • 11 classic calligraphy styles (Thuluth, Naskh, Kufi, Diwani, etc.) • Adjust layout, colors, line height, stroke, and rotation • Export as PNG, JPG, or SVG • No signup requiredI’d appreciate any feedback on performance, UI, or calligraphy accuracy. This is a solo side project and still evolving.Site: <a href="https://arabiccalligraphygenerator.online" rel="nofollow">https://arabiccalligraphygenerator.online</a>

Show HN: A simple way to find open source issues to contribute to

2 分•作者: K-dash•大约 1 个月前•原帖

Finding open source issues is easy. Deciding which ones are worth your time is not.I built Contrib.FYI as a simple web app to reduce that decision cost.Instead of relying on static, curated lists, it uses live GitHub API data and shows issues in chronological order, so discovery stays fresh.On top of that, it surfaces a few early signals (language, stars, no comments, no linked PRs) to help you avoid opening issues that are already being worked on.The goal is not to find more issues, but to find better candidates to spend your time on.Source code is available here: <a href="https://github.com/K-dash/contrib-fyi" rel="nofollow">https://github.com/K-dash/contrib-fyi</a>Feedback is welcome.

Show HN: Milkyboard – Synth Keyboard with Milkdrop Visualizer

2 分•作者: amadeuspagel•大约 1 个月前•原帖

Automotive industry expands open source collaboration

1 分•作者: foundart•大约 1 个月前•原帖

Show HN: Deep learning without gradient descent, 500 layers, no skip connections

1 分•作者: Yuriy_Bakhvalov•大约 1 个月前•原帖

上一页 1...654 655 656 657 658...4924 下一页