Show HN: I taught GPT-OSS-120B to see using Google Lens and OpenCV

2 | by vkaufmann | 3 months ago
I built an MCP server that gives any local LLM real Google search and now vision capabilities - no API keys needed.

The latest feature: google_lens_detect uses OpenCV to find objects in an image, crops each one, and sends each crop to Google Lens for identification. GPT-OSS-120B, a text-only model with zero vision support, correctly identified an NVIDIA DGX Spark and a SanDisk USB drive from a desk photo.

Also includes Google Search, News, Shopping, Scholar, Maps, Finance, Weather, Flights, Hotels, Translate, Images, Trends, and more. 17 tools total.

Two commands:

  pip install noapi-google-search-mcp && playwright install chromium

GitHub: https://github.com/VincentKaufmann/noapi-google-search-mcp

PyPI: https://pypi.org/project/noapi-google-search-mcp/

Booyah!
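For anyone curious what the "find objects, crop each one" step could look like, here is a minimal sketch. The post does not say how google_lens_detect locates objects, so this assumes a simple approach (Canny edges plus external contours); it is not the project's actual code. The names select_crop_boxes and detect_and_crop, and all thresholds, are hypothetical. The Google Lens upload is left out because it depends on the project's internals.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, w, h), the layout cv2.boundingRect returns


def select_crop_boxes(boxes: List[Box], img_w: int, img_h: int,
                      min_area_frac: float = 0.01, pad: int = 10,
                      max_boxes: int = 5) -> List[Box]:
    """Drop tiny regions, pad the rest, clamp to the image, keep the largest."""
    min_area = min_area_frac * img_w * img_h
    kept = []
    for x, y, w, h in boxes:
        if w * h < min_area:
            continue  # likely noise or texture rather than an object
        x0, y0 = max(0, x - pad), max(0, y - pad)
        x1, y1 = min(img_w, x + w + pad), min(img_h, y + h + pad)
        kept.append((x0, y0, x1 - x0, y1 - y0))
    kept.sort(key=lambda b: b[2] * b[3], reverse=True)  # largest area first
    return kept[:max_boxes]


def detect_and_crop(image_path: str, out_prefix: str = "crop") -> List[str]:
    """Assumed pipeline: edges -> contours -> bounding boxes -> one file per crop."""
    import cv2  # opencv-python; imported lazily so the helper above stays dependency-free

    img = cv2.imread(image_path)
    if img is None:
        raise FileNotFoundError(image_path)
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    edges = cv2.Canny(cv2.GaussianBlur(gray, (5, 5), 0), 50, 150)
    edges = cv2.dilate(edges, None, iterations=2)  # close small gaps in outlines
    # [-2] selects the contour list on both OpenCV 3.x and 4.x return layouts
    contours = cv2.findContours(edges, cv2.RETR_EXTERNAL,
                                cv2.CHAIN_APPROX_SIMPLE)[-2]
    h, w = img.shape[:2]
    boxes = select_crop_boxes([cv2.boundingRect(c) for c in contours], w, h)
    paths = []
    for i, (x, y, bw, bh) in enumerate(boxes):
        path = f"{out_prefix}_{i}.png"
        cv2.imwrite(path, img[y:y + bh, x:x + bw])
        paths.append(path)
    return paths  # each file would then be submitted to Google Lens
```

Capping the number of crops and skipping tiny regions keeps the number of Lens lookups per photo small, which matters when each lookup goes through a headless browser.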