Show HN: I taught GPT-OSS-120B to see using Google Lens and OpenCV
I built an MCP server that gives any local LLM real Google search and now vision capabilities, with no API keys needed.
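The latest feature: google_lens_detect uses OpenCV to find objects in an image, crops each one, and sends the crops to Google Lens for identification. GPT-OSS-120B, a text-only model with zero vision support, correctly identified an NVIDIA DGX Spark and a SanDisk USB drive from a desk photo.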
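For anyone curious what the "find objects, crop each one" step can look like, here is a minimal sketch of just the crop-selection part. It is not the project's code. It assumes candidate boxes arrive as (x, y, width, height) tuples, the format cv2.boundingRect returns for contours from cv2.findContours. The function name select_crop_regions and the thresholds (min_area_fraction, padding, max_regions) are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

# (x, y, width, height), the tuple format returned by cv2.boundingRect
Box = Tuple[int, int, int, int]


def select_crop_regions(
    boxes: List[Box],
    image_width: int,
    image_height: int,
    min_area_fraction: float = 0.01,  # hypothetical threshold
    padding: int = 10,                # hypothetical margin in pixels
    max_regions: int = 5,             # hypothetical cap on lookups
) -> List[Box]:
    """Filter, pad, and clamp candidate boxes; return the largest first."""
    image_area = image_width * image_height
    regions: List[Box] = []
    for x, y, w, h in boxes:
        # Skip specks that are unlikely to be a whole object.
        if w * h < min_area_fraction * image_area:
            continue
        # Grow the box by `padding` pixels, clamped to the image bounds.
        left = max(x - padding, 0)
        top = max(y - padding, 0)
        right = min(x + w + padding, image_width)
        bottom = min(y + h + padding, image_height)
        regions.append((left, top, right - left, bottom - top))
    # Largest regions first, capped so only a few crops are sent for lookup.
    regions.sort(key=lambda r: r[2] * r[3], reverse=True)
    return regions[:max_regions]
```

Each returned region could then be cut out of the image array with NumPy slicing, image[y:y + h, x:x + w], before being handed to the identification step.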
It also includes Google Search, News, Shopping, Scholar, Maps, Finance, Weather, Flights, Hotels, Translate, Images, Trends, and more. 17 tools total.
Two commands: pip install noapi-google-search-mcp && playwright install chromium
GitHub: https://github.com/VincentKaufmann/noapi-google-search-mcp
PyPI: https://pypi.org/project/noapi-google-search-mcp/
Booyah!