-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Usecase? #1
Comments
Hello, https://github.com/dzhng/deep-research needs a Firecrawl with the /search endpoint. It only use the /search endpoint to make and SERP and scraped the pages. This repo allows you to use for example SearxNG as a self hosted search engine and then use your self-hosted version of Firecrawl or Crawl4AI or Jina (that can be self hosted) to scrape the pages. In the deep-research repo you have to set up FIRECRAWL_BASE_URL="http://localhost:8000" (or to the ip where you run crawlrouter |
Brother I desperately need this. Was actively looking for solution. Was even ready to implement it myself. Thanks for this. I am closely following this repo. Is there any roadmap for this project? |
@loorisr For this /deep-research just swapping the url will work perfectly? Or I have to do some changes? Again following this project. If there is some roadmap please me know. May as well help with the code. |
I'm glad it can help :) I'm using it with deep-research and it works fine, you just need to set FIRECRAWL_BASE_URL in the env file of deep-research to where you host crawlrouter. Then on crawlrouter you need to set SEARCH_BACKEND to the one you want (for example searxng) and SCRAPE_BACKEND to, for example crawl4ai or firecrawl. For the roadmap, I'm currently implementing the /crawl endpoint. It will also speed up the /search endpoint when activating the scape mode (by default /search only return the url, title and a description of the link, but deep-research also need to have the complete page scraped). And then it will depends what people could need. Other backend (https://scrapingant.com, ...) or other functions! |
This is a promising project. |
Hello there.
I am running a self hosted version of firecrawl. I saw your comment here in /dzhng/deep-research/issues/77.
I wanted to ask you, have you configured crawlrouter by yourself for deep-research repo locally?
If that is then how did you configure it?
I mean is it hot swappable or I have to do some configuration in that deep-research repo?
Thanks. I think this is awesome tool.
The text was updated successfully, but these errors were encountered: