You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
curl -X POST https://api.firecrawl.dev/v1/scrape \
-H 'Content-Type: application/json' \
-H "Authorization: Bearer $FIRECRAWL_API_KEY" \
-d '{
"url": "http://httpstat.us/200",
"formats": [ "markdown" ]
}'
<html><head>
<meta http-equiv="content-type" content="text/html;charset=utf-8">
<title>502 Server Error</title>
</head>
<body text=#000000 bgcolor=#ffffff>
<h1>Error: Server Error</h1>
<h2>The server encountered a temporary error and could not complete your request.<p>Please try again in 30 seconds.</h2>
<h2></h2>
</body></html>
rawHtml:
curl -X POST https://api.firecrawl.dev/v1/scrape -H 'Content-Type: application/json' -H "Authorization: Bearer $FIRECRAWL_API_KEY" -d '{
"url": "http://httpstat.us/200",
"formats": [ "rawHtml" ]
}'
{"success":false,"error":"(Internal server error) - All scraping engines failed! -- Double check the URL to make sure it's not broken. If the issue persists, contact us at [email protected]."}
Expected Behavior
OK API response containing empty content
Screenshots
The text was updated successfully, but these errors were encountered:
This is likely because the returned content is not long enough. We should probably be returning the short content anyways if the content is confirmed by multiple scraping engines.
mogery
changed the title
[Bug] Scraping httpstat.us/200 reliably triggers 500 responses from the Firecrawl API
[Bug] Scraping httpstat.us/200 reliably triggers 500 responses from the Firecrawl API (FIR-519)
Jan 7, 2025
Describe the Bug
Looks like empty scraped page responses cause 500 errors.
To Reproduce
Steps to reproduce the issue:
https://www.firecrawl.dev/playground?url=http%3A%2F%2Fhttpstat.us%2F200&mode=scrape
markdown:
rawHtml:
Expected Behavior
OK API response containing empty content
Screenshots
The text was updated successfully, but these errors were encountered: