2018-02-27
by Jerome Choo
- URL Report downloads are now sorted in newest-first order.
- Crawlbot now indexes the seed URL of each extracted object in the fromSeedUrl field.
- Crawlbot and Bulk Service data retrieval no longer requires access to port :18100. Data downloads are also now HTTPS-only.
- Fixed an issue in which the url value would retain HTML escaping if present within the original page source.
- <video> elements could be returned in the Article API.
- Fixed an issue in the Global Index in which complicated Boolean (OR) queries would return no results.
- brand detection in the Product API.
- Fixed an issue in which humanLanguage could be mis-identified on some Spanish-language pages.
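As a rough illustration of two of the items above, the sketch below groups extracted objects by their fromSeedUrl value and shows what HTML escaping in a url value looks like when decoded. The sample records, the helper names `group_by_seed` and `unescape_url`, and the assumption that objects arrive as plain dictionaries are all hypothetical and are not taken from Diffbot's documentation.

```python
import html
from collections import defaultdict


def group_by_seed(objects):
    """Group extracted objects by their fromSeedUrl value (assumed field layout)."""
    groups = defaultdict(list)
    for obj in objects:
        # Objects crawled before this release may not carry the field,
        # so a missing value is grouped under None.
        groups[obj.get("fromSeedUrl")].append(obj)
    return dict(groups)


def unescape_url(url):
    """Decode HTML entities (for example &amp;) left over from page source."""
    return html.unescape(url)


# Hypothetical sample records, not real Crawlbot output.
sample = [
    {"url": "https://example.com/a?x=1&amp;y=2", "fromSeedUrl": "https://example.com/"},
    {"url": "https://example.com/b", "fromSeedUrl": "https://example.com/"},
    {"url": "https://example.org/c", "fromSeedUrl": "https://example.org/"},
]

grouped = group_by_seed(sample)
cleaned = unescape_url(sample[0]["url"])
```

Decoding on the client side like this would only matter for data collected before the url fix; it is shown here to make the symptom concrete, not as a required step.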