OpenAI launches web crawler 'GPTBot' amid plans for next model: GPT-5

Artificial intelligence firm OpenAI has launched “GPTBot” — its new web crawling tool which it says could potentially be used to improve future ChatGPT models.

“Web pages crawled with the GPTBot user agent may potentially be used to improve future models,” OpenAI said in a new blog post, adding it could improve accuracy and expand the capabilities of future iterations.

A web crawler, sometimes called a web spider, is a type of bot that indexes the content of websites across the internet. Search engines like Google and Bing use them in order for the websites to show up in search results.

OpenAI said the web crawler will collect publicly available data from the world wide web, but will filter out sources that require paywalled content, or is known to gather personally identifiable information, or has text that violates its policies.

Breaking

OpenAI just launched GPTBot, a web crawler designed to automatically scrape data from the entire internet.

This data will be used to train future AI models like GPT-4 and GPT-5!

GPTBot ensures that sources violating privacy and those behind paywalls are excluded. pic.twitter.com/oR3kY4buaU

— Shubham Saboo (@Saboo_Shubham_) August 7, 2023

It should be noted that website owners can deny the web crawler by adding a “disallow” command to a standard file on the server.

Instructions to “disallow” GPTBot for ChatGPT users. Source: OpenAI

The new crawler comes three weeks after the firm filed a trademark application for “GPT-5,” the anticipated successor to the current GPT-4 model.

The application was filed at the United States Patent and Trademark Office on July 18, and covers the use of the term “GPT-5,” which includes software for AI-based human speech and text, converting audio into text and voice and speech recognition.

OpenAI has filed a trademark application for:

“GPT-5”

which includes “software for”:

“the artificial production of human speech and text”

“conversion of audio data files into text”

“voice and speech recognition”

“machine-learning based language and speech processing”

pic.twitter.com/54aJBovDNB

— YK aka CS Dojo (@ykdojo) August 1, 2023

However, observers may not want to hold their breath for the next iteration of ChatGPT just yet. In June, OpenAI’s founder and CEO Sam Altman said the firm is “nowhere close” to beginning training GPT-5, explaining that several safety audits need to be conducted prior to starting.

Related: 11 ChatGPT prompts for maximum productivity

Meanwhile, Concerns have been raised over OpenAI’s data-collecting tactics of late, particularly revolving around copyright and consent.

Japan’s privacy watchdog issued a warning to OpenAI about collecting sensitive data without permission in June, while Italy temporarily banned the use of ChatGPT after alleging it breached various European Union privacy laws in April.

In late June, a class action was filed against OpenAI by 16 plaintiffs alleging the AI firm to have accessed private information from ChatGPT user interactions.

If these allegations are proven to be accurate, OpenAI — and Microsoft, who was named as a defendant — will be in breach of the Computer Fraud and Abuse Act, a law with a precedent for web-scraping cases.

Magazine: AI Eye: AI travel booking hilariously bad, 3 weird uses for ChatGPT, crypto plugins

Source link

Name	Price	24H %
Bitcoin(BTC)	$0.00	-0.92%
Ethereum(ETH)	$0.00	-1.80%
Tether(USDT)	$0.00	0.020%
BNB(BNB)	$0.00	-2.16%
USD Coin(USDC)	$0.00	0.110%
Solana(SOL)	$0.00	-1.25%
XRP(XRP)	$0.00	-2.62%
Cardano(ADA)	$0.00	-1.97%
Terra(LUNA)	$0.00	-7.21%
Avalanche(AVAX)	$0.00	-4.23%

OpenAI launches web crawler ‘GPTBot’ amid plans for next model: GPT-5

GameStop Discloses Weaker Than Expected Q3 Financial Figures

Real Finance Raises $29M to Expand Institutional RWA Platform

Banking giant PNC teams with Coinbase to enable direct Bitcoin trading for wealthy clients

CoreWeave Plans $2B Convertible Note Offering for Expansion

WisdomTree Launches Tokenized Options-Income Fund EPXC Onchain

Bitmine Buys $199M ETH as Smart Money Traders Short ETH

Leave a Reply Cancel reply

You may have missed

ETHZilla Buys 15% of Zippy to Tokenize Manufactured Home Loans

Bitcoin Exchange Paxful Agrees to Plead Guilty, Hit With $7.5 Million in Penalties

Polygon-Based Soccerverse Secures FIFPRO Deal, Unlocks 65,000 Real Players for Blockchain Football

Long-dormant ETH wallet and major BTC holders move funds before Fed meeting

Sitemap

Legal Information

Pin It on Pinterest

More Stories

Leave a Reply Cancel reply

You may have missed

Sitemap

Legal Information

Categories

Pin It on Pinterest