Coinlive- We Make Blockchain Simpler
Download and install the Coinlive app
Open

OpenAI, which was far ahead, has slowed down

"If GPT-5 is released, OpenAI is still far ahead. If it is AI Search or voice assistant, it means OpenAI has fallen."

An AI big model practitioner told Huxiu that the industry's expectations for OpenAI are too high. Unless it is a disruptive innovation like GPT-5, it is difficult to satisfy the audience's "appetite".

Although Sam Altman had announced before the OpenAI online live broadcast that GPT-5 (or GPT-4.5) would not be released, the outside world's expectations for OpenAI have long been unstoppable.

In the early morning of May 14th, Beijing time, OpenAI announced the latest GPT-4o, where o stands for Omnimodel. The more than 20-minute live demonstration showed an AI interactive experience that far exceeds all current voice assistants, which basically coincides with the news previously revealed by foreign media.

Although the demonstration effect of GPT-4o can still be called "explosive",industry insiders generally believe that it is difficult to live up to the word "magic" in Altman's preview. Many people believe that these functional products are "deviating from OpenAI's mission."

OpenAI's PR team also seemed to have anticipated this trend of public opinion. Altman explained this at the event and in a blog post after the event:

"A key part of our mission is to make very powerful AI tools available to people for free (or at a discounted price). I'm very proud that we're making the world's best models available for free in ChatGPT, with no ads or anything like that.

When we founded OpenAI, our original idea was that we would create AI and use it to create all kinds of benefits for the world. Instead, it now looks like we're going to create AI and then other people will use it to create all kinds of amazing things that benefit us all."

"If we had to wait 5 seconds for 'each' response, the user experience would plummet. Even if the synthetic audio itself sounds real, it would destroy the immersion and make people feel lifeless."

" On the eve of the OpenAI conference, Jim Fan, head of Nvidia’s Embodied AI, predicted the voice assistant that OpenAI would release at X and proposed:

Almost all voice AI will go through three stages:

1. Speech recognition or “ASR”: audio -> text 1, such as Whisper;

2. LLM that plans what to say next: text1 -> text2;

3. Speech synthesis or “TTS”: text2 -> audio, such as ElevenLabs or VALL-E.

Going through 3 stages will result in huge delays.

GPT-4o has almost solved the latency problem in terms of response speed. The shortest response time of GPT-4o to audio input is 232 milliseconds, and the average response time is 320 milliseconds, which is almost similar to that of humans. The average delay of the ChatGPT voice dialogue function without GPT-4o is 2.8 seconds (GPT-3.5) and 5.4 seconds (GPT-4).

GPT-4o not only greatly improves the experience by shortening the delay, but also makes many upgrades based on GPT-4, including:

  • Excellent multimodal interaction capabilities, including voice, video, and screen sharing.

  • It can recognize and understand human expressions, text, and mathematical formulas in real time.

  • The interactive voice is rich in emotion, and the voice tone and style can be changed, and it can also imitate and even sing "improvised".

  • It has ultra-low latency, and can interrupt the AI ​​in real time during the conversation, add information or start a new topic.

  • All ChatGPT users can use it for free (with a usage cap).

  • It is twice as fast as GPT-4 Turbo, with 50% lower API costs and 5 times higher rate limits.

"Breaking through these limitations is innovation."

Some industry experts believe that GPT-4o's multimodal capabilities only "look" good, and in fact OpenAI has not demonstrated a truly "breakthrough" feature for visual multimodality.

Here we follow the habits of the large model industry and compare it with Claude 3 from Anthropic, the factory next door.

The technical documentation of Claude 3 mentions that "although Claude's image understanding capabilities are cutting-edge, some limitations need to be noted."

These include:

  • Person recognition: Claude cannot be used to identify (i.e. name) people in images and will refuse to do so.

  • Accuracy: Claude may hallucinate or make mistakes when interpreting low-quality, rotated, or very small images under 200 pixels.

  • Spatial Reasoning: Claude has limited spatial reasoning abilities. It may have difficulty with tasks that require precise positioning or layout, such as reading an analog clock face or describing the exact position of a chess piece.

  • Counting: Claude can give an approximate count of objects in an image, but may not always be precisely accurate, especially for large numbers of small objects.

  • AI-generated images: Claude does not know if an image is AI-generated and may not be correct if asked. Do not rely on it to detect fake or synthetic images.

  • Inappropriate content: Claude will not process inappropriate or explicit images that violate our Acceptable Use Policy.

  • Healthcare Applications: While Claude can analyze general medical images, it is not designed to interpret complex diagnostic scans such as CT or MRI. Claude's output should not be considered a substitute for professional medical advice or diagnosis.

Among the cases published on the GPT-4o website, there are some capabilities related to "spatial reasoning", but they are still difficult to be considered breakthroughs.

In addition, it is easy to see from the output of GPT-4o in the live demonstration at the press conference that its model capabilities are not much different from GPT-4.

GPT-4o running score

Although the model can add tone to the conversation and even sing impromptu, the content of the conversation is still lacking in details and creativity like GPT-4.

In addition, after the press conference, OpenAI's official website also released a series of application case explorations of GPT-4o. Including: photo conversion to comic style; meeting minutes; image synthesis; 3D content generation based on images; handwriting and draft generation; stylized posters, and comic strip generation; artistic font generation, etc.

Among these capabilities, photo conversion to comic style, meeting minutes, etc., are also some seemingly ordinary Wensheng pictures or AI large model functions.

"If I register 5 free ChatGPT accounts, do I not need to subscribe to ChatGPT Plus for $20 per month?"

OpenAI's announced GPT-4o usage policy is that ChatGPT Plus users have a traffic limit that is 5 times higher than that of ordinary users.

GPT-4o is free for everyone, and the first challenge seems to be OpenAI's own business model.

Data released by the third-party market analysis platform Sensor Tower show that in the past month, ChatGPT has been downloaded 7 million times in the global App Store and has a subscription revenue of 12 million US dollars; the global Google Play market has been downloaded 90 million times and has a subscription revenue of 3 million US dollars.

Currently, the subscription price of ChatGPT Plus in both app stores is $19.99. According to subscription data, ChatGPT Plus has 750,000 paid subscribers through the app store in the past month. Although ChatGPT Plus still has a large number of direct paying users, from the perspective of mobile phone revenue, the annual revenue is less than 200 million US dollars, and it is difficult to support OpenAI's valuation of nearly 100 billion even if it doubles several times.

From this point of view, OpenAI does not need to consider too much about individual user recharges.

What's more, GPT-4o focuses on good experience. If you are chatting with AI and the conversation is interrupted, and you have to change the account to chat again, will you recharge angrily?

"The original ChatGPT hinted at the possibility of language interfaces; this new thing feels fundamentally different. It's fast, smart, fun, natural, and helpful."

Sam Altman's latest blog mentions the "possibility of language interfaces," which is exactly what GPT-4o may do next: challenge all GUIs (graphical interactive interfaces) and those who want to work on LUIs (voice interactive interfaces).

Combined with the recent news of OpenAI's cooperation with Apple revealed by foreign media, it can be speculated that GPT-4o may soon "throw an olive branch" or "turn the table" to all AI PC and AI mobile phone manufacturers.

No matter what kind of voice assistant or AI big model, the core value for AIPC and AI mobile phones is to optimize the experience, and GPT-4o optimizes the experience to the extreme.

GPT-4o is likely to involve all known apps, even the SaaS industry. In the past year or so, all AI agents that have been developed and are being developed on the market will face threats.

A product manager of a resource aggregation app once told Huxiu, "My operating process is the core of the product. If the operating process is optimized by your ChatGPT, it means that my app has no value."

Imagine that if the UI of the takeaway ordering app becomes a sentence "Order for me", then it will be the same for users whether they open Meituan or Ele.me.

The next step for manufacturers can only be to compress the profit margins of the supply chain and ecology, or even a vicious price war.

From the current situation, it may take some time for other manufacturers to defeat OpenAI in terms of model capabilities.

If a product wants to benchmark OpenAI, it may only be through making a more "cheap" model.

"I've been so busy lately that I haven't paid attention to them."

A founder of a large industrial AI model told Huxiu that he has been busy communicating strategic cooperation, product releases, customer exchanges and capital exchanges recently, and has no time to pay attention to releases like OpenAI.

Before OpenAI was released, Huxiu also asked a number of domestic AI practitioners from all walks of life. Their predictions and opinions on OpenAI's latest release were very consistent: I'm looking forward to it, but it has nothing to do with me.

A practitioner said that judging from the current progress in China, it is not realistic to catch up with OpenAI in the short term. So if you care about what OpenAI has released, at most you can just look at the latest technical direction.

At present, domestic companies generally pay more attention to engineering and vertical models in the research and development of large AI models, which are more pragmatic and easy to monetize.

In terms of engineering, Deepseek, which has recently become popular, is setting off a token price war in the domestic large model industry. In terms of vertical models, many industry insiders told Huxiu that in the short term, the research and development of small models and vertical models will basically not be affected by OpenAI.

"Sometimes OpenAI's technical direction is not very worth learning from."A model expert told Huxiu that Sora is a good example. In February 2024, OpenAI released the video model Sora, which achieved a stable output of 60 seconds of video. Although it looks very effective, there is almost no subsequent practice and the landing speed is very slow.

Before Sora, many domestic companies and institutions working in the field of viz video had achieved 15-second stable video generation. After Sora came out, the R&D, financing, and product rhythm of some companies were disrupted, and even the development of the entire viz video industry evolved into a "technological leap forward."

Fortunately, this time GPT-4o is very different from Sora. OpenAI CTO Muri Murati said that in the next few weeks, we will continue our iterative deployment to provide you with all the functions.

Soon after the press conference, GPT-4o was already available for online trial.

More news about chatgpt 속도 느림

  • Jun 11, 2024 9:07 am
    Elon Musk: If Apple device operating systems integrate ChatGPT, they will be banned from entering the company
    Elon Musk posted on the X platform that if Apple integrates OpenAI's ChatGPT into iPhones, iPads and Mac computers, he will ban Apple devices from entering his company. "If Apple integrates OpenAI at the operating system level, then Apple devices will be banned from entering my company. This is an unacceptable security violation." Musk even suggested that visitors to Tesla, Space Exploration Technologies Corp and other companies he runs need to "store their Apple devices in a Faraday cage" when entering. (Cointelegraph)
  • May 30, 2024 4:45 pm
    XRP Records Slower Growth But Maintains Bullish Pace
    According to U.Today, XRP has experienced a slower growth rate in the past 24 hours compared to other altcoins such as Shiba Inu (SHIB). Currently, XRP is trading at $0.5299, a 0.6% decrease, while the overall cryptocurrency market cap has increased by 1.2% to $2.56 trillion. Despite this, XRP has achieved a 5.72% growth this month, according to data from Cryptorank. If it continues at this rate, XRP is expected to end the month on a bullish note, similar to its performance in May 2023 when it rallied by 9.84%. XRP's historical data reveals a mixed sentiment, with the digital currency recording mostly losses in the month of May since 2014. After experiencing declines of 4.4%, 34.4%, and 28.4% in May 2020, 2021, and 2022 respectively, XRP broke its bearish streak with a 9.84% rally in May 2023. This rebound has been sustained, with XRP's trading volume indicating a bullish sentiment from both spot and derivatives traders. However, there are concerns about XRP's prospects in the upcoming month. Cryptorank's historical data shows that June has been one of the most bearish months for the coin since 2018, raising questions about how XRP will overcome this negative trend. A key factor that could influence this is the potential lawsuit settlement between Ripple Labs and the U.S. SEC. Both parties are currently awaiting the court's decision. Additionally, developments in the XRP Ledger ecosystem could also contribute to a rebound that could be crucial for XRP in June.
  • May 29, 2024 8:25 pm
    Former SEC Official Criticizes Agency's Slow Pace in Regulating Rapidly Evolving Markets
    According to Odaily, Marc Fagel, a former official of the U.S. Securities and Exchange Commission (SEC), has criticized the agency for its slow pace in regulating rapidly evolving markets. He expressed hope that Congress would intervene and regulate the cryptocurrency industry. Fagel stated that the SEC has a habit of ignoring rapidly developing new spaces. In relation to the SEC's investigation of unregistered companies without violation records, Fagel explained that registration is a requirement that facilitates information disclosure. He asserted that waiting for companies to commit violations before taking action is a passive approach that keeps the SEC constantly catching up.
  • May 24, 2024 3:47 pm
    알고랜드, BTC·ETH·SOL 저격 광고..."느리고 비싸"
    코인텔레그래프에 따르면 레이어1 블록체인 알고랜드(ALGO)가 비트코인, 이더리움(ETH), 솔라나(SOL)를 저격한 광고를 공개했다. 알고랜드 재단이 23일 유튜브를 통해 게시한 영상 광고에는 한 남성이 등장한다. 이 남성은 마트에서 암호화폐로 결제를 시도하는데 △비트코인 결제시 27분 소요 △이더리움 결제시 수수료 112달러 △솔라나 결제시 체인 중단으로 결제 실패 등을 겪는다. 이후 영상에서는 "ALGO는 낮은 수수료, 속도감 있는 라이프 스타일을 제공한다"는 문구가 흘러나온다. 이 광고와 관련해 커뮤니티에서는 "알고랜드는 익스플로러도 제대로 운영하지 못하면서 광고비로 10만 달러 이상을 지출했다"는 비판이 나오고 있다고 미디어는 설명했다. 디파이라마 기준 알고랜드의 총 락업 예치금(TVL)은 9,600만 달러 수준으로 이더리움(650억 달러), 솔라나(48억 달러)에 크게 못 미친다.
  • May 06, 2024 7:47 pm
    Matrixport: The issuance of new USDT coins has slowed down
    Matrixport published a post on the X platform saying that although the recent market focus is on the flow of funds to Bitcoin ETFs, even during the consolidation of Bitcoin prices in the past two months, the inflow of stablecoins has continued to increase, indicating that the application of cryptocurrencies is still growing rapidly. Recently, the issuance of new USDT coins has slowed down. If the issuance speed is accelerated, it may have a positive impact on Bitcoin.
  • Apr 20, 2024 12:03 pm
    Slow Mist Cosine: Inscriptions are, to some extent, a test bed for runes.
    Yu Xian, the founder of SlowMist, posted on the X platform: "Inscriptions are to some extent a test field for runes. Various large files and a large amount of "meaningless" BRC-20 trace information of inscriptions appeared in the Taproot data of Bitcoin. After being numbered by the CVE vulnerability, it caused an uproar. But inscriptions bring violent aesthetics and are full of topics. Runes are based on the UTXO model, and the data is stored in OP_RETURN (the space is very limited). It is more simple and compact, but due to the limited space, what it can do is also very limited. I am afraid that it is mainly for the circulation of coins. Therefore, inscriptions are to some extent a test bed for runes. The "to some extent" here mainly refers to BRC-20. The runes I have seen, at least in their current state, are used to solve the "awkward" inscription form of BRC-20. It doesn't matter how the heat and controversy go. At least it is a good thing to promote the development of the Bitcoin network. If not, time can give a choice."
  • Apr 06, 2024 7:29 pm
    시바견의 미래는 암울해 보입니다: 느린 네트워크 활동의 원인
    시바 이누(SHIB) 가격은 다양한 시장 지표를 탐색하면서 미묘한 움직임을 보이고 있습니다. 한편으로 특정 지표는 활동의 안정화를 시사하며 투자자들 사이에서 신중한 옵티미즘을 암시합니다. 반면에 새로운 기술적 패턴과 변동하는 투자자들의 관심은 향후 변동성이 커질 가능성을 시사합니다. 느린 시바 네트워크 활동 지난 한 달간 시바의 평균 거래 규모를 분석해보면 급격한 성장 후 안정화되는 모습을 볼 수 있습니다. 초기에는 평균 거래 규모가 2주 만에 81.23% 감소하는 등 큰 폭으로 감소했습니다. 그러나 이러한 하락 이후 거래 규모가 안정화되기 시작했고, 이는 시바 가격이 보다 안정적인 단계로 나아가고 있음을 나타냅니다. 시바 이누 거래 규모. 출처:... source: https://kr.beincrypto.com/base-news/52866/
  • Dec 08, 2023 9:39 pm
    FCA too slow on crypto enforcement, says UK’s spending watchdog
    The UK's Financial Conduct Authority (FCA) has been criticized by the National Audit Office for acting too slow to enforce crypto laws. source: https://protos.com/fca-too-slow-on-crypto-enforcement-says-uks-spending-watchdog/
  • Nov 28, 2023 7:53 pm
    Slow Adoption and Retention for Ethereum's ERC-4337 Smart Wallets
    According to Blockworks, the introduction of Ethereum Foundation's ERC-4337 account abstraction standard earlier this year has seen slow adoption and retention rates. The standard was designed to transform Ethereum accounts into smart accounts, offering more flexibility for wallet holders. However, data from BundleBear reveals that weekly retention for ERC-4337 smart wallets drops as low as 1% for accounts older than five weeks, and an average smart account only sends five user operations. Revenue for transaction bundlers is also low, currently less than $8,000 per week. At the time of writing, daily active users are at around 3%, according to sixdegree data. Most smart account users are on the Polygon network, making up over 66% of all smart account holders and almost all monthly new users by chain. Despite the low adoption rates on Ethereum, industry participants remain optimistic about the future of smart accounts and are actively working on proposals to spur usage. John Rising, the co-founder of Stackup, an account abstraction infrastructure company, believes that many existing issues with ERC-4337 can be resolved with the first-ever Rollup Improvement Proposal, RIP-7560. The existing ERC-4337 standard is considered a semi-native+ smart account, where a user does not need a private key because the trustless relay network is designed to forward transactions to the blockchain. Rising aims to move towards a native smart account, where accounts would be able to specify their own validation logic, completely removing the need for a private key, something that RIP-7560 hopes to achieve. "ERC-4337 has always been intended as a stepping stone to native account abstraction. The account abstraction proposal, RIP-7560, is designed to be backwards compatible with ERC-4337," Rising told Blockworks. Further discussions around the proposal and its applicability must still be considered. Native account abstraction has drawn concerns over the complexity it adds, as it introduces consensus layer changes rather than just high-level infrastructure layer modifications.
  • Nov 28, 2023 1:17 am
    Fed's Preferred Inflation Measure to Recede Slower, Keeping Interest Rates Higher for Longer
    According to Yahoo News, the Federal Reserve's preferred underlying inflation measure is expected to recede at a slower pace, resulting in higher interest rates for a longer period, as per Bloomberg's latest survey of economists. Forecasters have increased their projections for the annual core personal consumption expenditures (PCE) index, which excludes volatile food and energy categories, through the end of next year. The index is predicted to be at 2.5% by the end of 2024, up from 2.4% in the previous month's poll. Meanwhile, the overall PCE metric and the alternative consumer price index are anticipated to recede faster than previously thought through mid-2024, mainly due to a pullback in energy prices. Although recent reports show signs of easing price pressures, Fed officials have emphasized the need for sustained signs of cooling before declaring victory on inflation. Policymakers consider the core gauge as a better indicator of underlying price pressures. Economists still expect the Fed to begin loosening monetary policy in the second quarter of next year, but they now predict the central bank will maintain higher interest rates through the end of 2025. Kathy Bostjancic, chief economist at Nationwide Life Insurance Co., stated that the recent slowdown in inflation, employment growth, and consumer spending supports the belief that the Fed is done raising rates for this cycle. However, she added that the Fed will wait to cut rates until mid-2024, and the easing of policy will be gradual. Forecasters anticipate the economy to expand at an annualized 1.2% pace in the current quarter, up from 0.7% in the previous survey. Although stronger consumer and government spending are expected to aid the economy in the short term, economists now project a significant slowdown in private investment to dampen growth through early 2025. The job market remains broadly strong, but demand for workers is slowly starting to soften. Economists still expect the unemployment rate to peak at 4.4% but now see it taking longer to come down. They also predict the US will add fewer payrolls on average through 2025.

More news about chatgpt 속도 느림

0 Comments
Earliest
Load more comments