Home Knowledge Base DeepSeek

DeepSeek

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Hong Kong launches DeepSeek-based AI model designed to run on domestic chips

Hong Kong launches DeepSeek-based AI model designed to run on domestic chips Agent Workshop operated stably for up to 28 hours without interruption in a single session to produce a research report The Hong Kong Generative AI Research and Development Centre (HKGAI) has officially launched a new DeepSeek-based large language model that can run on domestic chips, as the government-backed lab seeks to commercialise its products and export Chinese AI overseas. The HKGAI-V3 model, built on...

South China Morning Post 7d ago

Bringing Up DeepSeek-V4-Flash on AMD MI300X

Bringing up DeepSeek-V4-Flash on AMD MI300X At Doubleword we are building an inference cloud designed for volume. To do that we have to reckon with the enveloping compute shortage. AMD’s MI300X launched in December 2023At AMD’s “Advancing AI” event, 6 December 2023.

Hacker News 8d ago

DeepSeek makes the V4 Pro price discount permanent

DeepSeek has announced a permanent price reduction for its V4 Pro model API. Following a 75% discount promotion that concludes on May 31, 2026, the model's pricing will be permanently set at one-quarter of its original cost.

Hacker News 19d ago

A Comprehensive Anatomy of Human and DeepSeek-R1 LLM Mathematical Reasoning

new Abstract: The emergence of "Aha moments" in large language models, particularly DeepSeek-R1-0120, has raised the question of whether these systems genuinely reason or merely imitate the appearance of reasoning. We conduct a comprehensive empirical comparison between model and human reasoning across all 30 problems from AIME 2025, exhaustively annotating 10,247 reasoning steps into five functional categories: Analysis, Inference, Branch, Backtrace, and Reflection. We find a clear...

arXiv CS 2d ago

DeepSeek V4 Pro beats GPT-5.5 Pro on precision

DeepSeek V4 Pro takes this matchup 38.0 to 33.0, and the margin feels earned. Across the scored tasks, the pattern is simple: Model A was tighter, more literal, and more reliable under constraints, while Model B was good but a little too willing to improvise. The clearest technical win came in python log redactor .

Hacker News 2d ago

Huawei chips refine DeepSeek model in major leap for China’s AI self-reliance

Huawei chips refine DeepSeek model in major leap for China’s AI self-reliance While Chinese chipmakers have found success in supporting AI inference, they are struggling with the far more complex process of training While Chinese chipmakers have found success in supporting AI inference – the relatively simple process of running an already-finished model to answer user prompts – they have struggled with training, the far more complex process of building or refining a model’s brain. If initial...

South China Morning Post 5d ago

DeepSeek nears US$7b haul in first-ever funding round, with backing from Tencent, CATL

Chinese artificial intelligence start-up DeepSeek is finalising its first external fundraising round, securing over 50 billion yuan (US$7.4 billion) at a valuation of just under US$60 billion, according to people familiar with the matter – marking a six-fold leap from its US$10 billion valuation in April. The blockbuster round highlights intensifying global competition and a shifting strategy for the AI breakout star, which had previously resisted external capital. Market-oriented investors...

South China Morning Post 7d ago

FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention

arXiv:2606.09079v1 Announce Type: new Abstract: Conventional LLMs keep the full KV cache loaded during decoding, causing a severe GPU memory bottleneck for ultra-long context serving. In this report, we propose Lookahead Sparse Attention (LSA), a novel inference paradigm powered by a Neural Memory Indexer built upon the DeepSeek-V4 architecture. Rather than passively attending to all historical tokens, LSA proactively predicts future context demands and preserves only the query-critical KV...

arXiv CS 1d ago

Institutional Trust and the Domestic AI Advantage: Evidence from DeepSeek and ChatGPT Users in China

Announce Type: new Abstract: Public trust in generative artificial intelligence exhibits increasingly divergent patterns across national contexts, yet prevailing research largely overlooks the macro-structural forces underlying this divergence. This study argues that trust in AI is not merely a technical response to performance but a product of institutional refraction. We propose an ``Institutional Prism'' framework to demonstrate how institutional trust shapes user trust in domestic...

arXiv CS 8d ago