DeepSeek
No mentions found
This entity hasn't been tracked yet, or Iris is still building its knowledge base.
Related Articles from SNS
Hong Kong launches DeepSeek-based AI model designed to run on domestic chips
Hong Kong launches DeepSeek-based AI model designed to run on domestic chips Agent Workshop operated stably for up to 28 hours without interruption in a single session to produce a research report The Hong Kong Generative AI Research and Development Centre (HKGAI) has officially launched a new DeepSeek-based large language model that can run on domestic chips, as the government-backed lab seeks to commercialise its products and export Chinese AI overseas. The HKGAI-V3 model, built on...
Bringing Up DeepSeek-V4-Flash on AMD MI300X
Bringing up DeepSeek-V4-Flash on AMD MI300X At Doubleword we are building an inference cloud designed for volume. To do that we have to reckon with the enveloping compute shortage. AMD’s MI300X launched in December 2023At AMD’s “Advancing AI” event, 6 December 2023.
DeepSeek makes the V4 Pro price discount permanent
DeepSeek has announced a permanent price reduction for its V4 Pro model API. Following a 75% discount promotion that concludes on May 31, 2026, the model's pricing will be permanently set at one-quarter of its original cost.
A Comprehensive Anatomy of Human and DeepSeek-R1 LLM Mathematical Reasoning
new Abstract: The emergence of "Aha moments" in large language models, particularly DeepSeek-R1-0120, has raised the question of whether these systems genuinely reason or merely imitate the appearance of reasoning. We conduct a comprehensive empirical comparison between model and human reasoning across all 30 problems from AIME 2025, exhaustively annotating 10,247 reasoning steps into five functional categories: Analysis, Inference, Branch, Backtrace, and Reflection. We find a clear...
DeepSeek V4 Pro beats GPT-5.5 Pro on precision
DeepSeek V4 Pro takes this matchup 38.0 to 33.0, and the margin feels earned. Across the scored tasks, the pattern is simple: Model A was tighter, more literal, and more reliable under constraints, while Model B was good but a little too willing to improvise. The clearest technical win came in python log redactor .
Huawei chips refine DeepSeek model in major leap for China’s AI self-reliance
Huawei chips refine DeepSeek model in major leap for China’s AI self-reliance While Chinese chipmakers have found success in supporting AI inference, they are struggling with the far more complex process of training While Chinese chipmakers have found success in supporting AI inference – the relatively simple process of running an already-finished model to answer user prompts – they have struggled with training, the far more complex process of building or refining a model’s brain. If initial...
DeepSeek nears US$7b haul in first-ever funding round, with backing from Tencent, CATL
Chinese artificial intelligence start-up DeepSeek is finalising its first external fundraising round, securing over 50 billion yuan (US$7.4 billion) at a valuation of just under US$60 billion, according to people familiar with the matter – marking a six-fold leap from its US$10 billion valuation in April. The blockbuster round highlights intensifying global competition and a shifting strategy for the AI breakout star, which had previously resisted external capital. Market-oriented investors...
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention
arXiv:2606.09079v1 Announce Type: new Abstract: Conventional LLMs keep the full KV cache loaded during decoding, causing a severe GPU memory bottleneck for ultra-long context serving. In this report, we propose Lookahead Sparse Attention (LSA), a novel inference paradigm powered by a Neural Memory Indexer built upon the DeepSeek-V4 architecture. Rather than passively attending to all historical tokens, LSA proactively predicts future context demands and preserves only the query-critical KV...
Institutional Trust and the Domestic AI Advantage: Evidence from DeepSeek and ChatGPT Users in China
Announce Type: new Abstract: Public trust in generative artificial intelligence exhibits increasingly divergent patterns across national contexts, yet prevailing research largely overlooks the macro-structural forces underlying this divergence. This study argues that trust in AI is not merely a technical response to performance but a product of institutional refraction. We propose an ``Institutional Prism'' framework to demonstrate how institutional trust shapes user trust in domestic...