21 days of returns to build a portfolio? Complexity strikes again
Following the series of papers about the virtue of complexity, another recent contribution (Hellum et al, 2026) documents that the number of signals matters more than the number of data points available for portfolio optimization.
When the number of available data points T is large compared to the number of assets N, Markowitz portfolio theory (MPT) works fine. However, as the complexity ratio c (N/T) approaches 1, MPT breaks down. But what happens beyond that critical point? What happens if we allow the number of assets to rise to 2, 3 or more times the available returns?
A "double ascent" takes place: the portfolio out-of-sample Sharpe ratio starts to recover and beats by far the well behaved case when T >> N. This happens through an implicit regularization mechanism: high complexity actually mitigates noise thanks to the abundance of assets and results in a better-diversified portfolio.
The research goes as far as showing that if you have a massive cross-section of assets, you can build a well-performing portfolio with as little as 21 days of returns.
After reading the paper, I decided to replicate the section that deals with daily data and builds portfolios of risk factors with as little as 21 data points.
The animation below illustrates how the Sharpe ratio of the Markowitz and Min Variance portfolio collapse for a complexity ratio (N/T) close to one, but recovers and raises afterwards.
I added two elements:
- confidence intervals of the Sharpe ratio based on random permutations of the portfolio components. With M=100 permutations, I used relatively narrow confidence bands (66%). Yet, these bands help finding regions where the virtue of complexity makes the Markowitz and the Min Variance portfolio statistically not different from each other in terms of Sharpe. This happens for rather larger complexity ratios.
- the mean-variance frontier of the min. variance portfolios computed at different complexity ratios. Very interesting to see that for complexity beyond 1, there is a new frontier far more to the left than the initial one when complexity is lower than 1.
Original article: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6443319
Stop Reading, Start Seeing: this may solve a long-standing problm in AI
The great capabilities of LLMs come with enormous energy and hardware costs, which are partly due to one inherent weakness: when processessing text, the computational cost and memory requirements grow quadratically in text length. This quadratic scaling of costs is very limiting especially when LLMs are employed for long, complex and potentially multi-step tasks.
To tackle this issue, researchers at DeepSeek-AI have proposed a radical new approach: instead of forcing an AI to "read" every single word in a long document, what if we could turn the entire text into a picture and have the AI "see" it instead?
The core idea behind "contexts optical compression" is simple yet powerful. The DeepSeek-OCR model takes long pages of text, renders them as a single image, and then uses a highly efficient vision encoder to represent that image with a small number of "vision tokens." A language model can then decode these vision tokens to reconstruct the original tex.
This compression is extremely powerful. At a compression ratio of nearly 10-to-1 (for example, representing 600-700 text tokens with just 64 vision tokens), the model can reconstruct the original text with 96.5% precision. Higher compression rates (e.g. 20-to-1) lead to lower accuracy (e.g. 60%).
One of the biggest technical hurdles for vision models is processing high-resolution images. Doing so consumes massive amounts of computer memory. But worry not: DeepSeek-AI thought about this, too: they developed a novel architecture called the "DeepEncoder."
The DeepEncoder cleverly combines two different components. The first part, using window attention, focuses on local details within the image, much like reading words in a small area. The second part, which uses global attention, is designed to see the "big picture" and understand the overall context. The true innovation, however, lies in what connects them: a 16x token compressor. This module sits between the two components and dramatically reduces the amount of data that the most memory-intensive part has to handle. This smart design allows the model to process high-resolution documents with incredible detail without overloading its memory.
Another profound implication discussed in the paper is its potential to create a more natural and efficient memory system for AI. The researchers propose that optical compression could be used to simulate a form of memory decay, much like how human memory works.
The proposed mechanism is elegant. Recent conversations or newly added information could be stored as high-res images, making them crystal clear and easily accessible. As that information gets older, the images could be progressively downsized or blurred. In other words, the older the memory, the lower the resolution, similarly to what happens to our older memories. In this way, contexts optical compression method enables a form of memory decay that mirrors biological forgetting curves, where recent information maintains high fidelity while distant memories naturally fade through increased compression ratios.
Original article: https://arxiv.org/abs/2510.18234