Simone Bellavia's Web Page

Defeating Nondeterminism in LLM Inference

Thu, 11 Sep 2025 00:00:00 +0000

Link: https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/

iPhone Air

Wed, 10 Sep 2025 00:00:00 +0000

Apple presented the iPhone Air, the thinnest iPhone ever. This is the only new release from Apple that got my interest during their presentation event.

Its design is interesting: the entire logic board and A19 Pro chip are compacted into the camera bump (which includes both front and rear cameras). This iPhone is all battery and screen. IMHO, it seems like a strategic move for the coming years, for which this iPhone Air will serve as an experiment or a launchpad for ultra-thin devices, or simply as a research and development testbed for similar designs that enable powerful yet ultra-compact technologies.

Remarkable factor, iPhone Air has A19 Pro, which is Apple’s latest SoC. More in detail: it is built on TSMC’s N3P process node, and benefits from a 20% increase in transistor density compared to its predecessor, the N3E node, according to a 2023 IEEE study on semiconductor scaling. The A19 Pro features a six-core CPU with two high-performance cores and four efficiency cores, and 5-core GPU. Each GPU core has its own Neural Accelerators, which Apple claimed allows for MacBook Pro-level performance in an iPhone. On the new iPhone Pro, they are even more powerful. If the M5 chip will get this GPU upgrade… well, NVIDIA should start to feel some pressure.

To summarize: local AI to the Max. Next year, I want local LLMs on my phone.

npm debug and chalk packages compromised

Tue, 09 Sep 2025 22:10:00 +0200

Yesterday, a lot of npm packages have been compromised with malicious code. Following, a list of affected packages:

ansi-styles@6.2.2
debug@4.4.2 (appears to have been yanked as of 8 Sep 18:09 CEST)
chalk@5.6.1
supports-color@10.2.1
strip-ansi@7.1.1
ansi-regex@6.2.1
wrap-ansi@9.0.1
color-convert@3.1.1
color-name@2.0.1
is-arrayish@0.3.3
slice-ansi@7.1.1
color@5.0.1
color-string@2.1.1
simple-swizzle@0.2.3
supports-hyperlinks@4.1.1
has-ansi@6.0.1
chalk-template@1.1.1
backslash@0.2.1

and more, I think. I suggest to read the original post published on aikido.dev[1] and related HN discussion[2], both links are reported below.

All packages appear to contain a piece of code that would be executed on the client of a website, which silently intercepts crypto and web3 activity in the browser, manipulates wallet interactions, and rewrites payment destinations so that funds and approvals are redirected to attacker-controlled accounts without any obvious signs to the user (as shared from Aikido).

You can run grep or rg to check if your codebase has been impacted – thanks to sindresorhus for this suggestion:

rg -u --max-columns=80 _0x112fa8

This one requires ripgrep, but you can do the same with grep (ripgrep its Rust equivalent redesign).

My thoughts about this: dependency hell is real and these are the results. I agree with Mitchell Hashimoto when he says that npm should adopt some strategies to mitigate these risks, such as rejecting all dependencies tha have less than 1k LoC. I mean, let’s just avoid using external packages to determine if an object can act like an array.

Also, I would like to share one insight reported by DDerTyp on HN:

One of the most insidious parts of this malware’s payload, which isn’t getting enough attention, is how it chooses the replacement wallet address. It doesn’t just pick one at random from its list. It actually calculates the Levenshtein distance between the legitimate address and every address in its own list. It then selects the attacker’s address that is visually most similar to the original one. This is a brilliant piece of social engineering baked right into the code. It’s designed to specifically defeat the common security habit of only checking the first and last few characters of an address before confirming a transaction.

Needs a little bit of more investigation, for which I don’t have enough time, but looks interesting.

[1] Original post

[2] Hacker News discussion

New Weblog

Tue, 09 Sep 2025 00:00:00 +0000

I decided to change the template and layout of this site. I often find myself writing short notes very quickly during the day, which don’t always fit the traditional blog post format, since they are very brief. Also, I don’t always have the time to write long posts. For this reason, I modified the template to allow me to publish and share short Notes (like the one you are reading right now). Of course, I will continue to write and publish blog Posts when I have time.

On top of that, kinoroll.com no longer exists: the domain expired and I don’t intend to renew it. I preferred to move it here, onto my personal site! This way I can centralize everything and not have to manage different sites and domains. So the Kinoroll section will show the old kinoroll.com, but from now on I’ll share the links I like, along with some thoughts when possible, in the Link section, creating my own personal blogroll.

I hope my blog is easier to navigate now. I was strongly inspired by Simon Willison.

Using Claude Code to modernize a 25-year-old kernel driver

Tue, 09 Sep 2025 00:00:00 +0000

Link: https://dmitrybrant.com/2025/09/07/using-claude-code-to-modernize-a-25-year-old-kernel-driver

This inspires me to do something similar with an old printer that I have, a Canon MG3500, which does not work anymore because it’s not supported by last OS versions.

Fil-C, a memory safe implementation of C and C++

Sat, 06 Sep 2025 00:00:00 +0000

Yesterday, Fil-C popped up to the top of Hacker News. This time the submission got a fair amount of traction, sparking a lot of interest in the community, including a comment from Andrew Kelley. In fact, I’ve been interested in Fil-C for about a year already: my first submission on Hacker News was eight months ago. So I can say I’ve been actively following the project’s progress, also thanks to the activity of its creator, @filpizlo, on Twitter.

Fil-C is a compiler that implements the C and C++ languages with a memory-safe approach. Recently, Filip has published more documentation about the Garbage Collector and about the capabilities he calls “InvisiCaps”, which are more related to pointer safety.

Well, for me this is kind of a dream. I love the C language, it’s my favorite, but I admit I have some skill issues when it comes to memory management, though not because of the language itself, but rather due to my own code-writing proficiency, which could definitely be better. Recently, I’ve been exploring Rust and Zig precisely for this reason, and I’ve found myself appreciating Zig more than Rust because of its minimalism. Having a memory-safe implementation of C would therefore resolve a lot of the headaches caused by memory management.

Fil-C seems like the sweet spot between academic research and pragmatic work. Beyond the documentation, there’s also a list of programs already ported to Fil-C, showing that sometimes no code changes are required, and when they are, the effort is moderate.

So, the next step for me is to dig deeper into the topic and try it out myself! In the meantime, I thought it would be fair to personally share what Filip is doing, because the project deserves much more attention than it’s currently getting, imo.

Morphing the Divina Commedia into byte tokens with Zig

Mon, 18 Aug 2025 00:00:00 +0000

Nothing fancy. I just dumped the Divina Commedia into a contiguous u16 slice.

const path = "commedia.txt";
const buf = try tok.tokenizeFile(allocator, path);
defer allocator.free(buf.data);

Running it:

$ zig run src/main.zig -- commedia.txt
tokens: 300682 (expected 300682)
head: { 10, 32, 32, 78, 101, 108, 32, 109, 101, 122 }

Just 300682 u16s waiting for an embedding matrix :)

The more you fuck around, the more you find out

Wed, 13 Aug 2025 00:00:00 +0000

There is a very interesting law that I think is worth sharing:

I will apply it more often.

I am Sicilian and I support Strait of Messina Bridge

Mon, 11 Aug 2025 00:00:00 +0000

I’m Sicilian, and I support building the Strait of Messina Bridge.

Premise: I don’t vote for Matteo Salvini, the current Minister of Infrastructure and Transport and promoter of the project. Even though, to be fair, many before him have tried to start construction on the Bridge, since it’s been discussed ever since the post-World War II era. In modern times, both Silvio Berlusconi and Matteo Renzi (two politicians from opposing sides) have pushed the idea. So my support doesn’t come from political or ideological alignment, but from a pragmatic standpoint.

I’m in favor of building the Bridge because I’m in favor of progress. We’re talking about a world-class engineering project: a suspension bridge with a single span of 3,300 meters (over two miles), the longest in the world. Naturally, it will bring together some of the brightest minds to contribute to its construction. We’re daring to attempt something unprecedented and structurally thrilling, and we’re doing it between two Italian regions, Sicily and Calabria, that have historically been intellectual and cultural cradles not only for Italy, but for the entire world. That excites me.

It’s also an investment currently estimated at 13 billion euros. With projects of this scale, I doubt the final figure will really stay there, because you have to factor in circumstances and unforeseen events. So, to that number, you can probably add another 17 or 18 billion euros. On paper, it has all the potential to generate new jobs and opportunities, both for companies in the sector and beyond.

A bridge means greater connectivity and logistical efficiency. Right now, traveling from Sicily to Calabria happens via ferry, something that has become an icon for us Sicilians. Yes, this system could be improved and strengthened. But it’s also true that it still represents an obstacle to the continuous flow of rail traffic between regions. That problem could be solved with the Bridge, which, according to the plan, will have two highway lanes and, in the middle, two railway tracks. This would allow high-speed trains to reach Sicily.

These are some of the advantages that lead me to support the construction of the bridge, regardless of my political ideology, which has no influence on my opinion here. But what are the counterarguments from the public? Below the ones I read most often, with some of my thoughts.

Seismic risk: according to the plan, the bridge will have to withstand earthquakes up to a certain level on the Richter scale. I expect the design to largely incorporate structural and tolerance systems for such events.

Environmental impact: this factor would definitely need to be monitored, even though it has been overlooked in the construction of other major public works, not only in Italy but around the world. Speaking from ignorance, I imagine the terrain will need to be modified for the creation of the towers’ supporting foundations. I expect this to be done in compliance with existing environmental risk regulations.

High costs: this is undeniably a very expensive project. What’s often not taken into account, however, are the long-term returns and the economic flow generated by indirect effects: creation of new businesses, jobs, and so on.

I’m genuinely enthusiastic about the project. Above all, the concept of “connection” fascinates me and sparks my imagination.

Helm: what I like and dislike

Sun, 22 Jun 2025 00:00:00 +0000

I have been working with Helm for some time now and I’ve developed a love-hate relationship with it. It seems to have become the de-facto package manager for K8s, and there are good reasons for that. But like any tool, it comes with its own set of frustrations that can make you question your life choices. Some honest thoughts follow.

What I like

I find Helm to be fairly simple to get started. For all its complexity under the hood, Helm has a gentle learning curve in my opinion. You can start deploying applications with basic helm install commands, then gradually learn about values, dependencies, and templating (unlucky) as they need to.

Dependency management is good. Helm dependencies solve this quite elegantly, handling resolution, download and installation.

Multi-environment support lets you deploy the same chart with different configurations. It’s a good feature when you are forced to deal with multiple environments. Also, in that regard rollbacks sometimes save you. helm rollback app 3 and that’s it. You’re back at revision 3. It just works.

What I don’t like

At the same time, I think Helm has fundamental design flaws that make it increasingly unsuitable for managing complex applications in modern, stratified infrastructures. First one: Go templating system. It is Helm’s biggest strength and its greatest weakness. It offers immense flexibility which is necessary for complex applications, but it results in code that is hard to reason about. The syntax is unintuitive and verbose, error messages are cryptic and unhelpful. Which value is nil? What’s the context? Why can’t I just get a proper stacktrace? It’s not rare to end up debugging templates and ending up commenting out sections and re-rendering repeatedly. This happened because I guess it was a natural choice to inherit Go templates, since they are “native”. In any case, the fundamental issue isn’t just that Go templates are bad, it’s that templating YAML is inherently problematic. It often leads to indentation bugs that break file-parsing and difficulty validating templates before rendering.

I don’t know if I am the only one, but I miss some kind of Drift detection logic. Someone manually edits a deployment and Helm has no idea. The next helm upgrade might work, might partially fail, or might silently ignore the drift. The fact is that in complex installations with numerous microservices and dependencies, manual interventions are often necessary because Helm lacks native installation ordering. When one service fails to start while waiting for dependencies, the pragmatic solution is manual editing—but this silently breaks Helm’s understanding of your system state.

Helm’s approach to secrets is “just base64 encode it and hope for the best.” Tools like Helm Secrets exist, but they feel like band-aids on a fundamental design issue.

Helm is purely client-side and imperative, even if we consider it as partially-declarative. This goes back to point 3. Helm fires commands and just trusts that what it thinks is deployed actually matches reality. Also, everything requires manual intervention. In fact, many teams end up with (imho) awkward combinations: ArgoCD + Helm, Flux + Helm, or just Helm + CI/CD. The fact that we’re retrofitting declarative behavior onto an imperative tool shows just how much the ecosystem has evolved past Helm’s original assumption.

Helm doesn’t manage an install order or readiness. Helm simply ensures the sub-charts are included in the final rendered manifests. There is absolutely no guarantee of install order or readiness. As I said in point 3 I think, you usually solve this with hacks like init containers, complex readiness probes, or by running multiple helm install commands in a specific order. I don’t like it.

The Alternative

I don’t have an alternative as of now. Also, I don’t think there could be a better, community-supported alternative to Helm. Not because I think it’s not feasible, quite the contrary! Helm is widely used at the enterprise level and is fully supported by the CNCF, so I just believe that an alternative must be truly worthwhile to justify a change. In any case, I believe that the next step beyond Helm is a native Kubernetes system that uses CRDs, is declarative, and imposes a package structure standard similar to Linux. I hope to be able to create a proof of concept in the future :-)

Recurrent Neural Networks (RNNs) explained

Fri, 16 Feb 2024 00:00:00 +0000

Recurrent Neural Networks (RNNs) are a class of neural networks designed to process sequential data, such as time series, text, audio, or any other type of sequential data. RNNs were developed to overcome the limitations of feedforward networks that don’t maintain a memory of past information.

From Feedforward to RNNs

Let’s take a general look at Feedforward Networks, to then better understand RNNs.

In feedforward networks, input is processed in a single pass, from input to output, without retaining any memory of previous inputs. Each input is treated independently from the others, making feedforward networks suboptimal for tasks requiring an understanding of data sequences or temporal contexts. This behavior is a direct consequence of the structure of a feedforward network: relatively simple and linear, with layers of neurons connecting directly one after the other in one direction, without cycles. Each layer receives input only from the previous layer and sends output only to the next layer. Feedforward networks are employed in specific areas, such as classification and regression tasks where the order of inputs isn’t relevant (image classification or predicting time-independent values).

RNNs, unlike feedforward networks with unidirectional information flow and independent layer weights, feature recurrent connections enabling each hidden layer to be shaped by both the current input and the previous hidden state’s output. This creates an “internal memory”, useful for processing data sequences by allowing the network to consider past inputs, thus handling temporal dependencies. Ideal for tasks like language modeling, speech recognition, and sequence generation, RNNs use this memory to manage the sequence context, a key difference from the simpler feedforward approach. We will see their graphical visualization later, but first, a mathematical digression is useful to better understand how an RNN works.

RNNs Key Equations

In RNNs, there are multiple key equations and important mathematical concepts to understand. However, we will focus on the two most comprehensive equations for an RNN, which specify all the necessary calculations for computation at each time step on the forward pass in a simple recurrent neural network. $$h^{(t)}=\sigma(W^{hx}x^{(t)}+W^{hh}h^{t-1}+b_h)$$ This equation represents how the hidden state $h^{(t)}$ is updated at time $t$. Each hidden state is calculated based on three components:

The product of the weight matrix between the input and the hidden layer $W^{hx}$ and the input at time $t$, $x^{(t)}$.
The product of the recurrent weight matrix between the hidden layer and itself at adjacent time steps $W^{hh}$ and the hidden state at the previous time $t-1$, $h^{(t-1)}$.
The bias vector $b_h$.

The same equation can be interpreted in a slightly different and alternative way, as follows:

$$h^{(t)}=f(h^{(t-1)}, x^{(t)}; \theta)$$

These three terms are summed together and then passed through an activation function $\sigma$, such as the sigmoid function or the hyperbolic tangent, which updates the RNN’s hidden state $h^{(t)}$.

The purpose of the activation function is to introduce non-linearity into the model, allowing the model itself to represent complex and non-linear relationships between input and output variables.

The matrices mentioned in the formula are weight matrices that connect the previous hidden state to the current hidden state, the input to the hidden state, and the hidden state to the output. $$\hat{y}=softmax(W^{yh}h^{(t)}+b_y)$$ This equation is a typical representation of the output phase in an RNN, where the output $\hat{y}^{(t)}$ at time $t$ is calculated using the softmax function. In detail:

$\hat{y}^{(t)}$ is the output predicted by the network at time $t$,
$W^{yh}$ is the weight matrix that connects the hidden state $h^{(t)}$ to the output,
$h^{(t)}$ is the hidden state at time $t$,
$b_y$ is the bias vector associated with the output,
the softmax function is an activation function used in multi-class classifications to transform scores (logits) into probabilities.

In an RNN, the softmax output $\hat{y}^{(t)}$ is often used to determine the probability of each possible next element in a sequence, such as the next word in a text. It can be used to evaluate performance during training or to generate new sequences during the inference process.

Design Patterns and Principles

Based on the definition we have given of RNNs, and in light of their computational structure, we can identify the following principles or design patterns of RNNs:

Cyclic Structure: Each unit of the RNN receives two inputs: the current element of the data sequence and the “hidden state” from the previous unit, which acts as a form of memory that carries information from one element to another in the sequence.
Hidden State: The hidden state is the heart of RNNs, allowing the network to accumulate knowledge throughout the sequence. Recurrent hidden layers leverage a cyclic memory mechanism that makes them “stateful,” allowing the network to accumulate knowledge over the sequence. At each time step, the hidden state is updated based on both the current input and the previous hidden state. This feature ensures that, alongside the evolution of the hidden layers, a hidden state is also developed, which is crucial for the RNN’s ability to process sequential information.
Shared Parameters: Unlike feedforward networks, where each layer has its own set of parameters, in an RNN, the same set of parameters (weights) is used at every time step. This concept is known as “shared parameters” and allows the RNN to process sequences of variable length with a fixed model.
Sequential Structure: RNNs are designed to work with sequential data, and their architecture reflects this characteristic. Inputs are processed one after the other, and the output from one step can influence the processing of the next.
Output: At each time step, the RNN can produce an output based on the current hidden state. The output can be generated at each step (for example, in language modeling) or only at the end of the sequence (for example, in sequence classification).

With an understanding of the fundamental building blocks of RNNs, we now delve into their computational representation.

Computational Graph and Backpropagation

It is important to delve deeper into the concept of unfolding. In the context of RNNs, the term unfolding refers to the process of transforming the recurrent network, which intrinsically has a cyclic structure due to its hidden state passing from one time step to the next, into an extended (or unfolded) version that displays the entire sequence of operations across time steps. This unfolding transforms the cyclic structure into a chain of replicas of the network, one for each time step in the sequence, making evident how information flows through the time steps.

Following the concept of unfolding, it’s essential to examine Backpropagation Through Time (BPTT): it is an algorithm that emerges as a variant of the standard feedforward Backpropagation algorithm, specifically designed for the training of RNNs. This variation of the standard algorithm is the necessary consequence of the recurrent nature of RNNs, which requires a different approach for the calculation of gradients and for the updating of weights during the training process.

The BPTT algorithm works by following these steps:

Unfolding the network: as we have previously seen, the RNN is unfolded over time to transform it into a feedforward network. This allows treating the temporal dependency between sequential inputs as connections within a larger network.
Forward propagation: the input is fed through the unfolded network, and the output is calculated for each time step.
Error calculation: the error (the difference between the expected value and the actual one) for each output in the sequence is calculated.
Backward error propagation (Backpropagation): the error is propagated backward to calculate the gradients of the weights relative to the error. This is the most critical step for training, as it determines how to modify the weights to reduce the error.
Weight update: finally, the weights are updated based on the calculated gradients. Stochastic Gradient Descent is typically used as the optimization algorithm.

Stochastic Gradient Descent (SGD) is a variant of the Gradient Descent algorithm, which is a technique for minimizing the cost (or loss) function associated with a model, i.e., the measure of how “far” the model is from making accurate predictions. I believe I will write a dedicated blog post on the topic. For now, we mentioned it only to connect to the next paragraph, where we will address the problem of Vanishing/Exploding Gradient, the major limitation of RNNs.

Vanishing/Exploding Gradient Dilemma

The problem of vanishing gradients and exploding gradients are two significant challenges in training RNNs, especially when working with very long sequences. These issues affect an RNN’s ability to learn long-term dependencies between elements in a sequence. Let’s look in detail at what these problems entail and how they manifest.

The vanishing gradient problem occurs when the gradient (the derivative of the loss function with respect to the network’s weights) tends towards zero during the backpropagation process. In RNNs, this is particularly problematic for long-term dependencies because of the repeated multiplication of small gradients across timesteps during BPTT. This can result in a gradient that becomes extremely small. Consequently, the weight update becomes insignificant, effectively rendering the network unable to learn from inputs that are distant in time. For this reason, the network may struggle to learn and retain information from earlier parts of the sequence.

Conversely, the exploding gradient problem occurs when gradients become excessively large during backpropagation. This can lead to overly large weight updates, causing instability in the network and making the training process diverge. In RNNs, this is often due to the repeated multiplication of large gradients across timesteps, which can exponentially increase the gradient’s value as it is backpropagated in time.

To overcome this issue, various techniques have been adopted that led to the development of new neural networks, such as Long Short-Term Memory (LSTM) networks. LSTMs were designed as a specific type of RNN cell with additional gating mechanisms, allowing them to selectively remember or forget information based on its relevance. These gating mechanisms help preserve important long-term dependencies in the hidden state vector and enable LSTMs to better handle sequences with long-term dependencies than traditional RNNs.

Ending

I’ve tried to explain clearly and summarize the most important points about RNNs. I dedicated a blog post to RNNs because I consider them important and foundational for the study of subsequent models, like Transformers. Anyway, even though traditional RNNs face difficulties in capturing long-term dependencies, variants like LSTM and GRU have been developed to overcome these limits. To understand and use them, I believe it’s necessary to know how RNN architectures are constructed.

Sources

Applying Deep Learning to detect Rhegmatogenous Retinal Detachment

Sat, 20 Jun 2020 00:00:00 +0000

Retinal detachment is an eye disease and is one of the most serious ocular emergencies. It occurs after a layer of the retina - essential tissue for vision - is lifted from the pigmented epithelial tissue, dragging the blood vessels that supply nutrients and the eye with it. If the retina is no longer nourished through contact with the pigment epithelium layer, cell death and a progressive and functional loss of vision or of the detached portion of the retina occur after 48 hours.

However, the retina can be re-attached surgically, resulting in ocular improvement, as long as the detachment time is not excessive (Heussen N. et al., 2012). Below it is demonstrated how machine learning and deep learning techniques can play a crucial role in quickly saving the eyes of a patient with Rhegmatogenous Retinal Detachment.

Retinal detachment: types and consequences

There are four types of retinal detachment:

Rhegmatogenous Retinal Detachment (RRD): the most frequent, is due to a break in the retina that allows the vitreous, gelatinous liquid that is inside the eye, to enter the subretinal space, thus allowing the detachment of its adherent portion;
Tractional: typical in cases of retinal ischemia, it is generated by fibro-vascular tissue bridles that exert a centrifugal traction on the retina capable of ungluing it;
Exudative: it is due to the presence of fluid under the retina following inflammation or vascular lesions;
Mixed forms.

Rhegmatogenous Retinal Detachment is a serious condition that can lead to blindness, nevertheless the possibility of cure is directly proportional to the timeliness of appropriate treatment that anticipates the period of cell death of the retina. Early diagnosis and rapid intervention play a fundamental role, and in this case Deep Learning techniques represent an aid capable of meeting these needs. In the study by Ohsugi and collaborators, Machine Learning technologies are applied to detect RRD, using ultra-wide field fundus images and studying their performance (Ohsugi, H., Tabuchi, H., Enno, H. et al., 2017).

These images can be realized thanks to techniques conducted by means of scanning laser ophthalmoscope (SLO), an instrument used for the evaluation of the fundus of the retina. It was designed to capture images of the retinal layers simultaneously with confocal images of the fundus (W H Woon, F W Fitzke, A C Bird, J Marshall, 1992). Wide field fundus images can be acquired easily without pupillary mydriasis and without causing medical complications, so even examiners not qualified to perform ophthalmological surgeries can acquire the images in a completely safe way, making this instrument ideal especially in cases of emergency in which ophthalmologists are not available.

Representative fundus images obtained by ultra wide field scanning laser ophthalmoscopy. Ultra wide field right fundus images without rhegmatogenous retinal detachment (RRD) (a) and with RRD (b). The arrow indicates the retinal break and the arrowheads indicate the areas of RRD.

Machine Learning and RRD early diagnosis

The study evaluates precisely the ability of a deep learning technology to detect RRD using images obtained through SLO.

A Machine Learning model uses a multilayer CNN capable of automatically learning the characterizing patterns of the images and making them a classification system (Deng, J. et al., 2009).

CNN architecture used. The Input is represented by the RGB 96x96 pixel image. Each of the convolutional layers (Conv1–3) is followed by an activation function layer (ReLU), pooling layers (MP1–3), and two fully connected layers (FC1, FC2).

The convolutional level obtains the characteristics of the input through convolutional filters, the maximum levels of pooling (MP1, MP2 and MP3) allow a more generic recognition. The last two layers (FC1, FC2) are completely connected and remove spatial information from the quantities of extracted features and statistically recognize the objective that CNN aims to achieve. For the training of the neural network, 100 images were processed, in addition an optimization algorithm called AdaGrad was implemented to correctly train the network weights.

411 images from 407 RRD patients and 420 images from 238 non-RRD patients were used. The deep learning model demonstrated a high sensitivity of 97.6% [95% confidence interval (CI), 94.2-100%] and a high specificity of 96.5%. The model used, therefore, can improve medical care and increase the timeliness of intervention, resulting in an early diagnosis that can prevent RRD-derived blindness.

Bibliography

Heussen N. et al. (2012). Scleral buckling versus primary vitrectomy in rhegmatogenous retinal detachment study (SPR study): Risk assessment of anatomical outcome. SPR study report no. 7. Acta Ophthalmologica.
Ohsugi, H., Tabuchi, H., Enno, H. et al. (2017). Accuracy of deep learning, a machine-learning technology, using ultra–wide-field fundus ophthalmoscopy for detecting rhegmatogenous retinal detachment. Scientific Reports, Article number: 9425.
W H Woon, F W Fitzke, A C Bird, J Marshall. (1992). Confocal imaging of the fundus using a scanning laser ophthalmoscope. British Journal of Ophthalmology.
Deng, J. et al. (2009). Imagenet: a large-scale hierarchical image database. Computer Vision and Pattern Recognition. IEEE Conference on Computer Vision and Pattern Recognition, 248–255.

A generic introduction to Clinical Decision Support Systems

Sat, 13 Jun 2020 00:00:00 +0000

A Clinical Decision Support System is defined as an “active knowledge system, which uses two or more patient data elements to generate case-specific advice” (DSS, 2001). This implies that a CDSS is simply a Decision Support System focused on the use of knowledge management in order to obtain clinical advice for patient care based on multiple variables inherent to it. The main purpose of the modern CDSS, therefore, is to assist doctors in patient treatment cases or in emergency situations (Benner, 2007).

CDSS: an introduction

As anticipated, at the beginning the CDSS were conceived to literally take important decisions, effectively replacing the figure of the doctor: the latter entered the information and waited for the CDSS to provide the “right” choice, so as to be able to simply act on that. However, today a CDSS only provides suggestions for the doctor, who examines and selects the informations provided, keeping the useful suggestions and discarding those he deems unsuitable (DSS, 2001). An important example of CDSS is certainly CBR, Case-based Reasoning (Shahina Begum, Mobyen Uddin Ahmed, Peter Funk, N. Xiong, Mia Folke, 2001), which uses data from previous clinical cases to indicate a specific health treatment. A CBR system was developed to facilitate radiotherapy treatment planning for brain cancer: given a new patient case, the CBR system retrieves a similar case from an archive of patients successfully treated with the suggested treatment plan, re-adapting it to meet the specific needs of the new case - the results obtained using real-world brain cancer patient cases have shown that the success rate of the new CBR recovery is higher than that of the original system (Khussainova, Gulmira; Petrovic, Sanja; Jagannathan, Rupa, 2015).

A Clinical Decision Support System for Prevention of Venous Thromboembolism: Effect on Physician Behavior

In the study by Durieux et al. the effect of a CDSS in physician behavior during clinical decisions for the prevention of venous thromboembolism was investigated (Durieux P, Nizard R, Ravaud P, Mounier N, Lepage E, 2000). In hospital clinical practice, venous thromboembolism remains a serious problem and pulmonary embolism is a leading cause of death (Frederick A. Anderson, H. Brownell Wheeler, Robert J. Goldberg, 1991). The most effective way to prevent fatal and non-fatal venous thromboembolism is to use routine prophylaxis for high-risk patients. Optimal decisions on the use of anticoagulants in the prevention of venous thromboembolism require access to a large amount of complex information to assess the degree of risk of hospitalized patients. The objective of the study was to determine whether the presentation of guidelines for the prophylaxis of venous thromboembolism using a CDSS increased the percentage of appropriate clinical practice decisions made regarding the proportion of anticoagulant prescription.

The study took place in time series between December 1997 and July 1999 in an orthopedic surgery department of a university hospital located in Paris, on a sample of 1971 patients undergoing orthopedic surgery. A CDSS designed to provide immediate information related to the prevention of venous thromboembolism among surgical patients was integrated into daily medical practice during three 10-week intervention periods, alternating with four 10-week control periods, with a 4-week washout between each period. The proportion of appropriate prescriptions for anticoagulants in line with the established clinical protocols was then analyzed during the intervention periods, and then compared with the control group. The results show the importance of using the CDSS: doctors complied with the guidelines in 82.8% of cases during the control periods and in 94.9% of cases during the intervention periods. During each intervention period, the adequacy of prescribing increased significantly, and each time the CDSS was removed, the medical practice reverted to that observed prior to the intervention. It can be confirmed that the implementation of clinical guidelines for venous thromboembolism prophylaxis through a CDSS used and integrated into the hospital information system has changed physician behavior and improved compliance with the guidelines.

Bibliography

DSS. (2001). Open Clinical - Knowledge Management for Medical Care
Benner, E. (2007). Clinical Decision Support Systems. Springer.
Shahina Begum, Mobyen Uddin Ahmed, Peter Funk, N. Xiong, Mia Folke. (2001). Case-Based Reasoning Systems in the Health Sciences: A Survey of Recent Trends and Developments. Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions, 421-434.
Khussainova, Gulmira; Petrovic, Sanja; Jagannathan, Rupa. (2015). Retrieval with clustering in a case-based reasoning system for radiotherapy treatment planning. Journal of Physics: Conference Series.
Durieux P, Nizard R, Ravaud P, Mounier N, Lepage E. (2000). A Clinical Decision Support System for Prevention of Venous Thromboembolism: Effect on Physician Behavior. JAMA.
Frederick A. Anderson, H. Brownell Wheeler, Robert J. Goldberg. (1991). A Population-Based Perspective of the Hospital Incidence and Case-Fatality Rates of Deep Vein Thrombosis and Pulmonary Embolism. JAMA, 933-938.

A (Long) Peek into Reinforcement Learning

Mon, 01 Jan 0001 00:00:00 +0000

(Re)Using rustc components in gccrs

Mon, 01 Jan 0001 00:00:00 +0000

1 week with David Beazley and SICP

Mon, 01 Jan 0001 00:00:00 +0000

418 I'm a teapot

Mon, 01 Jan 0001 00:00:00 +0000

5000x faster CRDTs: An Adventure in Optimization

Mon, 01 Jan 0001 00:00:00 +0000

6 Techniques I Use to Create a Great User Experience for Shell Scripts

Mon, 01 Jan 0001 00:00:00 +0000

7 Common Mistakes in Architecture Diagrams

Mon, 01 Jan 0001 00:00:00 +0000

7 Databases in 7 Weeks for 2025

Mon, 01 Jan 0001 00:00:00 +0000

A calculator app? Anyone could make that

Mon, 01 Jan 0001 00:00:00 +0000

A computer can never be held accountable

Mon, 01 Jan 0001 00:00:00 +0000

A generalised solution to distributed consensus

Mon, 01 Jan 0001 00:00:00 +0000

A half-hour to learn Rust

Mon, 01 Jan 0001 00:00:00 +0000

A short introduction to RLHF and post-training focused on language models

Mon, 01 Jan 0001 00:00:00 +0000

A Vanity VNC server (or Joke over RFB)

Mon, 01 Jan 0001 00:00:00 +0000

A year of funded FreeBSD

Mon, 01 Jan 0001 00:00:00 +0000

AAA gaming on Asahi Linux

Mon, 01 Jan 0001 00:00:00 +0000

About

Mon, 01 Jan 0001 00:00:00 +0000

Hi, I’m Simone Bellavia! I’m passionate about software engineering, deep learning, and exploring the intersection of technology and human creativity.

On this weblog, I share my thoughts on various topics that inspire me: from technical insights in AI and machine learning to reflections on building software and the tools that shape our digital world.

Connect

You can find me on Twitter where I share updates and engage with the tech community.

Thanks for stopping by!

Adding syntax to the CPython Interpreter

Mon, 01 Jan 0001 00:00:00 +0000

Advanced Python Features

Mon, 01 Jan 0001 00:00:00 +0000

AI for Real-time Fusion Plasma Behavior Prediction and Manipulation

Mon, 01 Jan 0001 00:00:00 +0000

Algorithm = Logic + Control

Mon, 01 Jan 0001 00:00:00 +0000

All the data can be yours

Mon, 01 Jan 0001 00:00:00 +0000

An almost pointless exercise in GPU optimization

Mon, 01 Jan 0001 00:00:00 +0000

An analysis of DeepSeek's R1-Zero and R1

Mon, 01 Jan 0001 00:00:00 +0000

An Illustrated Proof of the CAP Theorem

Mon, 01 Jan 0001 00:00:00 +0000

An Intro to DeepSeek's Distributed File System

Mon, 01 Jan 0001 00:00:00 +0000

An Intuitive Explanation of Sparse Autoencoders for LLM Interpretability

Mon, 01 Jan 0001 00:00:00 +0000

An Ultra Opinionated Guide to Reinforcement Learning

Mon, 01 Jan 0001 00:00:00 +0000

Analyzing IPv4 trades with gnuplot

Mon, 01 Jan 0001 00:00:00 +0000

Analyzing the codebase of Caffeine: a high performance caching library

Mon, 01 Jan 0001 00:00:00 +0000

API design note: Beware of adding an “Other” enum value

Mon, 01 Jan 0001 00:00:00 +0000

Apple’s Darwin OS and XNU Kernel Deep Dive

Mon, 01 Jan 0001 00:00:00 +0000

ARC-AGI without pretraining

Mon, 01 Jan 0001 00:00:00 +0000

Are We Web Yet?

Mon, 01 Jan 0001 00:00:00 +0000

Artificial photosynthesis directed toward organic synthesis

Mon, 01 Jan 0001 00:00:00 +0000

Async Rust can be a pleasure to work with (without Send + Sync + 'static)

Mon, 01 Jan 0001 00:00:00 +0000

Atomics And Concurrency

Mon, 01 Jan 0001 00:00:00 +0000

Attacking My Landlord's Boiler

Mon, 01 Jan 0001 00:00:00 +0000

Attacking Unix Systems Via CUPS, Part I

Mon, 01 Jan 0001 00:00:00 +0000

AVX Bitwise ternary logic instruction busted

Mon, 01 Jan 0001 00:00:00 +0000

B-trees and database indexes

Mon, 01 Jan 0001 00:00:00 +0000

Bayleaf Wireless Keyboard

Mon, 01 Jan 0001 00:00:00 +0000

Beating cuBLAS in Single-Precision General Matrix Multiplication

Mon, 01 Jan 0001 00:00:00 +0000

Blogging in the age of AI

Mon, 01 Jan 0001 00:00:00 +0000

Build a Tiny Certificate Authority For Your Homelab

Mon, 01 Jan 0001 00:00:00 +0000

Build Your Own Text Editor

Mon, 01 Jan 0001 00:00:00 +0000

Building a full-text search engine in 150 lines of Python code

Mon, 01 Jan 0001 00:00:00 +0000

Building an AI Server on a budget

Mon, 01 Jan 0001 00:00:00 +0000

Caches in Rust

Mon, 01 Jan 0001 00:00:00 +0000

Can your terminal do emojis? How big?

Mon, 01 Jan 0001 00:00:00 +0000

Cheating the Reaper in Go

Mon, 01 Jan 0001 00:00:00 +0000

Claude 4 System Card

Mon, 01 Jan 0001 00:00:00 +0000

Compiling a Neural Net to C for a 1,744x speedup

Mon, 01 Jan 0001 00:00:00 +0000

Composable SQL

Mon, 01 Jan 0001 00:00:00 +0000

Concise Machine Learning

Mon, 01 Jan 0001 00:00:00 +0000

Conditionally Disabling Code with Comptime in Zig

Mon, 01 Jan 0001 00:00:00 +0000

Constraints in Go

Mon, 01 Jan 0001 00:00:00 +0000

Convolutions, Polynomials and Flipped Kernels

Mon, 01 Jan 0001 00:00:00 +0000

Crafting an event system

Mon, 01 Jan 0001 00:00:00 +0000

Creating a Proxmox or QEMU ChromeOS Flex VM

Mon, 01 Jan 0001 00:00:00 +0000

Debugging Hetzner: Uncovering Failures with Powerstat, Sensors, and Dmidecode

Mon, 01 Jan 0001 00:00:00 +0000

Debugging: Indispensable rules for finding even the most elusive problems

Mon, 01 Jan 0001 00:00:00 +0000

Decompiling 2024: A Year of Resurgance in Decompilation Research

Mon, 01 Jan 0001 00:00:00 +0000

Deep Reinforcement Learning: Pong from Pixels

Mon, 01 Jan 0001 00:00:00 +0000

DeepSeek-R1 with Dynamic 1.58-bit Quantization

Mon, 01 Jan 0001 00:00:00 +0000

Demystifying Debuggers

Mon, 01 Jan 0001 00:00:00 +0000

Denuvo Analysis

Mon, 01 Jan 0001 00:00:00 +0000

Design Patterns Are Temporary, Language Features Are Forever

Mon, 01 Jan 0001 00:00:00 +0000

Development On Apple Silicon with UTM

Mon, 01 Jan 0001 00:00:00 +0000

Diagrams AI Can, and Cannot, Generate

Mon, 01 Jan 0001 00:00:00 +0000

Differentiable Logic Cellular Automata

Mon, 01 Jan 0001 00:00:00 +0000

Discovery Coding

Mon, 01 Jan 0001 00:00:00 +0000

Distributed Systems Programming Has Stalled

Mon, 01 Jan 0001 00:00:00 +0000

Distributed Systems Shibboleths

Mon, 01 Jan 0001 00:00:00 +0000

Distributed Transactions at Scale in Amazon DynamoDB

Mon, 01 Jan 0001 00:00:00 +0000

Ditching Obsidian and building my own

Mon, 01 Jan 0001 00:00:00 +0000

Ditherpunk — The article I wish I had about monochrome image dithering

Mon, 01 Jan 0001 00:00:00 +0000

Don't defer Close() on writable files

Mon, 01 Jan 0001 00:00:00 +0000

Easy 6502

Mon, 01 Jan 0001 00:00:00 +0000

Efficient Distributed LLM Inference with Dynamic Partitioning

Mon, 01 Jan 0001 00:00:00 +0000

Emerging reasoning with reinforcement learning

Mon, 01 Jan 0001 00:00:00 +0000

Entropy of a Large Language Model output

Mon, 01 Jan 0001 00:00:00 +0000

Equals as Assignment

Mon, 01 Jan 0001 00:00:00 +0000

Exotic Data Structures

Mon, 01 Jan 0001 00:00:00 +0000

Exploring Polymorphism in C: Lessons from Linux and FFmpeg's Code Design

Mon, 01 Jan 0001 00:00:00 +0000

Fast Allocations in Ruby 3.5

Mon, 01 Jan 0001 00:00:00 +0000

Fast LLM Inference From Scratch

Mon, 01 Jan 0001 00:00:00 +0000

Faster interpreters in Go: Catching up with C++

Mon, 01 Jan 0001 00:00:00 +0000

Faster sorting with SIMD CUDA intrinsics

Mon, 01 Jan 0001 00:00:00 +0000

File Over App: A Philosophy for Digital Longevity

Mon, 01 Jan 0001 00:00:00 +0000

Finding Exploits in Video Games

Mon, 01 Jan 0001 00:00:00 +0000

Finding Paths of Least Action with Gradient Descent

Mon, 01 Jan 0001 00:00:00 +0000

Flattening ASTs (and Other Compiler Data Structures)

Mon, 01 Jan 0001 00:00:00 +0000

Flattening Rust's Learning Curve

Mon, 01 Jan 0001 00:00:00 +0000

Foundations of Large Language Models

Mon, 01 Jan 0001 00:00:00 +0000

From ASCII to ASIC: Porting donut.c to a tiny slice of silicon

Mon, 01 Jan 0001 00:00:00 +0000

From the Transistor

Mon, 01 Jan 0001 00:00:00 +0000

Fun with C++26 reflection - Keyword Arguments

Mon, 01 Jan 0001 00:00:00 +0000

Game Hacking - Valve Anti-Cheat (VAC)

Mon, 01 Jan 0001 00:00:00 +0000

Gaussian integration is cool

Mon, 01 Jan 0001 00:00:00 +0000

Gemini Diffusion

Mon, 01 Jan 0001 00:00:00 +0000

Generalized Transformers from Applicative Functors

Mon, 01 Jan 0001 00:00:00 +0000

Generating Pixels One by One

Mon, 01 Jan 0001 00:00:00 +0000

ggml : x2 speed for WASM by optimizing SIMD

Mon, 01 Jan 0001 00:00:00 +0000

Ghostty Is Native—So What?

Mon, 01 Jan 0001 00:00:00 +0000

Git and jujutsu: in miniature

Mon, 01 Jan 0001 00:00:00 +0000

Go 1.24's go tool is one of the best additions to the ecosystem in years

Mon, 01 Jan 0001 00:00:00 +0000

Go Data Structures

Mon, 01 Jan 0001 00:00:00 +0000

Go Data Structures: Interfaces

Mon, 01 Jan 0001 00:00:00 +0000

Go Garbage Collector

Mon, 01 Jan 0001 00:00:00 +0000

Go Optimization Guide

Mon, 01 Jan 0001 00:00:00 +0000

Going down the rabbit hole of Git's new bundle-uri

Mon, 01 Jan 0001 00:00:00 +0000

Golang on the PlayStation 2

Mon, 01 Jan 0001 00:00:00 +0000

GPU Glossary

Mon, 01 Jan 0001 00:00:00 +0000

GPU-driven clustered forward renderer

Mon, 01 Jan 0001 00:00:00 +0000

Hacker News

Mon, 01 Jan 0001 00:00:00 +0000

Hacking 700 Million Electronic Arts Accounts

Mon, 01 Jan 0001 00:00:00 +0000

Hacking Subaru: Tracking and Controlling Cars via the STARLINK Admin Panel

Mon, 01 Jan 0001 00:00:00 +0000

Hallucinations in code are the least dangerous form of LLM mistakes

Mon, 01 Jan 0001 00:00:00 +0000

Hard Mode Rust

Mon, 01 Jan 0001 00:00:00 +0000

Hard numbers in the Wayland vs X11 input latency discussion

Mon, 01 Jan 0001 00:00:00 +0000

HardBreak - a Hardware Hacking Wiki

Mon, 01 Jan 0001 00:00:00 +0000

Haskell: A Great Procedural Language

Mon, 01 Jan 0001 00:00:00 +0000

Highly efficient matrix transpose in Mojo

Mon, 01 Jan 0001 00:00:00 +0000

How Discord Stores Trillions of Messages

Mon, 01 Jan 0001 00:00:00 +0000

How Does Ada's Memory Safety Compare Against Rust?

Mon, 01 Jan 0001 00:00:00 +0000

How I helped fix sleep-wake hangs on Linux with AMD GPUs

Mon, 01 Jan 0001 00:00:00 +0000

How I like to install NixOS (declaratively)

Mon, 01 Jan 0001 00:00:00 +0000

How I program with Agents

Mon, 01 Jan 0001 00:00:00 +0000

How I ruined my vacation by reverse engineering WSC

Mon, 01 Jan 0001 00:00:00 +0000

How I ship projects at big tech companies

Mon, 01 Jan 0001 00:00:00 +0000

How I use my terminal

Mon, 01 Jan 0001 00:00:00 +0000

How I wrote my own 'proper' programming language

Mon, 01 Jan 0001 00:00:00 +0000

How linear regression works intuitively and how it leads to gradient descent

Mon, 01 Jan 0001 00:00:00 +0000

How rqlite is tested

Mon, 01 Jan 0001 00:00:00 +0000

How to create value objects in Ruby, the idiomatic way

Mon, 01 Jan 0001 00:00:00 +0000

How to do distributed locking

Mon, 01 Jan 0001 00:00:00 +0000

How to do Distributed Locking

Mon, 01 Jan 0001 00:00:00 +0000

How to get to that European cloud?

Mon, 01 Jan 0001 00:00:00 +0000

How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog

Mon, 01 Jan 0001 00:00:00 +0000

How to post when no one is reading

Mon, 01 Jan 0001 00:00:00 +0000

How to program a text adventure in C

Mon, 01 Jan 0001 00:00:00 +0000

How to Supercharge your Java Project with Rust

Mon, 01 Jan 0001 00:00:00 +0000

How We Built a Self-Healing System to Survive a Terrifying Concurrency Bug At Netflix

Mon, 01 Jan 0001 00:00:00 +0000

HTAP is Dead

Mon, 01 Jan 0001 00:00:00 +0000

http:, ftp:, and ... dict:?

Mon, 01 Jan 0001 00:00:00 +0000

Hunting for Gems

Mon, 01 Jan 0001 00:00:00 +0000

I bought a Mac

Mon, 01 Jan 0001 00:00:00 +0000

I made a multiplayer shooter game in Lisp

Mon, 01 Jan 0001 00:00:00 +0000

I made an open-source laptop from scratch

Mon, 01 Jan 0001 00:00:00 +0000

I sent an Ethernet packet

Mon, 01 Jan 0001 00:00:00 +0000

I spent 18 years in the Linux console and I don't regret it

Mon, 01 Jan 0001 00:00:00 +0000

I use Zip bombs to Protect my Server

Mon, 01 Jan 0001 00:00:00 +0000

I used o3 to find a remote zeroday in the Linux SMB implementation

Mon, 01 Jan 0001 00:00:00 +0000

I Write Type Safe Generic Data Structures in C

Mon, 01 Jan 0001 00:00:00 +0000

I Wrote a Wasm Interpreter in C

Mon, 01 Jan 0001 00:00:00 +0000

I'm Not Mutable, I'm Partially Instantiated

Mon, 01 Jan 0001 00:00:00 +0000

Implementing a tiny CPU rasterizer

Mon, 01 Jan 0001 00:00:00 +0000

Implementing LLaMA3 in 100 Lines of Pure Jax

Mon, 01 Jan 0001 00:00:00 +0000

Improving Recommendation Systems and Search in the Age of LLMs

Mon, 01 Jan 0001 00:00:00 +0000

Improving Steam Client stability on Linux: setenv and multithreaded environments

Mon, 01 Jan 0001 00:00:00 +0000

In search of a faster SQLite

Mon, 01 Jan 0001 00:00:00 +0000

Inheritance was invented as a performance hack

Mon, 01 Jan 0001 00:00:00 +0000

Inside M4 chips: E and P cores

Mon, 01 Jan 0001 00:00:00 +0000

Introduction to CUDA Programming for Python Developers

Mon, 01 Jan 0001 00:00:00 +0000

Introduction to GGML

Mon, 01 Jan 0001 00:00:00 +0000

Introduction to Zig

Mon, 01 Jan 0001 00:00:00 +0000

IPv6 is hard

Mon, 01 Jan 0001 00:00:00 +0000

Is NixOS Truly Reproducible?

Mon, 01 Jan 0001 00:00:00 +0000

It is not a compiler error

Mon, 01 Jan 0001 00:00:00 +0000

It's hard to write code for computers, but it's even harder to write code for humans

Mon, 01 Jan 0001 00:00:00 +0000

Java for Everything

Mon, 01 Jan 0001 00:00:00 +0000

Jetstream: Shrinking the AT Proto Firehose by >99%

Mon, 01 Jan 0001 00:00:00 +0000

JTAG Hacking the Original Xbox in 2023

Mon, 01 Jan 0001 00:00:00 +0000

Kubernetes on Hetzner

Mon, 01 Jan 0001 00:00:00 +0000

Let's Learn x86-64 Assembly

Mon, 01 Jan 0001 00:00:00 +0000

Life pro tip: Oracle Linux is the best local VM for MacBooks

Mon, 01 Jan 0001 00:00:00 +0000

Linux From Scratch

Mon, 01 Jan 0001 00:00:00 +0000

Linux Kernel Exploitation CVE-2025-21756: Attack of the Vsock

Mon, 01 Jan 0001 00:00:00 +0000

Linux/4004: booting Linux on Intel 4004 for fun, art, and no profit

Mon, 01 Jan 0001 00:00:00 +0000

llama3 implemented from scratch

Mon, 01 Jan 0001 00:00:00 +0000

LLM abstraction levels inspired by fish eye lens

Mon, 01 Jan 0001 00:00:00 +0000

Lockfree Algorithms

Mon, 01 Jan 0001 00:00:00 +0000

Look ma, no bubbles designing a low-latency megakernel for Llama-1B

Mon, 01 Jan 0001 00:00:00 +0000

Low-Poly Image Generation Using Evolutionary Algorithms in Ruby

Mon, 01 Jan 0001 00:00:00 +0000

Machine Learning in Production (CMU Course)

Mon, 01 Jan 0001 00:00:00 +0000

Making the rav1d Video Decoder 1% Faster

Mon, 01 Jan 0001 00:00:00 +0000

Mastering Ruby Debugging: From puts to Professional Tools

Mon, 01 Jan 0001 00:00:00 +0000

MDN Web Docs

Mon, 01 Jan 0001 00:00:00 +0000

Memory Consistency Models: A Tutorial

Mon, 01 Jan 0001 00:00:00 +0000

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

Mon, 01 Jan 0001 00:00:00 +0000

ML in Go with a Python sidecar

Mon, 01 Jan 0001 00:00:00 +0000

Model Context Protocol Quickstart

Mon, 01 Jan 0001 00:00:00 +0000

Modern C

Mon, 01 Jan 0001 00:00:00 +0000

Motivating Adam Optimizer

Mon, 01 Jan 0001 00:00:00 +0000

Move semantics in rust, C++, and Hylo

Mon, 01 Jan 0001 00:00:00 +0000

My 71 TiB ZFS NAS After 10 Years and Zero Drive Failures

Mon, 01 Jan 0001 00:00:00 +0000

My AI Skeptic Friends Are All Nuts

Mon, 01 Jan 0001 00:00:00 +0000

My failed attempt to shrink all npm packages by 5%

Mon, 01 Jan 0001 00:00:00 +0000

My first superoptimizer

Mon, 01 Jan 0001 00:00:00 +0000

My Framework for Writing Systems in C++

Mon, 01 Jan 0001 00:00:00 +0000

My Homelab Setup

Mon, 01 Jan 0001 00:00:00 +0000

MySQL at Uber

Mon, 01 Jan 0001 00:00:00 +0000

MySQL transactions per second vs fsyncs per second

Mon, 01 Jan 0001 00:00:00 +0000

Negotiating PoE+ Power in the Pre‑Boot Environment

Mon, 01 Jan 0001 00:00:00 +0000

Never Missing the Train Again, Thanks to Rust

Mon, 01 Jan 0001 00:00:00 +0000

NixOS is a good server OS, except when it isn't

Mon, 01 Jan 0001 00:00:00 +0000

No-Panic Rust: A Nice Technique for Systems Programming

Mon, 01 Jan 0001 00:00:00 +0000

normcore-llm / anti-hype LLM reading list

Mon, 01 Jan 0001 00:00:00 +0000

Notes on Distributed Systems for Young Bloods

Mon, 01 Jan 0001 00:00:00 +0000

Nous DisTrO

Mon, 01 Jan 0001 00:00:00 +0000

Odin is a weird programming language to advertise/market for

Mon, 01 Jan 0001 00:00:00 +0000

On Designing and Deploying Internet-Scale Services

Mon, 01 Jan 0001 00:00:00 +0000

On File Formats

Mon, 01 Jan 0001 00:00:00 +0000

One year after switching from Java to Go

Mon, 01 Jan 0001 00:00:00 +0000

Open-R1: a fully open reproduction of DeepSeek-R1

Mon, 01 Jan 0001 00:00:00 +0000

OpenBSD: innovations

Mon, 01 Jan 0001 00:00:00 +0000

Operating System in 1,000 Lines

Mon, 01 Jan 0001 00:00:00 +0000

Optimization adventures: making a parallel Rust workload 10x faster with (or without) Rayon

Mon, 01 Jan 0001 00:00:00 +0000

Optimizations with Zig

Mon, 01 Jan 0001 00:00:00 +0000

Optimizing a Rust GPU matmul kernel

Mon, 01 Jan 0001 00:00:00 +0000

Optimizing a WebGPU Matmul Kernel for 1TFLOP+ Performance

Mon, 01 Jan 0001 00:00:00 +0000

Overview of cross-architecture portability problems

Mon, 01 Jan 0001 00:00:00 +0000

Ownership

Mon, 01 Jan 0001 00:00:00 +0000

Packed Data Support in Haskell

Mon, 01 Jan 0001 00:00:00 +0000

Parsing JSON in 500 lines of Rust

Mon, 01 Jan 0001 00:00:00 +0000

Parsing PDFs (and more) in Elixir using Rust

Mon, 01 Jan 0001 00:00:00 +0000

Performance of the Python 3.14 tail-call interpreter

Mon, 01 Jan 0001 00:00:00 +0000

Perhaps Rust needs defer

Mon, 01 Jan 0001 00:00:00 +0000

Pointers Are Complicated II, or: We need better language specs

Mon, 01 Jan 0001 00:00:00 +0000

Porting Terraria and Celeste to WebAssembly

Mon, 01 Jan 0001 00:00:00 +0000

Prototyping in Rust

Mon, 01 Jan 0001 00:00:00 +0000

PyTorch internals

Mon, 01 Jan 0001 00:00:00 +0000

PyTorch is dead. Long live JAX

Mon, 01 Jan 0001 00:00:00 +0000

Qwen2.5-1M: Deploy Your Own Qwen with Context Length up to 1M Tokens

Mon, 01 Jan 0001 00:00:00 +0000

Rearchitecting Core Services at X

Mon, 01 Jan 0001 00:00:00 +0000

Rearchitecting: Redis to SQLite

Mon, 01 Jan 0001 00:00:00 +0000

Reinforcement Learning – A Reference

Mon, 01 Jan 0001 00:00:00 +0000

Replacing Kubernetes with systemd

Mon, 01 Jan 0001 00:00:00 +0000

Rethinking LoRA Initialization for Faster Convergence

Mon, 01 Jan 0001 00:00:00 +0000

Rethinking The C Time API

Mon, 01 Jan 0001 00:00:00 +0000

Returning to my roots in hardware

Mon, 01 Jan 0001 00:00:00 +0000

Reverse Engineering and Dismantling Kekz Headphones

Mon, 01 Jan 0001 00:00:00 +0000

Reverse Engineering and Modding Mario Pinball Land (GBA)

Mon, 01 Jan 0001 00:00:00 +0000

Reverse Engineering Bambu Connect

Mon, 01 Jan 0001 00:00:00 +0000

Reverse Engineering Call Of Duty Anti-Cheat

Mon, 01 Jan 0001 00:00:00 +0000

Reverse Engineering for Noobs

Mon, 01 Jan 0001 00:00:00 +0000

Reverse Engineering iOS 18 Inactivity Reboot

Mon, 01 Jan 0001 00:00:00 +0000

Reverse Engineering OpenAI Code Execution to make it run C + JavaScript

Mon, 01 Jan 0001 00:00:00 +0000

Reverse engineering the 386 processor's prefetch queue circuitry

Mon, 01 Jan 0001 00:00:00 +0000

Reverse Engineering the Constants in the Pentium FPU

Mon, 01 Jan 0001 00:00:00 +0000

Reverse engineering the Sega Channel game image file format

Mon, 01 Jan 0001 00:00:00 +0000

Reverse-engineering and analysis of SanDisk High Endurance microSDXC card

Mon, 01 Jan 0001 00:00:00 +0000

Rewriting Rust

Mon, 01 Jan 0001 00:00:00 +0000

Root Shell on Credit Card Terminal

Mon, 01 Jan 0001 00:00:00 +0000

Ruby, Ractors, and Lock-Free Data Structures

Mon, 01 Jan 0001 00:00:00 +0000

RubyLLM: A delightful Ruby way to work with AI

Mon, 01 Jan 0001 00:00:00 +0000

Rust Atomics and Locks – Low-Level Concurrency in Practice

Mon, 01 Jan 0001 00:00:00 +0000

Rust by Example

Mon, 01 Jan 0001 00:00:00 +0000

Rust Dependencies Scare Me

Mon, 01 Jan 0001 00:00:00 +0000

Rust GPU – The future of GPU programming

Mon, 01 Jan 0001 00:00:00 +0000

Rust: Investigating a Strange Out-of-Memory Error

Mon, 01 Jan 0001 00:00:00 +0000

Rust's incremental compiler architecture

Mon, 01 Jan 0001 00:00:00 +0000

Rust's Sneaky Deadlock With `if let` Blocks

Mon, 01 Jan 0001 00:00:00 +0000

Rust's Ugly Syntax

Mon, 01 Jan 0001 00:00:00 +0000

rustlings

Mon, 01 Jan 0001 00:00:00 +0000

Safe C++

Mon, 01 Jan 0001 00:00:00 +0000

Self-Host & Tech Independence: The Joy of Building Your Own

Mon, 01 Jan 0001 00:00:00 +0000

Self-Hosted x86 Backend is Now Default in Debug Mode

Mon, 01 Jan 0001 00:00:00 +0000

Shadertoys ported to Rust GPU

Mon, 01 Jan 0001 00:00:00 +0000

Sharing everything I could understand about gradient noise

Mon, 01 Jan 0001 00:00:00 +0000

SIMD-friendly algorithms for substring searching

Mon, 01 Jan 0001 00:00:00 +0000

Simple CPU design

Mon, 01 Jan 0001 00:00:00 +0000

Slabs, sheaves, and barns

Mon, 01 Jan 0001 00:00:00 +0000

Smolderingly fast B-Trees

Mon, 01 Jan 0001 00:00:00 +0000

So you wanna write Kubernetes controllers?

Mon, 01 Jan 0001 00:00:00 +0000

So You Want to Build Your Own Data Center

Mon, 01 Jan 0001 00:00:00 +0000

Softmax Forever, or Why I Like Softmax

Mon, 01 Jan 0001 00:00:00 +0000

Software development topics I've changed my mind on after 10 years in the industry

Mon, 01 Jan 0001 00:00:00 +0000

Software is About Promises

Mon, 01 Jan 0001 00:00:00 +0000

Solving Passport Application with Haskell

Mon, 01 Jan 0001 00:00:00 +0000

Some Go web dev notes

Mon, 01 Jan 0001 00:00:00 +0000

Some Thoughts on Autoregressive Models

Mon, 01 Jan 0001 00:00:00 +0000

Speculative attacks on Apple CPUs (SLAP and FLOP)

Mon, 01 Jan 0001 00:00:00 +0000

SQLite on Rails: The how and why of optimal performance

Mon, 01 Jan 0001 00:00:00 +0000

Start your own Internet Resiliency Club

Mon, 01 Jan 0001 00:00:00 +0000

Stop saying Rust is Complicated

Mon, 01 Jan 0001 00:00:00 +0000

Stupid Smart Pointers in C

Mon, 01 Jan 0001 00:00:00 +0000

Succinct Data Structures

Mon, 01 Jan 0001 00:00:00 +0000

Supercharge SQLite with Ruby Functions

Mon, 01 Jan 0001 00:00:00 +0000

Susctl CVE-2024-54507: A particularly 'sus' sysctl in the XNU kernel

Mon, 01 Jan 0001 00:00:00 +0000

Sync Engines are the Future

Mon, 01 Jan 0001 00:00:00 +0000

The 3 Gurus of 90s Web Design: Zeldman, Siegel, Nielsen

Mon, 01 Jan 0001 00:00:00 +0000

The Architecture of Open Source Applications: nginx

Mon, 01 Jan 0001 00:00:00 +0000

The Best Programmers I Know

Mon, 01 Jan 0001 00:00:00 +0000

The Cost of Go's Panic and Recovery

Mon, 01 Jan 0001 00:00:00 +0000

The curious case of shell commands, or how 'this bug is required by POSIX'

Mon, 01 Jan 0001 00:00:00 +0000

The Era of Exploration

Mon, 01 Jan 0001 00:00:00 +0000

The Evolution of Caching Libraries in Go

Mon, 01 Jan 0001 00:00:00 +0000

The Fastest Mutexes

Mon, 01 Jan 0001 00:00:00 +0000

The Illustrated DeepSeek-R1

Mon, 01 Jan 0001 00:00:00 +0000

The intuition behind Rust’s borrowing rules and ownership

Mon, 01 Jan 0001 00:00:00 +0000

The little book about OS development

Mon, 01 Jan 0001 00:00:00 +0000

The Lost Decade of Small Data?

Mon, 01 Jan 0001 00:00:00 +0000

The Most Elegant Configuration Language

Mon, 01 Jan 0001 00:00:00 +0000

The Perfect Voxel Engine

Mon, 01 Jan 0001 00:00:00 +0000

The principles of database design, or, the Truth is out there

Mon, 01 Jan 0001 00:00:00 +0000

The Rust Programming Language (The Book)

Mon, 01 Jan 0001 00:00:00 +0000

The Secret Inside One Million Checkboxes

Mon, 01 Jan 0001 00:00:00 +0000

The Transformer Family Version 2.0

Mon, 01 Jan 0001 00:00:00 +0000

The Ultimate Guide to Error Handling in Python

Mon, 01 Jan 0001 00:00:00 +0000

The Ultra-Scale Playbook: Training LLMs on GPU Clusters

Mon, 01 Jan 0001 00:00:00 +0000

The world in which IPv6 was a good design

Mon, 01 Jan 0001 00:00:00 +0000

There Isn't Much Point to HTTP/2 Past The Load Balancer

Mon, 01 Jan 0001 00:00:00 +0000

Things Zig comptime Won't Do

Mon, 01 Jan 0001 00:00:00 +0000

This blog is hosted on a Nintendo Wii

Mon, 01 Jan 0001 00:00:00 +0000

This Website is Hosted on BlueSky

Mon, 01 Jan 0001 00:00:00 +0000

Time, Clocks and the Ordering of Events in a Distributed System

Mon, 01 Jan 0001 00:00:00 +0000

TinyCompiler: a compiler in a week-end

Mon, 01 Jan 0001 00:00:00 +0000

Tmux: The Essentials

Mon, 01 Jan 0001 00:00:00 +0000

Tokio + prctl = nasty bug

Mon, 01 Jan 0001 00:00:00 +0000

Too much efficiency makes everything worse: overfitting and the strong version of Goodhart's law

Mon, 01 Jan 0001 00:00:00 +0000

Towards Fearless SIMD

Mon, 01 Jan 0001 00:00:00 +0000

Towards high-performance AI compilers

Mon, 01 Jan 0001 00:00:00 +0000

Tracing the thoughts of a large language model

Mon, 01 Jan 0001 00:00:00 +0000

Training Language Models to Self-Correct via Reinforcement Learning

Mon, 01 Jan 0001 00:00:00 +0000

Traits are a Local Maxima

Mon, 01 Jan 0001 00:00:00 +0000

Transformer^2: Self-Adaptive LLMs

Mon, 01 Jan 0001 00:00:00 +0000

Tune Llama3 405B on AMD MI300x (our journey)

Mon, 01 Jan 0001 00:00:00 +0000

Turning the database inside-out

Mon, 01 Jan 0001 00:00:00 +0000

Two Years of Rust

Mon, 01 Jan 0001 00:00:00 +0000

Type-erased generic functions for C: A modest non-proposal

Mon, 01 Jan 0001 00:00:00 +0000

Under the microscope: Ecco the Dolphin — Defender of the Future

Mon, 01 Jan 0001 00:00:00 +0000

Understanding Memory Management, Part 5: Fighting with Rust

Mon, 01 Jan 0001 00:00:00 +0000

Understanding Round Robin DNS

Mon, 01 Jan 0001 00:00:00 +0000

Understanding the Tilde Operator in Python

Mon, 01 Jan 0001 00:00:00 +0000

Unlocking Ractors: class instance variables

Mon, 01 Jan 0001 00:00:00 +0000

Using AI for Coding: My Journey with Cline and Large Language Models

Mon, 01 Jan 0001 00:00:00 +0000

Using Claude Code to modernize a 25-year-old kernel driver

Mon, 01 Jan 0001 00:00:00 +0000

Using Rust in Non-Rust Servers to Improve Performance

Mon, 01 Jan 0001 00:00:00 +0000

Using the most unhinged AVX-512 instruction to make the fastest phrase search algo

Mon, 01 Jan 0001 00:00:00 +0000

Vision Transformer in pure JAX

Mon, 01 Jan 0001 00:00:00 +0000

Visual guide to SSH tunneling and port forwarding

Mon, 01 Jan 0001 00:00:00 +0000

WASM will replace containers

Mon, 01 Jan 0001 00:00:00 +0000

Ways to use torch.compile

Mon, 01 Jan 0001 00:00:00 +0000

We Cracked a 512-Bit DKIM Key for Less Than $8 in the Cloud

Mon, 01 Jan 0001 00:00:00 +0000

We were wrong about GPUs

Mon, 01 Jan 0001 00:00:00 +0000

Weird Lexical Syntax

Mon, 01 Jan 0001 00:00:00 +0000

What Every Developer Should Know About GPU Computing

Mon, 01 Jan 0001 00:00:00 +0000

What Every Programmer Should Know About Memory

Mon, 01 Jan 0001 00:00:00 +0000

What is an Invariant?

Mon, 01 Jan 0001 00:00:00 +0000

What is System Programming, Really?

Mon, 01 Jan 0001 00:00:00 +0000

What is Vim?

Mon, 01 Jan 0001 00:00:00 +0000

What Would a Kubernetes 2.0 Look Like

Mon, 01 Jan 0001 00:00:00 +0000

What's OAuth2 Anyway?

Mon, 01 Jan 0001 00:00:00 +0000

Whence newline?

Mon, 01 Jan 0001 00:00:00 +0000

Why and how we are migrating many of our servers from Linux to the BSDs

Mon, 01 Jan 0001 00:00:00 +0000

Why Distributed Computing?

Mon, 01 Jan 0001 00:00:00 +0000

Why do we have both CSRF protection and CORS?

Mon, 01 Jan 0001 00:00:00 +0000

Why does man print 'gimme gimme gimme' at 00:30?

Mon, 01 Jan 0001 00:00:00 +0000

Why Haskell?

Mon, 01 Jan 0001 00:00:00 +0000

Why I love Rust for tokenising and parsing

Mon, 01 Jan 0001 00:00:00 +0000

Why I rewrote my Rust keyboard firmware in Zig: consistency, mastery, and fun

Mon, 01 Jan 0001 00:00:00 +0000

Why I stopped everything and started writing C again

Mon, 01 Jan 0001 00:00:00 +0000

Why Ruby on Rails still matters

Mon, 01 Jan 0001 00:00:00 +0000

Why Rust nextest is process-per-test

Mon, 01 Jan 0001 00:00:00 +0000

World's First MIDI Shellcode

Mon, 01 Jan 0001 00:00:00 +0000

Writing a simple pool allocator in C

Mon, 01 Jan 0001 00:00:00 +0000

Writing an LLM from scratch, part 8 -- trainable self-attention

Mon, 01 Jan 0001 00:00:00 +0000

Writing C for cURL

Mon, 01 Jan 0001 00:00:00 +0000

Writing into unitialized buffers in Rust

Mon, 01 Jan 0001 00:00:00 +0000

Writing secure Go code

Mon, 01 Jan 0001 00:00:00 +0000

Wrong ways to use the databases, when the pendulum swung too far

Mon, 01 Jan 0001 00:00:00 +0000

XOR

Mon, 01 Jan 0001 00:00:00 +0000

Zen 5's AVX-512 Frequency Behavior

Mon, 01 Jan 0001 00:00:00 +0000

Zig Bits 0x4: Building an HTTP client/server from scratch

Mon, 01 Jan 0001 00:00:00 +0000

Zig Build System Internals

Mon, 01 Jan 0001 00:00:00 +0000

Zig defer Patterns

Mon, 01 Jan 0001 00:00:00 +0000

Zig running on the Nintendo 64

Mon, 01 Jan 0001 00:00:00 +0000

Zig's new LinkedList API (it's time to learn @fieldParentPtr)

Mon, 01 Jan 0001 00:00:00 +0000

zlib-rs is faster than C

Mon, 01 Jan 0001 00:00:00 +0000

Simone Bellavia's Web Page

Defeating Nondeterminism in LLM Inference

iPhone Air

npm debug and chalk packages compromised

New Weblog

Using Claude Code to modernize a 25-year-old kernel driver

Fil-C, a memory safe implementation of C and C++

Morphing the Divina Commedia into byte tokens with Zig

The more you fuck around, the more you find out

I am Sicilian and I support Strait of Messina Bridge

Helm: what I like and dislike

What I like

What I don’t like

The Alternative

Recurrent Neural Networks (RNNs) explained

From Feedforward to RNNs

RNNs Key Equations

Design Patterns and Principles

Computational Graph and Backpropagation

Vanishing/Exploding Gradient Dilemma

Ending

Sources

Applying Deep Learning to detect Rhegmatogenous Retinal Detachment

Retinal detachment: types and consequences

Machine Learning and RRD early diagnosis

Bibliography

A generic introduction to Clinical Decision Support Systems

CDSS: an introduction

A Clinical Decision Support System for Prevention of Venous Thromboembolism: Effect on Physician Behavior

Bibliography

A (Long) Peek into Reinforcement Learning

(Re)Using rustc components in gccrs

1 week with David Beazley and SICP

418 I'm a teapot

5000x faster CRDTs: An Adventure in Optimization

6 Techniques I Use to Create a Great User Experience for Shell Scripts

7 Common Mistakes in Architecture Diagrams

7 Databases in 7 Weeks for 2025

A calculator app? Anyone could make that

A computer can never be held accountable

A generalised solution to distributed consensus

A half-hour to learn Rust

A short introduction to RLHF and post-training focused on language models

A Vanity VNC server (or Joke over RFB)

A year of funded FreeBSD

AAA gaming on Asahi Linux

About

Connect

Adding syntax to the CPython Interpreter

Advanced Python Features

AI for Real-time Fusion Plasma Behavior Prediction and Manipulation

Algorithm = Logic + Control

All the data can be yours

An almost pointless exercise in GPU optimization

An analysis of DeepSeek's R1-Zero and R1

An Illustrated Proof of the CAP Theorem

An Intro to DeepSeek's Distributed File System

An Intuitive Explanation of Sparse Autoencoders for LLM Interpretability

An Ultra Opinionated Guide to Reinforcement Learning

Analyzing IPv4 trades with gnuplot

Analyzing the codebase of Caffeine: a high performance caching library

API design note: Beware of adding an “Other” enum value

Apple’s Darwin OS and XNU Kernel Deep Dive

ARC-AGI without pretraining

Archive

Are We Web Yet?

Artificial photosynthesis directed toward organic synthesis

Async Rust can be a pleasure to work with (without Send + Sync + 'static)

Atomics And Concurrency

Attacking My Landlord's Boiler

Attacking Unix Systems Via CUPS, Part I

AVX Bitwise ternary logic instruction busted

B-trees and database indexes

Bayleaf Wireless Keyboard

Beating cuBLAS in Single-Precision General Matrix Multiplication

Blogging in the age of AI

Build a Tiny Certificate Authority For Your Homelab

Build Your Own Text Editor

Building a full-text search engine in 150 lines of Python code

Building an AI Server on a budget