Keep up-to-date with the latest news of this category.
Note - neverland
|
Search My Site
Wilhelm von Humboldt on “the individual man, and the highest ends of his existence” (via Henrik Karlsson):
The true end of Man, or that which is prescribed by the eternal and immutable dictates of reason, and not suggested by vague and transient desires, is the highest and most harmonious developme
[...]
Weeknotes #348 — Born ugly – Andrew Doran
|
Search My Site
There was something about Monday. You could just feel it. I got to the office, checked my diary, and for the first time in months I saw a meeting-free afternoon stretched out before me, full of pos…
How Big is TBPN?
|
Search My Site
Communications expert and an amateur researcher of productivity.
Self-hosting versus lots of small indieweb providers – This day’s portion
|
Search My Site
To self-host or to get a small, indie service to host for you? Sometimes self-hosting might be the right option.
Malaffare · mzll
|
Search My Site
‘Se Natale nun venesse ‘cchiù,
chi chiagnesse ‘e lacreme ‘cchiù amare?
We Are Open Access And We’re Reclaiming Knowledge Together
|
CORE
This International Open Access Week, the global research community is asking a vital question: Who owns our knowledge? At CORE (COnnecting REpositories), our answer is clear and unapologetic:We all do. For over a decade, CORE has stood at the forefront of the open access movement, not as a passive
[...]
Co-Designing the Next 15 Years: Highlights from CORE’s Board of Supporters Meeting
|
CORE
Twice each year, CORE’s Board of Supporters (BoS) meeting brings together our members, partners, and collaborators to exchange ideas, share progress, and shape the priorities that guide our development. The October 2025 meeting marked yet another successful, well-attended session and the second of t
[...]
CORE Founder to Present at Yale University CS Talk Series
|
CORE
We’re pleased to share that Professor Petr Knoth, Founder and Head of CORE (core.ac.uk) and Professor of Data Science at The Open University’s Knowledge Media Institute, will be giving a Computer Science Talk at Yale University on 13 October 2025. In his talk, titled “COnnecting REpositories (CORE)
[...]
Discovering History, Powered by CORE
|
CORE
Every research article, thesis, and working paper, accessible to anyone, anywhere. That’s the reality CORE has been building for 15 years, making knowledge discoverable and usable for students, educators, researchers, and curious minds across the globe. “It’s extraordinary to witness how CORE (COnne
[...]
Language Support for Marginalia Search
|
Marginalia Search
One of the big ambitions for the search engine this year has been to enable searching in more languages than English, and a pilot project for this has just been completed, allowing experimental support for German, French and Swedish.
These changes are now live for testing, but with an extremely smal
[...]
From Principles to Practice – A UKCORR webinar
|
CORE
On 25 September 2025, CORE was invited by the UK Council of Open Research and Repositories (UKCORR) to present a webinar for their members, titled “From Principles to Practice: Making Repository Content Discoverable with the CORE Data Provider’s Guide.” The session focused on one of the most pressi
[...]
Introducing Kagi News
|
Kagi
*A comprehensive daily press review with global news.
Silent No Longer
|
Mwmbl
This article was originally posted on my personal blog on 2nd August 2025.
Dear friends,
I am constantly besieged by the feeling that I am not doing enough. A
genocide is unfolding before our eyes. I
feel the guilt with every mother holding a starving child,
with every doctor killed,
with every jour
[...]
Through the Omenpaths added, plus English printed text support
|
Scryfall
Find out how Scryfall is handling data entry for Through the Omenpaths.
Mojeek is Not an Answer Engine
|
Mojeek
Mojeek is about Search. AI is not the Answer.
Building a Better Web Takes a Village: Introducing Kagi Specials
|
Kagi
-------------------------------------------------------------------
Our curation of privacy-first projects worth knowing and supporting
-------------------------------------------------------------------
Nothing excites us more than bringing together people who believe the web should respect its us
[...]
The many benefits of paying for search
|
Kagi
“Wait, you PAY for search?” We get this reaction a lot about Kagi.
Faster Index I/O with NVMe SSDs
|
Marginalia Search
The Marginalia Search index has been partially rewritten to perform much better, using new data structures designed to make better use of modern hardware. This post will cover the new design, and will also touch upon some of the unexpected and unintuitive performance characteristics of NVMe SSDs whe
[...]
Update July 2025
|
Mwmbl
It’s been so long since we’ve had an update on the blog that people
are often confused as to whether the project is still active. It
definitely is! I’m just bad at updating the blog. Most of the updates
have been going to the Matrix channel.
So an update is long overdue.
Most of the recent work has
[...]
Scryfall + Cardmarket
|
Scryfall
Scryfall is proud to announce that we’ve entered into a new partnership with Cardmarket. In the coming weeks, you should see a lot more richness in our available data for European pricing.
Finding Dead Websites
|
Marginalia Search
As some of the work planned for Marginalia Search this year has been progressing a bit faster than anticipated, there was time to implement an unplanned change.
This post details the implementation of a system for detecting when servers are online, to avoid serving dead links and improve data qualit
[...]
Celebrating 50K users with Kagi free search portal, Kagi for libraries, and more...
|
Kagi
Just last week, we celebrated three years since Kagi was launched.
Kagi status update: First three years
|
Kagi
Three years ago, Kagi officially launched with a splash on popular technology forum Hacker News (to which we are eternally grateful for helping put Kagi on the map).
Profiling Websites
|
Marginalia Search
The most recent change to the search engine is a system that profiles websites based on their rendered DOM. The goal is identifying advertisements, trackers, nuisance popovers, and similar elements.
The search engine already tries to do this, but isn’t very good at it because it’s only looking at st
[...]
PDF to Text, a challenging problem
|
Marginalia Search
The search engine has recently gained the ability to index the PDF file format. The change will deploy over a few months.
Extracting text information from PDFs is a significantly bigger challenge than it might seem. The crux of the problem is that the file format isn’t a text format at all, but a gr
[...]
A Secret Web
|
clew
The web is mind-bogglingly huge; let's look at how personal websites can thrive and interact despite that.
Searchception
|
Mojeek
The illusion created by the merging of browsers with search engines.
Introducing is:default
|
Scryfall
Scryfall now offers a search term for cards that use the default frame. That is, cards that aren't showcases, borderless, extended art, and so on.
Learning and Sharing about Alternatives to Big Tech
|
Mojeek
What you can do once you've decided to avoid Big Tech?
Errata Notice: Aetherdrift
|
Scryfall
In early February 2025, Gatherer released a sweep of nearly 12,000 card Oracle text updates.
Leaving Big Tech
|
Mojeek
A range of tools available to help you kick Big Tech companies out of your life...
The MTG Wiki is now at mtg.wiki, hosted by Scryfall
|
Scryfall
The Magic: The Gathering wiki is moving to mtg.wiki and will no longer be hosted on Fandom.
Topical Custom Search Engines
|
Mojeek
How the Mojeek API can be used to build topical search engines...
The New Ariadne Architecture
|
clew
While on a fourteen-hour international flight, I finally managed to come up with an architecture for Clew's web crawler that I'm happy with. Here's the run-down.
Redesigning the Index
|
clew
I believe I've reached a point in Clew's development where, armed with the knowledge I've acquired from months of crawling sites and using that data to search the index, it's time to wipe the index and start over.
Re-ranking search results on the client side in Rust
|
Mwmbl
By many measures, Mwmbl is doing great. We have
indexed over half a billion pages, we have over 4,000 registered
users, and over 30,000 curations from those users. Our volunteers are
crawling around 5 million pages a day.
But the score that I care about most right now is
NDCG. This
measures the qual
[...]
I'm Losing Faith in BM25
|
clew
The current way that result ranking works in Clew is very different from what I want.
Welcome to the Madness
|
clew
In which we launch the insanity that is this development blog for Clew.
Indexing a billion pages
|
Mwmbl
It’s two years since we launched Mwmbl, the open
source, non-profit search engine, on Boxing Day 2021. A good time to
take stock of where we are and where we’re going.
We’ve indexed over 100 million pages
Thanks to our volunteers, who crawl the web using the Firefox
extension
and command line script
[...]
Why is curation of web search results important?
|
Mwmbl
Mwmbl is the first search engine to allow users
to change the search results:
You can add results, delete them, and rerank them. The changes you
made are saved instantly to the index and will be shown to other users
who run the same query.
But what is the point of users changing search results? Th
[...]
Description
Websites that help you find a specific information on various topics.
Suggest a website
Surf this category
News
Recent additions
All feeds
All websites