Search results for: " sitesearch Cloudflare Algolia"
Found 5 relevant passage(s) from hoeijmakers.net
────────────────────────────────────────────────────────────
[1] "Back in the Search Arena, This Time With Vectors"
URL: https://hoeijmakers.net/back-in-the-search-arena-this-time-with-vectors/
Published: 2026-05-26
Relevance score: 0.728
Three years ago I wrote about site search as if it were a solved problem. Collect, index, serve. It made sense at the time. Ghost had a search widget, it worked well enough, and I moved on. What I did not register then was the ceiling. Ghost's built-in search indexes titles and excerpts only. The body of every post, where the actual thinking lives, stays invisible. For a site with a handful of posts, that is fine. For a site with six hundred, it means your own archive is mostly unsearchable. I knew this. I just never had the right combination of tools, time, and nerve to do something about it. The return This spring, that changed. I came back to the problem with Claude Code, which turned out to be the missing piece. Not because it writes perfect code, but because it removes the activation energy. The setup that used to require a developer and a week of back-and-forth now takes an afternoon and a working session. I integrated Algolia. Fast, full-text, generous free tier. It indexes everythin
────────────────────────────────────────────────────────────
[2] "Site Search explained"
URL: https://hoeijmakers.net/website-search/
Published: 2021-10-02
Relevance score: 0.720
The third step is to offer a search box and page to do the actual searching. In the simplest variant, it is just entering search words and seeing a list of results. But you can go much further, and there are a number of possibilities to improve the experience of searching. You might want to have filters or facets, autosuggest, spelling corrections, highlighting of results in context, recommendations, or a way to push certain results you deem more valuable.
Natural Language Processing (NLP) might also come in here, so the person searching can type in everyday language. Basically, this third step is the goal, and if the first two are done properly, the visitor should be able to get what is necessary and maybe even be inspired to look further because of recommendations. Whether there is an enticing User Experience depends on how well the interactions are presented and how well they work. The site search on this website In the past, I mostly worked with Google Search Application and a solution based on Elasticsearc
────────────────────────────────────────────────────────────
[3] "Delegating Past Your Own Ceiling"
URL: https://hoeijmakers.net/delegating-past-your-own-ceiling/
Tags: AI in Practice, #Ghost
Published: 2026-04-19
Relevance score: 0.716
This one goes a level deeper than usual. If you run a self-hosted blog or manage your own web infrastructure, stay with it. The payoff is real. Cloudflare, the network and security layer that sits in front of most of this blog, has been part of my setup since the beginning. DNS, CDN, basic security. The layer in front of Ghost, my publishing platform, that I configured once and mostly left alone. Not because there was nothing more to do, but because the gap between what Cloudflare can do and what I could confidently operate was wide enough to leave alone. That gap closed recently. Not because Cloudflare got simpler, but because I stopped being the one operating it. The ceiling Every tool has a capability ceiling for any given user. For most practitioners running a Ghost site, Cloudflare's ceiling sits somewhere around DNS and caching. The dashboard is capable but not intuitive.
Workers, R2, analytics at the edge: these are real features with real value, but they require a mental model
────────────────────────────────────────────────────────────
[4] "When Bots Become Readers: Publishing in the Age of AI Crawlers"
URL: https://hoeijmakers.net/when-bots-become-readers/
Tags: AI in Practice
Published: 2025-10-09
Relevance score: 0.711
I listened to a conversation between Azeem Azhar and Matthew Prince, the co-founder of Cloudflare. Prince described how a large share of today’s internet traffic no longer comes from people but from machines: bots that index, scrape, and simulate human behaviour. I enjoyed the discussion. It was clear, grounded, and made me look at my own practice differently. Because I host my Ghost site with Cloudflare as DNS, I realised that every request for my blog also passes through their network. In other words: I, too, publish into a space where machines are part of the audience. 🔮 How AI is breaking and rebuilding the internet economy (47 mins) | From search to answers | With Cloudflare co-founder & CEO Matthew Prince | Exponential View | Azeem Azhar I really enjoyed this episode. Matthew Prince became a tangible person when he said he and his wife own the local newspaper in their hometown. Quick takeaways Much of today’s web traffic is automated and driven by bots that fe
────────────────────────────────────────────────────────────
[5] "Guests That Should Behave"
URL: https://hoeijmakers.net/guests-that-should-behave/
Tags: AI in Practice, #Ghost
Published: 2026-04-19
Relevance score: 0.703
The traffic spikes in Plausible (Web analytics) made no sense. Peak after peak, no referral source, no pattern I recognised. Bots, clearly, but the kind that arrive carrying a real browser, behaving like a human long enough to slip past lightweight analytics. Not a security incident. More like guests who don't knock. That framing stuck with me as I worked through the fix. Bots are guests.
Most of them are welcome. The question is which ones, and on what terms. Welcome and unwanted The web has always had crawlers. Search engines, archivers, feed readers: automated visitors that make the open web function. I have no objection to those. What changed over the past year or two is volume and intent. By mid-2025, crawling for AI model training accounted for nearly 80% of all AI bot activity on Cloudflare's network. Many of those crawlers identify themselves honestly. Some don't, cycling through residential IP addresses and real browsers to blend in. The ones showing up in my Plausible dashboa