2026-01-21


Thoughts


Let us suppose that you show your watch to a friend, and say to him: "Must not the individual who made this watch, and arranged the works, have understood his business very well? Must he not be a very clever fellow, and posses a first-rate head-piece?" Now suppose they were to reply: "Oh, nonsense! The watch made itself!" Should you not gaze fixedly at him, and make some such remark as the following: "My good friend, if you are in earnest, and really mean what you say, there must be a screw loose in your upper story." And you would be quite justified in thus addressing him. -- The Young Man's Guide, Father Lasance


In my random searching I came across an interesting post. In which the author posits that the English language being the "lingua franca" of the internet is causing some interesting consequences. One of those being that Western concerns are run through the fabric of the rest of the world as the news and underpinnings of most English sites can not keep that out of the discourse. Also it is an interesting thought point that as they point out there is a view of "The World" is in shambles when really it is the Western world that is struggling and many places might be getting along quite fine on a local level. It was a thought provoking read, for me anyway.



Tech


Messing with the gemini crawler recently and had to add a few features. The first was that I needed to add a capability to not go down cgi-bin black holes. An example of this was running across one of the wikipedia to gemini translations out there. Excellent to know exists, but something that is currently of low content value for the crawler. Thus I added a flag to just skip anything that comes across as cgi-bin. Later I think I came up with a better implementation where I now have an internal blacklist that is formatted similar to a robots.txt file. This came up because I was running across a capsule that was the information in the musicbrainz database. Again excellent information, but an enourmous pit of information that I think currently has low value for my crawler.



Spent a bit of time working with lnav, I had forgotten what a great tool it is. The challenge has been getting all of the log lines to parse properly and capture what is needed. Looks as though I am getting past the hump at this point in time and everything is starting to parse.


Developers, logs are an API. There is someone that is going to parse your logs looking for errors and even to get an idea if something is running as it should. Changing words in your messages will break a parser someone has created. Please also use a standard format. There are suites out there built around ingesting log lines and most of them will automatically pull in and do most of the work if you just use an existing standardized format.