Technology


Switzerland mandates all software developed for the government be open sourced (calckey.world)

submitted 10 months ago by tek@calckey.world to c/technology@lemmy.world



Switzerland mandates software source code disclosure for public sector: A legal milestone

https://joinup.ec.europa.eu/collection/open-source-observatory-osor/news/new-open-source-law-switzerland

@technology@lemmy.world

#tech #libre


[–] Dave@lemmy.nz 1 points 10 months ago (1 children)

Ah that sounds really interesting! Does it scale OK? I guess you could index at a word level and filter quite quickly for quick searches, but it seems you're going to have to store the full text of every website?

[–] vk6flab@lemmy.radio 2 points 10 months ago (1 children)

You store just the word count for each word on each URL.

The search is pretty trivial in database terms since you don't need to do any wildcard or like matching.
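To make that concrete, here is a minimal sketch of a word-count index in plain Python. It is only an illustration of the idea, not the original project's code: the names `tokenize`, `build_index`, and `search`, the regex tokenizer, and the ranking by summed counts are all assumptions made for the example.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    # Assumed tokenizer: lowercase runs of letters and digits.
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    # pages maps URL -> page text (hypothetical input shape).
    # The index maps word -> {url: count}; the page text itself is not kept.
    index = defaultdict(dict)
    for url, text in pages.items():
        for word, count in Counter(tokenize(text)).items():
            index[word][url] = count
    return index


def search(index, query):
    # Exact word lookups only: no wildcard or LIKE-style matching.
    words = tokenize(query)
    if not words:
        return []
    postings = [index.get(word, {}) for word in words]
    # Keep only URLs that contain every query word.
    common = set(postings[0])
    for posting in postings[1:]:
        common &= set(posting)
    # Assumed ranking: highest summed word count first, ties broken by URL.
    scored = [(sum(p[url] for p in postings), url) for url in common]
    return [url for _, url in sorted(scored, key=lambda item: (-item[0], item[1]))]
```

Each query word is a direct key lookup, which is why no pattern matching is needed. What gets stored grows with the number of distinct words per page rather than with the full page text.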

[–] Dave@lemmy.nz 1 points 10 months ago (1 children)

Ah of course!

I guess one of the things Google originally solved was that the internet is full of crap and not all sites should have equal weighting. With AI spam sites these days, you'd probably also need a method of weighting results?

[–] vk6flab@lemmy.radio 2 points 10 months ago (1 children)

We never got far enough to test that kind of issue. I've been reimplementing it locally to search through employment advertising, but I'm not at a point where I'd be able to test such a thing.

The original implementation used a data store written by another team member and it made the original project much too complicated.

Today I'd likely use duckdb to implement it. My local version uses text files for a proof of concept implementation.
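For anyone curious what a text-file proof of concept could look like, here is a rough sketch using only the Python standard library. The file layout (one tab-separated `word`, `url`, `count` row per line) and the function names `write_counts` and `lookup` are hypothetical, chosen for the example; they are not a description of the actual local version.

```python
import csv
from collections import Counter


def write_counts(path, pages):
    # pages maps URL -> page text (hypothetical input shape).
    # Writes one "word<TAB>url<TAB>count" row per distinct word per URL.
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle, delimiter="\t")
        for url, text in pages.items():
            counts = Counter(text.lower().split())
            for word, count in sorted(counts.items()):
                writer.writerow([word, url, count])


def lookup(path, word):
    # Linear scan with exact matching on the word column.
    hits = []
    with open(path, newline="", encoding="utf-8") as handle:
        for row_word, url, count in csv.reader(handle, delimiter="\t"):
            if row_word == word.lower():
                hits.append((url, int(count)))
    # Highest count first, ties broken by URL.
    return sorted(hits, key=lambda hit: (-hit[1], hit[0]))
```

A linear scan like this is fine for a small corpus. The same three columns would map directly onto a single database table if the data outgrew flat files.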

[–] Dave@lemmy.nz 1 points 10 months ago

It sounds like a really cool project regardless!