📓 projects/misc.md by @karlicoss ☆

* jdoe [[jdoe]]

related #exports #hpi:
J. Doe (Joe Doe? Jane Doe?) is a made-up person with a narrative, used to showcase different quantifiedself/memex tools

[2021-01-22] have some info on ideas page

ask where to get sample API data? [[jdoe]]

examples of outputs [[dataliberation]] [[exports]] [[jdoe]]

jq 'walk(if type == "object" or type == "array" then . else type end)' pocket_20190923T210004Z.json

ugh, still might leak some private data…

jq 'walk(if type == "object" then del(.profile, .account, .groups) else . end)' pocket_20190923T210004Z.json
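The two jq filters above (replace leaf values with their type, drop sensitive keys) can be sketched in Python as well. This is only an illustration: `redact` and `drop_keys` are made-up names, the sample document is invented, and the type labels are Python's rather than jq's.

```python
import json

def redact(value, drop_keys=frozenset()):
    """Keep the structure of a JSON value, but replace every leaf with its type name."""
    if isinstance(value, dict):
        # Drop unwanted keys entirely, recurse into the rest.
        return {k: redact(v, drop_keys) for k, v in value.items() if k not in drop_keys}
    if isinstance(value, list):
        return [redact(v, drop_keys) for v in value]
    # Leaf value: keep only its type name.
    return type(value).__name__

# Invented sample, not a real Pocket export.
sample = json.loads(
    '{"profile": {"email": "x@example.com"},'
    ' "items": [{"title": "hi", "time": 1569272404, "fav": false}]}'
)
redacted = redact(sample, drop_keys={"profile", "account", "groups"})
print(json.dumps(redacted))
```

Keys are kept verbatim, so anything private stored in key names would still leak.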

[2020-04-04] jackreid.xyz/pocket.json at master · JackWReid/jackreid.xyz [[jdoe]]

https://github.com/JackWReid/jackreid.xyz/blob/master/data/pocket.json
hmm nice pretty extensive pocket export

[2020-12-11] Bibliodemos’ Docs :: Bibliodemos’ Docs [[jdoe]]

A demo persona

[2020-09-02] AnandChowdhary/life: 🧬 All life data — location, health, work, play, and more — open sourced [[jdoe]]

[2020-04-24] saladmanssss/salad [[jdoe]] [[takeout]]

[2020-04-24] ctf/forensics-CryptoDIY at master · ANSSI-FR/ctf [[jdoe]] [[takeout]]

[2020-04-24] eh, only mail. not so interesting

[2020-10-05] ecopass/5kLineLocationHistory.json at 80f1afa346136d80c6319a1770501bff5bf8e25f · eco-pass/ecopass [[jdoe]] [[location]]

[2020-02-10] Python package, which helps generate big volumes of fake data for a variety of purposes in a variety of languages. /r/coolgithubprojects [[jdoe]]

>What could it be used for?
For generating placeholder data, bootstrapping database with anonymized data, generating JSON data, creating fake API endpoints, etc.

pdfannots has some test pdfs [[hpi]] [[jdoe]]

could share podcastaddict dbs? [[jdoe]] [[bleanser]]

[2021-04-15] file:places.sqlite - Sourcegraph [[jdoe]] [[hpi]] [[promnesia]]

wow, searching on sourcegraph is actually super nice…

* kython [[kython]]

https://beepb00p.xyz/kython.html

use sqlite like json path? https://www.sqlite.org/json1.html
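A minimal sketch of what that could look like from Python, assuming the bundled SQLite was built with the JSON functions (true for most recent builds, but not guaranteed). The table and column names are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (body TEXT)")
conn.execute(
    "INSERT INTO events VALUES (?)",
    ('{"user": {"name": "jdoe"}, "tags": ["a", "b"]}',),
)

# json_extract takes a JSON document and a path such as $.user.name or $.tags[1]
row = conn.execute(
    "SELECT json_extract(body, '$.user.name'), json_extract(body, '$.tags[1]') FROM events"
).fetchone()
print(row)
```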

checker that link is dead [[kython]]

[2019-10-27] but need something more finetuned? like for DELETED reddit posts…

get rid of remaining JsonState methods.. I think it’s only spotisync? [[kython]]

date parsing libraries [[kython]] [[python]] [[datetime]]

started something in kython.datetime module

[2018-12-30] recurrent in workouts ??

maybe, blog or do some sort of benchmark??

grep my code for usages of this

run search indexer against my code?…

https://github.com/scrapinghub/dateparser – looks decent, can specify hints

using in kython and openrent and my porg thing
ok, capable of kindle dates, kinda like Wednesday, November 22, 2017 9:11:56 PM
couldn’t handle insta photo…
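As a baseline for comparing libraries, the kindle-style date above can also be parsed with plain `datetime.strptime` once the format is known. This is a stdlib sketch, not something dateparser does internally; `KINDLE_FORMAT` and `parse_kindle_date` are made-up names, and `%A`/`%B`/`%p` assume an English (default C) locale.

```python
from datetime import datetime

# Assumed format for strings like 'Wednesday, November 22, 2017 9:11:56 PM'
KINDLE_FORMAT = "%A, %B %d, %Y %I:%M:%S %p"

def parse_kindle_date(s: str) -> datetime:
    # Raises ValueError if the string does not match the format exactly.
    return datetime.strptime(s, KINDLE_FORMAT)

parsed = parse_kindle_date("Wednesday, November 22, 2017 9:11:56 PM")
print(parsed.isoformat())
```

The obvious downside versus a fuzzy parser is that every source needs its own format string.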

https://github.com/bear/parsedatetime ??

dateutil.parse?

datefinder [[python]] [[parsing]] [[datetime]]

from datefinder import find_dates # type: ignore
# TODO ugh find_dates is shit
# can't even recognise 20180304
def my_find_dates(f):
    f = f.replace('_', '-') # just in case of _ separators
    f = f.replace('time', '') # apparently find_dates gets confused when it sees 'time'
    return list(find_dates(f))
# could recognise endomondo-2018-04-10 though
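For compact dates in filenames like 20180304, a small regex fallback might be enough. A sketch only, using the stdlib instead of datefinder; `find_compact_dates` is a hypothetical helper and it assumes dates are written year-month-day with optional `-` or `_` separators.

```python
import re
from datetime import date
from typing import List

# Four-digit year, two-digit month and day, optionally separated by - or _,
# and not embedded in a longer run of digits.
_DATE_RE = re.compile(r"(?<!\d)(\d{4})[-_]?(\d{2})[-_]?(\d{2})(?!\d)")

def find_compact_dates(name: str) -> List[date]:
    found = []
    for m in _DATE_RE.finditer(name):
        year, month, day = (int(g) for g in m.groups())
        try:
            found.append(date(year, month, day))
        except ValueError:
            # Eight digits that are not a valid calendar date, skip them.
            continue
    return found

print(find_compact_dates("20180304"))
print(find_compact_dates("endomondo-2018-04-10"))
```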

hmmm https://github.com/scrapinghub

[2020-01-13] functools — Higher-order functions and operations on callable objects — Python 3.8.1 documentation [[kython]]

https://docs.python.org/3.8/library/functools.html

@functools.cached_property(func)

nice!
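A quick sketch of how it behaves (Python 3.8+): the decorated method runs once per instance and the result is stored on the instance. The `Export` class and its fields are made up for the example.

```python
from functools import cached_property

class Export:
    def __init__(self, lines):
        self.lines = lines
        self.parse_count = 0  # counts how often the expensive part actually runs

    @cached_property
    def parsed(self):
        self.parse_count += 1
        return [line.upper() for line in self.lines]

export = Export(["a", "b"])
first = export.parsed   # computed here
second = export.parsed  # served from the instance, no recomputation
print(export.parse_count)
```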

* kobuddy [[kobuddy]]

My library for accessing #kobo data
https://github.com/karlicoss/kobuddy

why it’s better for me [[exports]]

automatic backups
old databases
access from other tools

bug?

TODO tie breaking
-- 2019-03-24 11:01:44: finished
-- 2019-03-24 11:01:44: finished reading What is Life? by Erwin Schroedinger. total time spent undefined minutes
-- 2019-03-24 11:01:44: left What is Life? by Erwin Schroedinger: 99%, read for 80 mins
-- 2019-03-24 11:09:31: finished
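One possible way to break ties between events that share a timestamp is a secondary sort key by event kind. A sketch under assumptions: the `(timestamp, kind)` tuples and the `KIND_ORDER` priorities are hypothetical and not kobuddy's actual data model; the timestamps are taken from the log above.

```python
from datetime import datetime

# Hypothetical priorities: lower sorts first when timestamps are equal.
KIND_ORDER = {"left": 0, "finished": 1}

events = [
    (datetime(2019, 3, 24, 11, 1, 44), "finished"),
    (datetime(2019, 3, 24, 11, 1, 44), "left"),
    (datetime(2019, 3, 24, 11, 9, 31), "finished"),
]

def sort_key(event):
    dt, kind = event
    # Timestamp first, then the kind priority as a deterministic tie breaker.
    return (dt, KIND_ORDER[kind])

ordered = sorted(events, key=sort_key)
print([kind for _, kind in ordered])
```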

TODO for goodreads scripts: get all titles with at least one 'finished' event
match them via goodreads api and check that they are marked as read
perhaps 'started reading' as well?
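The first step (collecting titles with at least one 'finished' event) could look roughly like this. The `Event` tuple and the sample events are made up for illustration; matching against the goodreads api is left out.

```python
from typing import Iterable, NamedTuple, Set

class Event(NamedTuple):
    # Hypothetical minimal event shape: book title plus event kind.
    title: str
    kind: str

def finished_titles(events: Iterable[Event]) -> Set[str]:
    """Titles that have at least one 'finished' event."""
    return {e.title for e in events if e.kind == "finished"}

sample_events = [
    Event("What is Life?", "left"),
    Event("What is Life?", "finished"),
    Event("Some other book", "left"),
]
titles = finished_titles(sample_events)
print(titles)
```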

extraannotationdata??

would be nice to extract chapters, e.g. for Greg Egan

debugging stuff

def zeroes(bts):
    res = b''
    for b in bts:
        res += b'\0'
        res += bytes([b])
    return res

# seq = zeroes(b'eventTimestamps')
seq = zeroes(b'Monetization')
idd = blob.find(seq)
# if idd == -1:
#     return
# else:
#     import sys
#     print('----', file=sys.stderr)
#     # print(context(), file=sys.stderr)
#     print(blob[idd + len(seq):], file=sys.stderr)
#     return
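For ASCII input, `zeroes` produces the same bytes as encoding the string as big-endian UTF-16, so the stdlib codec can build the same search needle. A sketch; the `blob` here is a made-up stand-in for the real data.

```python
def zeroes(bts: bytes) -> bytes:
    # Same helper as above: put a NUL byte in front of every byte.
    res = b''
    for b in bts:
        res += b'\0'
        res += bytes([b])
    return res

needle = zeroes(b'Monetization')
same = 'Monetization'.encode('utf-16-be')

# Made-up blob, just to show the search.
blob = b'\x01\x02' + needle + b'rest'
idx = blob.find(same)
print(idx, blob[idx + len(same):])
```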

[2020-05-07] Any way to filter out Pocket articles? · Issue #6 · karlicoss/kobuddy [[kobuddy]]

Kobo H2O Libra

might be useful for calibre??

[2019-11-07] my/kobo.py at master · karlicoss/my [[hpi]]

https://github.com/karlicoss/my/blob/master/my/books/kobo.py

from typing import Callable, Union

from my_configuration import paths
from my_configuration.repos.kobuddy.src.kobuddy import *

set_databases(paths.kobuddy.export_dir)

# TODO maybe type over T?
_Predicate = Callable[[str], bool]
Predicatish = Union[str, _Predicate]
def from_predicatish(p: Predicatish) -> _Predicate:
    if isinstance(p, str):
        # a plain string is treated as an exact-match predicate
        def ff(s):
            return s == p
        return ff
    else:
        # already a callable predicate
        return p

not sure, do we want this instead?

[2019-05-19] https://github.com/search?q=kobo+ExtraData&type=Code – no result, which kinda means no one did it [[kobo]]

same for LastOccurrence

* workout ledger [[wledger]]

related #hpi

if attributes are detected, but exercise isn’t, error and just render it as is

‘walked’ summary every day? exclude running??

highlight checked in dashboard?

workouts ledger??

support for ignoring/consuming words within certain context (e.g. (straight) leg raises) [[wledger]]

Parser combinators for workouts [[wledger]] [[hpi]] [[parsing]]
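A tiny sketch of what parser combinators for a workout line could look like. Everything here is an assumption for illustration: the `leg raises 3x10` input format, the `regex`/`seq`/`mapped` helper names and the output dict are made up, not the ledger's actual syntax.

```python
import re
from typing import Callable, List, Optional, Tuple

# A parser takes (text, position) and returns (value, new position), or None on failure.
Parser = Callable[[str, int], Optional[Tuple[object, int]]]

def regex(pattern: str, convert: Callable[[str], object] = str) -> Parser:
    compiled = re.compile(pattern)
    def parse(text: str, pos: int):
        m = compiled.match(text, pos)
        if m is None:
            return None
        return convert(m.group(0)), m.end()
    return parse

def seq(*parsers: Parser) -> Parser:
    """Run parsers one after another; fail if any of them fails."""
    def parse(text: str, pos: int):
        values: List[object] = []
        for p in parsers:
            res = p(text, pos)
            if res is None:
                return None
            value, pos = res
            values.append(value)
        return values, pos
    return parse

def mapped(parser: Parser, fn: Callable[[object], object]) -> Parser:
    """Transform the value of a successful parse."""
    def parse(text: str, pos: int):
        res = parser(text, pos)
        if res is None:
            return None
        value, new_pos = res
        return fn(value), new_pos
    return parse

name = regex(r"[a-z]+(?: [a-z]+)*")
number = regex(r"\d+", int)
entry = mapped(
    seq(name, regex(r"\s+"), number, regex(r"x"), number),
    lambda v: {"exercise": v[0], "sets": v[2], "reps": v[4]},
)

result = entry("leg raises 3x10", 0)
print(result)
```

A line without a recognisable exercise name simply returns None, which is where the 'render it as is' fallback could hook in.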

* twidump [[twidump]]

my #twitter exporter

still useful to warn when user is suspended, private etc… could add it to weekly check? [[twidump]]

maybe, renderer should be decoupled from twidump itself? [[twidump]]

backup my followers? [[twidump]]

——

align properties same way org does it? [[inorganic]]

could have caught it if was checking against increasing size.. [[exports]] [[backupchecker]]
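A sketch of such a check, assuming exports are expected to only grow over time: flag every export that is smaller than the one before it. `shrinking_exports` and the `(name, size)` input are hypothetical; sizes would come from something like `os.stat` in practice.

```python
from typing import List, Tuple

def shrinking_exports(sizes: List[Tuple[str, int]]) -> List[Tuple[str, str]]:
    """Return (previous, current) name pairs where the newer export is smaller.

    Expects (name, size in bytes) tuples ordered from oldest to newest.
    """
    suspicious = []
    for (prev_name, prev_size), (name, size) in zip(sizes, sizes[1:]):
        if size < prev_size:
            suspicious.append((prev_name, name))
    return suspicious

# Made-up sizes for illustration.
history = [("export-1.json", 100), ("export-2.json", 120), ("export-3.json", 90)]
print(shrinking_exports(history))
```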

some stuff in orgzly-android tests [[orgsandbox]]

[2020-03-19] karlicoss/sufs: Merge multiple directories into one via symlinks

https://github.com/karlicoss/sufs

sufs is a tool for 'merging' multiple directories into one via symlinks.

add example?
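To illustrate the general idea of merging directories via symlinks (not how sufs itself is implemented), here is a small Python sketch. `merge_via_symlinks` is a made-up helper, and 'first source wins on name clashes' is an assumption.

```python
import tempfile
from pathlib import Path
from typing import Iterable

def merge_via_symlinks(sources: Iterable[Path], target: Path) -> None:
    target.mkdir(parents=True, exist_ok=True)
    for src in sources:
        for entry in sorted(src.iterdir()):
            link = target / entry.name
            if link.is_symlink() or link.exists():
                # Assumption: the first source wins on name clashes.
                continue
            link.symlink_to(entry.resolve())

with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    a, b, merged = root / "a", root / "b", root / "merged"
    a.mkdir()
    b.mkdir()
    (a / "one.txt").write_text("1")
    (b / "two.txt").write_text("2")
    merge_via_symlinks([a, b], merged)
    merged_names = sorted(p.name for p in merged.iterdir())
    merged_content = (merged / "two.txt").read_text()

print(merged_names)
```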

Add unknown stuff to special playlist? [[spotisync]]

[2019-01-31] eh?

might be better to check against mpd?? It updates the database in background… [[spotisync]]

Rename to symlinkfs? [[sufs]]

[2019-11-10] tantivy::tokenizer - Rust [[scrapyroo]]

https://tantivy-search.github.io/tantivy/tantivy/tokenizer/index.html

In addition to what default does, the en_stem tokenizer also apply stemming to your tokens. Stemming consists in trimming words to remove their inflection. This tokenizer is slower than the default one, but is recommended to improve recall.

[2019-11-10] demonstrate different tokenizers?

name my package pyrog? [[org]]

gORGeous as org-mode app name [[org]]

On a looped string? [[quitar]]

Two strings, one imaginary one real. Normalisation is not necessary [[quitar]]

work on poset-ratings [[rating]]

https://github.com/karlicoss/telegram2org

Update description

Hmm, not sure if it counts playlists? [[spotisync]]

[2018-12-05] whooops… not really!

[2019-09-02] huh, ignoring could be implemented via special playlist…

list all cool things I tried and learned [[python]] [[hikkup]]

make symlink checker support an ignore file: a list of symlinks to ignore [[ruci]]

[2020-08-27] reHackable/awesome-reMarkable: A curated list of projects related to the reMarkable tablet [[remarkable]]