Hide Idle (>14 d.) Chans


← 2020-02-16 | 2020-02-18 →
thimbronion: BingoBoingo: Took me 2h 45 min: http://paste.deedbot.org/?id=RU6-
BingoBoingo: thimbronion: ty
thimbronion: Nice! Lol Madame Secretary
BingoBoingo: It's the official label for the USG "Secretary of State"
BingoBoingo: Meme'ing and joking are strong on Qntra. The weak part, as I'm getting beaten for... is the spreading.
asciilifeform: BingoBoingo: re spreading -- have you an idea of current readership stats ? ( even e.g. via perlism on machine logs )
BingoBoingo: asciilifeform: I'll grab some of those soon. http://logs.ossasepia.com/log/trilema/2020-02-15#1958113 was a paralyzing blow
ossabot: (trilema) 2020-02-15 mp_en_viaje: i seriously do not wish to hear anymore from you. if all you've got inside is in that vein, do me the favour and make that your last word.
thimbronion: Are platforms, as a structure, out, in terms of spreading?
BingoBoingo: thimbronion: No, I decided to focus on blog comments as a means of targeting folks already writing
thimbronion: Also, what did he mean by "that vein?"
ossabot: (trilema) 2020-02-15 mp_en_viaje: fucking inane bullshit, you've decided to see what action actionbot can bring ? who the fuck are you ? go measure the thickness of walls somewhere already, there's 0 the fuck need for YET ANOTHER pompous asshole to do me the very great service of counting my balls.
thimbronion: Yeah I read that but I don't know what it means.
thimbronion: Like is he saying, don't be middle management?
BingoBoingo: thimbronion: He doesn't need me around if all the value I can contribute is flattering him is the core of what I'm reading.
BingoBoingo: But I've been discovering I'm not as good at reading him as I thought.
asciilifeform: BingoBoingo: ftr i read it as on-schedule implementation of 'the old hands gotta go'. a la 1937.
thimbronion: Oh, ok I can see that angle in the "count my balls" line
thimbronion: But I don't get why asking for help with technical work triggered it
ossabot: (trilema) 2020-02-15 mp_en_viaje: fucking inane bullshit, you've decided to see what action actionbot can bring ? who the fuck are you ? go measure the thickness of walls somewhere already, there's 0 the fuck need for YET ANOTHER pompous asshole to do me the very great service of counting my balls.
ossabot: (trilema-hanbot) 2020-02-16 hanbot_abroad: i mean, there'd be some things to ponder in here, such as the notion that you can't hire people to do things for you if you've no idea how to do them yourself; that figuring things out involves struggling with what you don't understand rather than lifting the results of other people's struggles; that "sooner" is nonsense in this context given you've been going to/thinking about maybe trying to start
ossabot: (trilema-hanbot) 2020-02-16 hanbot_abroad: r YEARS; that you picked up everything and moved to another continent, the process and commitment of which is at least a few orders of magnitude larger than what learning how to write a freaking script would involve...and i'm sure i could wring more out of here.
ossabot: (trilema-hanbot) 2020-02-16 hanbot_abroad: but seriously, all of that is moot, completely, if the foregoing ain't clear. republican tools ain't for passing the buck, and the behavior's not sound, or sane.
BingoBoingo: Something in my head broke, "Is that all I've got?", "Wat do?", and after being stunned by dread, I'm throwing myself at the command line trying to make the pieces that be strung together into a blog crawler.
thimbronion: BingoBoingo: what are you writing it in? Depending on the lang, I may be of more or less help.
BingoBoingo: thimbronion: Right not I'm working with shell commands, trying to do it with things that already belong on the system.
thimbronion: Ok. I did write a static blog using bash a while ago, but very much not my specialty.
BingoBoingo: thimbronion: So what I've got right now are a pair of commands. One retrieves urls from a page and sticks them in a file. The second command makes a second file from urls in the first that is a dedeplicated list of domains http://paste.deedbot.org/?id=nIKD
BingoBoingo: In trying to treat the spec I produced here as a sort of pseudocode, right now I'm banging my head on how to go to a top level page and follow the first link inside a div labeled "main"
BingoBoingo: From there I'd like to feed that second page into grep so grep looks signs of a working comment box.
thimbronion: This would be relatively easy to do with an html parser, which I suspect you might be avoiding, but not sure. Using regex to parse html can be very tricky.
BingoBoingo: It may be. Can you recommend any html parsers I should be looking at?
thimbronion: Well, the smarmy languages all have one or two popular libraries. Here's one for ruby for example: https://nokogiri.org/. In terms of smarm-free langs, I don't have anything I've used that I can recommend. I suspect there are some good utilities out there.
BingoBoingo: I'm reading the gawk man page and inclined to give it a whirl, but I'm still digesting how to shape the thing
asciilifeform: BingoBoingo: at one time i used a proggy called 'scrapy'. for simple spidering, worked. (python, tho.)
asciilifeform: ( if you want to try it, be aware that only 1.x vers work on python2 )
shinohai: lynx -dump http://qntra.net | awk '/http/{print $2}' <<< if trying to dump url's from a page doesn't that work?
BingoBoingo: shinohai: I'll try that form for parsing
BingoBoingo: asciilifeform: I might give that a try if exhausting myself with awk doesn't yield anything.
BingoBoingo to head to Spanish class, rescheduled from Friday due to Susan having a job interview, in a bit
BingoBoingo: Thank you all for giving me things to chew.
shinohai: eh sorry abt awk above, should have added: ` awk '/http/||/https/{print $2}' `
shinohai: (to catch the https junk)
BingoBoingo: Maximizing the targeting of targetable junk is indeed the name of the game.
shinohai: I love the fuck outta awk for that.
← 2020-02-16 | 2020-02-18 →