Skip to content

Instantly share code, notes, and snippets.

View gregjan's full-sized avatar

Gregory Jansen gregjan

View GitHub Profile
from bs4 import BeautifulSoup
import spacy
import stanza
import re
nlp = stanza.Pipeline(lang='en', processors='tokenize,pos,lemma,depparse,constituency', package={'constituency': 'ptb3_bert'})
# Change to a loop through of whatever the document storage is
HTMLFile = open("AgentOrange.html", "r")
#change this to whatever string search you're using
x="problems" # FIXME: Not currently used..
@gregjan
gregjan / asdf
Created May 28, 2022 03:07
Old World crash log
sasdfa
teststack_trellis-cassandra.1.6hyk44rvrytg@ciber-vs4.umd.edu |
2019-11-20 15:39:45,863 ERROR [io.qua.ver.htt.run.QuarkusErrorHandler]
(vert.x-worker-thread-18)
HTTP Request to /2dacfa78-3996-4cb2-83a2-a082eeddf37c failed, error
id: 7fbd39af-8cd6-4090-8efa-0e96036995d7-685:
org.jboss.resteasy.spi.UnhandledException:
com.google.common.util.concurrent.UncheckedExecutionException:
java.util.concurrent.CompletionException:
com.datastax.oss.driver.api.core.DriverTimeoutException: Query timed
This file has been truncated, but you can view the full file.
trellis_trellis.1.msubc3lgkxly@deck | INFO [2019-08-08 00:43:50,213] io.dropwizard.server.DefaultServerFactory: Registering jersey handler with root path prefix: /
trellis_trellis.1.msubc3lgkxly@deck | INFO [2019-08-08 00:43:50,215] io.dropwizard.server.DefaultServerFactory: Registering admin handler with root path prefix: /
trellis_trellis.1.msubc3lgkxly@deck | DEBUG [2019-08-08 00:43:50,215] org.trellisldp.app.AbstractTrellisApplication: Initializing Trellis application with class org.trellisldp.ext.db.app.AppConfiguration
trellis_trellis.1.msubc3lgkxly@deck | INFO [2019-08-08 00:43:50,641] org.trellisldp.file.FileMementoService: Storing Mementos as files at /opt/trellis/data/mementos
trellis_trellis.1.msubc3lgkxly@deck | INFO [2019-08-08 00:43:50,659] org.trellisldp.ext.db.DBResourceService: Using database persistence with TrellisLDP
ciber-vs1:testbed$ curl -v -H 'Prefer: return=representation; include="http://www.trellisldp.org/ns/trellis#PreferAccessControl"' http://ciber-vs1.umd.edu:10080/
* About to connect() to ciber-vs1.umd.edu port 10080 (#0)
* Trying 128.8.216.153...
* Connected to ciber-vs1.umd.edu (128.8.216.153) port 10080 (#0)
> GET / HTTP/1.1
> User-Agent: curl/7.29.0
> Host: ciber-vs1.umd.edu:10080
> Accept: */*
> Prefer: return=representation; include="http://www.trellisldp.org/ns/trellis#PreferAccessControl"
>
ciber-vs1:testbed$ curl -v -u admin:admin http://ciber-vs1.umd.edu:10080
* About to connect() to ciber-vs1.umd.edu port 10080 (#0)
* Trying 128.8.216.153...
* Connected to ciber-vs1.umd.edu (128.8.216.153) port 10080 (#0)
* Server auth using Basic with user 'admin'
> GET / HTTP/1.1
> Authorization: Basic YWRtaW46YWRtaW4=
> User-Agent: curl/7.29.0
> Host: ciber-vs1.umd.edu:10080
> Accept: */*
@gregjan
gregjan / session.log
Created May 20, 2019 15:14
authorization setting attempt
ciber-vs1:testbed$ curl -X PUT -v -H "Content-Type: text/turtle" --data-binary @auth.ttl -u admin:admin http://ciber-vs1.umd.edu:10080?ext=acl
* About to connect() to ciber-vs1.umd.edu port 10080 (#0)
* Trying 128.8.216.153...
* Connected to ciber-vs1.umd.edu (128.8.216.153) port 10080 (#0)
* Server auth using Basic with user 'admin'
> PUT /?ext=acl HTTP/1.1
> Authorization: Basic YWRtaW46YWRtaW4=
> User-Agent: curl/7.29.0
> Host: ciber-vs1.umd.edu:10080
> Accept: */*
jansen@X1:~$ curl -v -XPOST -H "Link: <http://www.w3.org/ns/ldp#BasicContainer>; rel=\"type\"" http://ciber-vs1.umd.edu:10080
* Rebuilt URL to: http://ciber-vs1.umd.edu:10080/
* Trying 128.8.216.153...
* TCP_NODELAY set
* Connected to ciber-vs1.umd.edu (128.8.216.153) port 10080 (#0)
> POST / HTTP/1.1
> Host: ciber-vs1.umd.edu:10080
> User-Agent: curl/7.58.0
> Accept: */*
> Link: <http://www.w3.org/ns/ldp#BasicContainer>; rel="type"
@gregjan
gregjan / terminal session
Last active March 21, 2019 19:47
duplication of the contains duplication issue
jansen@X1:~$ curl -v -XPOST -H "Link: <http://www.w3.org/ns/ldp#BasicContainer>; rel=\"type\"" http://ciber-vs1.umd.edu:10080
* Rebuilt URL to: http://ciber-vs1.umd.edu:10080/
* Trying 128.8.216.153...
* TCP_NODELAY set
* Connected to ciber-vs1.umd.edu (128.8.216.153) port 10080 (#0)
> POST / HTTP/1.1
> Host: ciber-vs1.umd.edu:10080
> User-Agent: curl/7.58.0
> Accept: */*
> Link: <http://www.w3.org/ns/ldp#BasicContainer>; rel="type"
@gregjan
gregjan / system.log
Created March 14, 2019 19:10
log lines from time of testing
DEBUG [cluster1-nio-worker-4] 2019-03-14 19:06:40,293 QueryLogger.java:711 - [cluster1] [/10.0.19.7:9042] Query completed normally, took 5 ms: [1 bound values] SELECT modified FROM mutabledata WHERE identifier = ?;
DEBUG [cluster1-nio-worker-4] 2019-03-14 19:06:40,296 QueryLogger.java:711 - [cluster1] [/10.0.19.7:9042] Query completed normally, took 9 ms: [2 bound values] SELECT * FROM mutabledata WHERE identifier = ? AND createdSeconds <= ? LIMIT 1 ALLOW FILTERING;
DEBUG [cluster1-nio-worker-2] 2019-03-14 19:06:46,182 QueryLogger.java:711 - [cluster1] [cassandra-1/10.0.19.5:9042] Query completed normally, took 4 ms: [2 bound values] SELECT * FROM mutabledata WHERE identifier = ? AND createdSeconds <= ? LIMIT 1 ALLOW FILTERING;
DEBUG [cluster1-nio-worker-2] 2019-03-14 19:06:46,184 QueryLogger.java:711 - [cluster1] [cassandra-1/10.0.19.5:9042] Query completed normally, took 6 ms: [1 bound values] SELECT modified FROM mutabledata WHERE identifier = ?;
DEBUG [cluster1-nio-worker-2] 2019-03-14 19:06:46,207 QueryL