This gist shares a short workflow and script for a task that most people using university HPC clusters for NLP research will run into: downloading and storing HuggingFace models for use on compute nodes.
What this workflow is for:
- Context: you want to use HuggingFace models on Della (or other HPC clusters).
- Problem 1: you cannot call `AutoModel.from_pretrained('model/name')` at run time because compute nodes are not connected to the internet.
- Problem 2: running `AutoModel.from_pretrained()` on the head node is impractical because the model is too large to load there.
- Problem 3: you do not want to save the model weights to the default `~/.cache/` because you only get 10 GB of storage on `/home`.
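The idea behind the workflow can be sketched in a few lines: on an internet-connected node (the login node or a data-transfer node), point the HuggingFace cache at a large filesystem and download the weights there once; compute jobs then load from that path with no network access. This is a minimal sketch, not the gist's exact script — the `/tmp/hf_models` directory stands in for a real large-storage path like `/scratch/<netid>/hf_models`, and the repo id you pass to `download` is whatever model you need.

```python
import os
from pathlib import Path

# Illustrative cache location -- on a real cluster, substitute a path on
# large storage (e.g. /scratch/<netid>/hf_models), NOT anything under /home.
CACHE_DIR = Path("/tmp/hf_models")

# Set HF_HOME *before* importing any HuggingFace library, so that nothing
# falls back to the default ~/.cache location on the 10 GB /home quota.
os.environ["HF_HOME"] = str(CACHE_DIR)


def download(repo_id: str) -> Path:
    """Fetch a model snapshot into CACHE_DIR.

    Run this once on a node with internet access (login/transfer node).
    Returns the local directory containing the downloaded weights, which
    you can later pass to AutoModel.from_pretrained() on a compute node.
    """
    # Imported lazily so the module loads even where huggingface_hub
    # is not installed.
    from huggingface_hub import snapshot_download

    return Path(snapshot_download(repo_id=repo_id, cache_dir=str(CACHE_DIR)))
```

At run time on the compute node, `AutoModel.from_pretrained(local_path)` with the returned directory (or the same `HF_HOME` setting) loads the model entirely from disk; setting `HF_HUB_OFFLINE=1` additionally guarantees no network calls are attempted.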