@thehunmonkgroup
thehunmonkgroup / summary.md
Created September 26, 2024 23:46
Summary: Gravitational stability and fragmentation condition for discs around accreting supermassive stars

URL: https://export.arxiv.org/pdf/1901.00007.pdf

Gravitational stability and fragmentation condition for discs around accreting supermassive stars


QUESTION 1:

Can you give me a very clear explanation of the core assertions, implications, and mechanics elucidated in this paper?

ANSWER 1:

@thehunmonkgroup
thehunmonkgroup / summary.md
Created September 23, 2024 20:06
Summary: Training Language Models to Self-Correct via Reinforcement Learning

URL: https://arxiv.org/pdf/2409.12917.pdf

Training Language Models to Self-Correct via Reinforcement Learning


QUESTION 1:

Can you give me a very clear explanation of the core assertions, implications, and mechanics elucidated in this paper?

ANSWER 1:

@thehunmonkgroup
thehunmonkgroup / summary.md
Created September 23, 2024 19:22
Summary: TO COT OR NOT TO COT? CHAIN-OF-THOUGHT HELPS MAINLY ON MATH AND SYMBOLIC REASONING

URL: https://arxiv.org/pdf/2409.12183.pdf

TO COT OR NOT TO COT? CHAIN-OF-THOUGHT HELPS MAINLY ON MATH AND SYMBOLIC REASONING


QUESTION 1:

Can you give me a very clear explanation of the core assertions, implications, and mechanics elucidated in this paper?

ANSWER 1:

@thehunmonkgroup
thehunmonkgroup / summary.md
Created September 23, 2024 18:56
Summary: Jailbreaking Large Language Models with Symbolic Mathematics

URL: https://arxiv.org/pdf/2409.11445.pdf

Jailbreaking Large Language Models with Symbolic Mathematics


QUESTION 1:

Can you give me a very clear explanation of the core assertions, implications, and mechanics elucidated in this paper?

ANSWER 1:

@thehunmonkgroup
thehunmonkgroup / Analyzing Research Paper: Extracting Data.md
Created September 21, 2024 17:04
Analyzing Research Paper: Extracting Data

USER

Your task is to examine the provided research paper, and extract three pieces of related data:

  1. A question that is explored
  2. A chain of reasoning that bridges the question and the final answer
  3. The final answer

When choosing the question/chain/answer set, focus on a set where the chain of reasoning best adheres to the 'Chain of Reasoning Criteria' listed below:

@thehunmonkgroup
thehunmonkgroup / summary.md
Created September 18, 2024 14:42
Summary: SafeLMM: Safe Large Multimodal Models By Design
@thehunmonkgroup
thehunmonkgroup / summary.md
Created September 18, 2024 14:15
Summary: What’s the Magic Word? A Control Theory of LLM Prompting

URL: https://arxiv.org/pdf/2310.04444.pdf

What’s the Magic Word? A Control Theory of LLM Prompting


QUESTION 1:

Can you give me a very clear explanation of the core assertions, implications, and mechanics elucidated in this paper?

ANSWER 1:

@thehunmonkgroup
thehunmonkgroup / 1. Fine-Tuning vs Reinforcement Learning: Explained, 2. Understanding Fine-Tuning and Reinforcement Learning, 3. Difference Between Fine-Tuning and Reinforcement Learning.md
Created September 18, 2024 14:05
1. Fine-Tuning vs Reinforcement Learning: Explained, 2. Understanding Fine-Tuning and Reinforcement Learning, 3. Difference Between Fine-Tuning and Reinforcement Learning

USER

I am an expert systems designer and programmer, but have little experience in machine learning. Provide me a brief tutorial with examples to explain and clarify the difference between fine-tuning and reinforcement learning, especially in the context of large language models.


ASSISTANT

Certainly! Let's delve into the differences between fine-tuning and reinforcement learning, particularly in the context of large language models (LLMs) like GPT-3 or GPT-4. I'll provide explanations and examples that leverage your expertise in systems design and programming, while introducing machine learning concepts.

@thehunmonkgroup
thehunmonkgroup / summary.md
Created September 18, 2024 13:57
Summary: PROMETHEUS 2: An Open Source Language Model Specialized in Evaluating Other Language Models

URL: https://arxiv.org/pdf/2405.01535.pdf

PROMETHEUS 2: An Open Source Language Model Specialized in Evaluating Other Language Models


QUESTION 1:

Can you give me a very clear explanation of the core assertions, implications, and mechanics elucidated in this paper?

ANSWER 1:

@thehunmonkgroup
thehunmonkgroup / summary.md
Created September 8, 2024 23:03
Summary: LONGCITE: ENABLING LLMS TO GENERATE FINE-GRAINED CITATIONS IN LONG-CONTEXT QA

URL: https://arxiv.org/pdf/2409.02897.pdf

LONGCITE: ENABLING LLMS TO GENERATE FINE-GRAINED CITATIONS IN LONG-CONTEXT QA


QUESTION 1:

Can you give me a very clear explanation of the core assertions, implications, and mechanics elucidated in this paper?

ANSWER 1: