AI Datasets | Daniel Rosehill

58

Total Datasets

Hugging Face

Iran Israel War 2026

Dataset

Iran-Israel War — OSINT Dataset Open-source intelligence dataset tracking Iranian missile and drone attack waves against Israel and US/coalition targets across four operations in the "True Promise" series (2024–2026). Dataset Description 53 attack waves across four Iranian military operations, each with 89 structured fields covering timing, weapons systems, targets, interception performance, casualties, and escalation indicators. Also includes international reactions data… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Iran-Israel-War-2026.

AI ExperimentsUtility Tools

3/5/2026

Hugging Face

Ivory Catalog 020226

Dataset

Ivory Computer Parts Catalog - February 2026 A snapshot of computer hardware pricing from Ivory.co.il, one of Israel's largest electronics retailers, captured on February 2, 2026. Dataset Description This dataset contains 658 computer hardware products with Israeli retail prices (ILS), estimated US RRP prices (USD), and price comparison ratios. The data was scraped from Ivory's public product catalog. Categories Category Products Cooling 138… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Ivory-Catalog-020226.

2/2/2026

Hugging Face

Transcription Cleanup Trainer

Dataset

Text Cleanup Fine-Tuning Dataset A curated dataset for training speech-to-text cleanup models to achieve optimal transcript refinement. Dataset Description This dataset contains paired examples of raw speech-to-text transcriptions and manually-cleaned versions, designed for fine-tuning models to clean up transcripts to a specific quality level ("Goldilocks" cleanup - not too much, not too little). Dataset Structure dataset/ ├── data/ │ ├── audio/… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Transcription-Cleanup-Trainer.

AI ExperimentsUtility ToolsDemos & POCs

12/18/2025

Hugging Face

Hebrew Image Eval 111225

Dataset

Hebrew Image Generation Evaluation An evaluation of major text-to-image models on their ability to accurately render Hebrew text. Overview This evaluation tests 12 image generation models on two Hebrew words: Word Transliteration Meaning Difficulty שלום Shalom Peace/Hello Easy - Most famous Hebrew word פירגון Firgun Joy in sharing others' success Hard - Uniquely Hebrew concept, less common Prompt template: "A banner graphic with the word [word] written… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Hebrew-Image-Eval-111225.

AI ExperimentsData VisualizationUtility Tools

12/11/2025

Hugging Face

STT Fine Tune Eval 101225

Dataset

STT Fine-Tune Evaluation Results Evaluation Date: December 10, 2025 This repository contains evaluation results comparing locally fine-tuned Whisper models against stock (original) Whisper inference across all model sizes. Related Resources Evaluation Script: Whisper-Fine-Tune-Accuracy-Eval Evaluation Dataset: Small STT Eval Audio Dataset Models Evaluated Fine-tuned models: tiny, base, small, medium, large Original (stock) models: tiny, base, small, medium… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/STT-Fine-Tune-Eval-101225.

AI ExperimentsUtility Tools

12/10/2025

Hugging Face

Small STT Eval Audio Dataset

Dataset

Small STT Eval Audio Dataset A small speech-to-text evaluation dataset containing 92 audio samples with ground truth transcriptions. Designed for evaluating STT systems on technical vocabulary, code-switching (English/Hebrew), and various speaking styles. Dataset Description This dataset contains audio recordings with accompanying transcriptions across multiple categories: Category Count Description tech_github 5 GitHub-related technical vocabulary… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Small-STT-Eval-Audio-Dataset.

AI ExperimentsUtility Tools

12/10/2025

Hugging Face

ASR WPM And Background Noise Eval

Dataset

ASR WPM and Background Noise Evaluation Dataset A dataset of annotated audio recordings for evaluating how different factors affect Whisper (and other ASR/STT systems) transcription accuracy. Purpose This dataset provides controlled audio samples with annotations to evaluate ASR performance across: Speaking pace (fast, normal, slow, mumbled, whispered, weird voices) Background noise (cafe, music, conversations in various languages, traffic, sirens, etc.) Microphone… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/ASR-WPM-And-Background-Noise-Eval.

Utility Tools

12/9/2025

Hugging Face

Israel Photos

Dataset

Israel Photos Dataset A collection of 369 photographs captured across Israel between 2024 and 2025, with LLM-generated captions and location annotations. The images are sourced from the photographer's Pexels gallery. About This Collection This dataset was deliberately curated to provide a diverse visual representation of Israel, encompassing: Varied locations: From the historic streets of Jerusalem's Old City to Tel Aviv's urban landscape, desert vistas in the Negev, and… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Israel-Photos.

AI ExperimentsData VisualizationUtility Tools

12/2/2025

Hugging Face

Sample Voice Context Data

Dataset

Sample Voice Context Data A small synthetic dataset containing LLM-generated context information simulating a job seeker narrating their career trajectory. Purpose This dataset was created to test a voice-to-vector-database RAG pipeline. The workflow being evaluated involves: Voice data (MP3 recordings) transcribed to text Transcriptions reformatted as structured context data Text data upserted into a vector database (Pinecone or Ragie) Retrieval accuracy tested by… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Sample-Voice-Context-Data.

AI ExperimentsAgent OrchestrationUtility Tools

11/30/2025

Hugging Face

Tech Sentences For ASR Training

Dataset

TechVoice Dataset Work in Progress – This dataset is actively being expanded with new recordings. Dataset Statistics Metric Current Target Progress Duration 38m 43s 5h 0m 0s ██░░░░░░░░░░░░░░░░░░ 12.9% Words 10,412 50,000 ████░░░░░░░░░░░░░░░░ 20.8% Total Recordings: 205 samples Total Characters: 74,312 A specialized speech dataset for fine-tuning Automatic Speech Recognition (ASR) models on technical and developer vocabulary. Contains human-recorded… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Tech-Sentences-For-ASR-Training.

AI ExperimentsUtility Tools

11/22/2025

Hugging Face

Whisper Fine Tune One Shot Eval

Dataset

Whisper Fine-Tuning Evaluation: Local vs Commercial ASR A "back of the envelope" evaluation comparing fine-tuned Whisper models running locally against commercial ASR APIs via Eden AI. The Question Can fine-tuning Whisper achieve measurable WER reductions, even when comparing local inference against cloud-based commercial models? TL;DR Yes. Fine-tuned Whisper Large Turbo running locally achieved 5.84% WER, beating the best commercial API (Assembly at… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Whisper-Fine-Tune-One-Shot-Eval.

AI ExperimentsUtility Tools

11/17/2025

Hugging Face

English Hebrew Mixed Sentences

Dataset

English-Hebrew Mixed Sentences Dataset A dataset of English sentences with Hebrew words and phrases interspersed, designed for speech-to-text training and evaluation for English speakers in Israel. Overview This dataset addresses a common challenge for English-speaking immigrants in Israel: standard speech-to-text (STT) systems struggle to accurately transcribe code-switched speech where Hebrew words are mixed into primarily English sentences. Example: "I need to pick up… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/English-Hebrew-Mixed-Sentences.

AI ExperimentsUtility ToolsDemos & POCs

11/17/2025

Hugging Face

Podcast ASR Evaluation

Dataset

No description provided

Utility Tools

11/13/2025

Hugging Face

Open Router API Pricing Analysis

Dataset

OpenRouter API Pricing Analysis Dataset Overview This dataset provides a point-in-time capture of pricing and parameters for LLMs available through the OpenRouter API for inference. Contents Raw Data (raw/) Contains the original data extracted from the OpenRouter API, including: Model pricing (input/output token costs) Model parameters and specifications Computed fields such as output/input token price ratios Enhanced Data (hf-enhanced/)… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Open-Router-API-Pricing-Analysis.

AI ExperimentsUtility ToolsContent & Blogs

11/10/2025

Hugging Face

Voice Notes Classified

Dataset

No description provided

Utility Tools

11/6/2025

Hugging Face

Accidental And Low Quality Photos

Dataset

Accidental and Low-Quality Photos Dataset This repository contains unintentional photos from camera rolls - accidental captures, blurry shots, and other non-intentionally captured images. Purpose The intended use case is training an image organization model to automatically distinguish and suggest for deletion non-intentionally captured images. This can help users efficiently clean up their camera rolls by identifying photos that were taken accidentally or are of poor… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/accidental-and-low-quality-photos.

AI ExperimentsUtility ToolsApps & GUIs

10/23/2025

Hugging Face

Multimodal Ai Taxonomy

Dataset

Multimodal AI Taxonomy A comprehensive, structured taxonomy for mapping multimodal AI model capabilities across input and output modalities. Dataset Description This dataset provides a systematic categorization of multimodal AI capabilities, enabling users to: Navigate the complex landscape of multimodal AI models Filter models by specific input/output modality combinations Understand the nuanced differences between similar models (e.g., image-to-video with/without audio… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/multimodal-ai-taxonomy.

AI ExperimentsUtility ToolsApps & GUIs

10/22/2025

Hugging Face

Jerusalem High Rise Development

Dataset

Jerusalem High-Rise Development Image Dataset Overview This dataset contains 56 photographs documenting high-rise buildings and urban development in Jerusalem, Israel. The images capture the architectural evolution of Jerusalem's modern skyline, featuring contemporary construction, building facades, and urban landscapes. Purpose This dataset has been created and shared for the following purposes: Image fine-tuning and AI training: High-quality architectural… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Jerusalem-High-Rise-Development.

AI ExperimentsData VisualizationUtility Tools

10/18/2025

Hugging Face

Hebrew Language Signage

Dataset

Hebrew Language Signage Dataset Overview This dataset contains photographs of Hebrew language text in everyday contexts throughout Israel, with a particular focus on signage displays including street signs, commercial signage, and public information displays. Dataset Details Total Images: 68 Format: PNG Content: Real-world photographs of Hebrew text and signage Language Coverage: Primarily Hebrew, with many signs also containing English and Arabic text… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Hebrew-Language-Signage.

AI ExperimentsData VisualizationUtility ToolsContent & Blogs

10/18/2025

Hugging Face

Tel Aviv Pics

Dataset

Tel Aviv Urban Photography Dataset Dataset Description This dataset contains 53 high-quality photographs of Tel Aviv's urban environment, captured to serve as reference material for game development, 3D world creation, and digital environment design. Dataset Summary Total Images: 53 photographs Location: Tel Aviv, Israel Format: JPG Average Size: ~1MB per image Resolution: High-resolution photographs suitable for texture extraction and reference License:… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Tel-Aviv-Pics.

AI ExperimentsData VisualizationUtility Tools

10/17/2025

Hugging Face

Jerusalem Streetscapes

Dataset

Jerusalem Streetscapes Dataset A small image dataset containing photos of the rapidly changing urban landscape of Jerusalem, Israel, captured by day and by night. About This dataset documents the evolving cityscape of Jerusalem through 120 photographs taken between June 2024 and September 2025. Dataset Details Number of Images: 120 Time Period: June 2024 - September 2025 Location: Jerusalem, Israel Coverage: Day and night photography of urban landscapes… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Jerusalem-Streetscapes.

AI ExperimentsData VisualizationUtility Tools

10/6/2025

Hugging Face

Narcissistic Abuse Support Configs

Dataset

Narcissistic Abuse AI Support Configurations A comprehensive network of AI agent configurations designed to provide support for individuals affected by relationships with personality disordered individuals, particularly those with Cluster B disorders and narcissistic personality disorder. Important Disclaimer These tools are not replacements for professional mental health support. They are intended as adjuncts to professional therapeutic care. Most configurations contain… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Narcissistic-Abuse-Support-Configs.

AI ExperimentsAgent OrchestrationUtility Tools

9/26/2025

Hugging Face

Jerusalem Emergency Shelters 0925

Dataset

Jerusalem Public Shelter Dataset - September 2025 This repository contains updated location data for public shelters in the Jerusalem area, populated on September 19th, 2025. The data has been processed and enhanced to support individual preparedness efforts and geolocation applications. Data Source The original data was provided by the Jerusalem Municipality and is available in the source_data folder. This dataset represents the most current information available as… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Jerusalem-Emergency-Shelters-0925.

AI ExperimentsUtility ToolsApps & GUIs

9/19/2025

Hugging Face

Code Gen Agents 0925

Dataset

Code Generation Agent Network A comprehensive collection of specialized AI agents for code generation, development workflows, and project management. While originally designed for Claude Code, these agent specifications are framework-agnostic and can be adapted to work with any AI code generation platform or multi-agent system. Framework Agnostic Design This repository contains agent specifications that define: Clear role definitions and capabilities Tool requirements… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Code-Gen-Agents-0925.

AI ExperimentsAgent OrchestrationUtility Tools

9/15/2025

Hugging Face

ISO 3166 4217 Consolidated

Dataset

ISO-3166 & ISO-4217 "Consolidated" Lookup Dataset This dataset contains a mapping between ISO-3166 (countries) and ISO-4217 (currencies). The objective was to create a single dataset to support everyday workloads in international financial analysis undertaken by "casual" / non-official actors and analysts. Version V1 Compiled by: Daniel Rosehill Date: 03-09 (September) - 2025 Note: geopolitics and the global financial system are clearly dynamic concepts. Much as this… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/ISO-3166-4217-Consolidated.

AI ExperimentsUtility ToolsApps & GUIs

9/3/2025

Hugging Face

Prompt Or Not

Dataset

No description provided

AI ExperimentsUtility Tools

8/31/2025

Hugging Face

Ai Generated Podcast Episodes

Dataset

No description provided

AI ExperimentsUtility Tools

8/26/2025

Hugging Face

Zapier Integrations 260825

Dataset

Zapier Apps/Integrations Extracted: Aug 26, 2025

Apps & GUIs

8/26/2025

Hugging Face

NVR Entity Recognition Experiment

Dataset

NVR Entity Recognition Experiment Overview This repository contains a training dataset designed for entity recognition in Network Video Recorder (NVR) applications, specifically focused on newborn safety monitoring. The dataset uses a stuffed animal as a privacy-conscious substitute for actual newborn footage, enabling the development of computer vision models that can identify critical safety scenarios in nursery environments. Purpose The primary goal of this… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/NVR-Entity-Recognition-Experiment.

AI ExperimentsUtility ToolsApps & GUIs

8/21/2025

Hugging Face

Long Prompt Experiment

Dataset

I conducted this experiment to investigate the impact of prompt structure and optimization on LLM performance, specifically testing whether quality and organization matter more than raw prompt length for complex technical tasks. Research Question For specialized technical tasks, does prompt structure and optimization have a greater impact on output quality than raw prompt length alone? Experiment Design I compared three distinct prompting approaches using Gemini 2.5 Lite… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Long-Prompt-Experiment.

AI ExperimentsUtility ToolsApps & GUIs

8/19/2025

Hugging Face

Global Value Factor Database Refactor V2

Dataset

Refactor and HF dataset (including texts): Daniel Rosehill Source data: International Foundation for Valuing Impacts This dataset provides V2 of a refactoring of the Global Value Factor Database (GVFD) by the International Foundation for Valuing Impacts intended to enhance the original dataset for machine readability and integration into data analysis and visualization workloads. The International Foundation for Valuing Impacts (IFVI) produces an (open-source) database called the Global… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Global-Value-Factor-Database-Refactor-V2.

Environmental & DataData VisualizationUtility Tools

8/16/2025

Hugging Face

Voice Note Audio

Dataset

Voice Notes Dataset Dataset Description This dataset contains real-world voice recordings with transcripts and comprehensive annotations. Dataset Statistics Total Entries: 2 Audio Files: 2 Uncorrected Transcripts: 2 Ground Truth Transcripts: 0 Annotation Files: 2 Export Date: 2025-10-27 Dataset Structure audio/ # Audio recordings (MP3, etc.) ├── 1.mp3 ├── 2.mp3 └── ... transcripts/ ├── uncorrected/ # Original STT… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Voice-Note-Audio.

AI ExperimentsUtility Tools

8/14/2025

Hugging Face

STT Voice Notes Evals

Dataset

STT Voice Note Evaluation Author: Daniel RosehillDate Created: August 11, 2025Purpose: Comparative evaluation of Speech-to-Text (STT) services for voice note transcription Overview This dataset was created as part of ongoing work developing voice note transcription systems. It contains ground truth transcripts representing typical daily voice notes, recorded to evaluate and compare STT service accuracy across different content types. Speaker Profile: Single speaker… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/STT-Voice-Notes-Evals.

AI ExperimentsUtility ToolsContent & Blogs

8/11/2025

Hugging Face

Github Repos 100825

Dataset

No description provided

Utility Tools

8/10/2025

Hugging Face

System Prompt Library 030825

Dataset

System Prompts Dataset - August 2025 Point-in-time export from Daniel Rosehill's system prompt library as of August 3rd, 2025 Overview This repository contains a comprehensive collection of 944 system prompts designed for various AI applications, agent workflows, and conversational AI systems. While many of these prompts now serve as the foundation for more complex agent-based workflows, they continue to provide essential building blocks for AI system design and… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/System-Prompt-Library-030825.

AI ExperimentsAgent OrchestrationUtility ToolsApps & GUIs

8/3/2025

Hugging Face

Text Transformation Prompts 300525

Dataset

Text Transformation Prompt Library A comprehensive collection of text transformation prompts for reformatting dictated text into various formats, styles, and structures. Quick Links Repository Structure /prompts/ The main collection of text transformation prompts. /prompts/md/ - Markdown format prompts /prompts/json/ - JSON format equivalents of the markdown prompts Prompt Structure Each prompt follows a standardized markdown… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Text-Transformation-Prompts-300525.

AI ExperimentsUtility Tools

5/29/2025

Hugging Face

System Prompt Library

Dataset

My AI System Prompt Library This repository contains a comprehensive, up-to-date library of system prompts for AI systems and autonomous agents, started on May 27th, 2025. Overview This collection houses 923 system prompts covering a diverse range of AI applications. The prompts include configurations for autonomous agents, simple chatbots, specialized assistants, and various AI-powered tools. This repository serves as a centralized hub for these prompts, maintained… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/System-Prompt-Library.

5/27/2025

Hugging Face

Pay For Outcomes Instruments

Dataset

Social-Impact-Bond-Data This repository contains a curated, redacted, and standardized data set based on the Government Outcome Labs project at Oxford University (UK). It is the leading international data resource tracking the growth and execution of social impact bonds (SIBs), development impact bonds (DIBs), outcome funds, and other pay-for-success instruments worldwide. Project Purpose The data set supports research and AI-driven policy analysis on innovative… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/pay-for-outcomes-instruments.

AI ExperimentsUtility Tools

5/21/2025

Hugging Face

Israel Alerting Zones

Dataset

Israel Emergency Alerting Zones Dataset This repository contains a comprehensive list of emergency alerting zones used in Israel by the Home Front Command (Pikud HaOref), compiled on May 9th, 2025. Dataset Description This dataset provides a point-in-time export of the alerting areas used by Israel's Home Front Command for issuing emergency alerts during security situations. The alerting zones are primarily used for missile threat notifications and other emergency… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Israel-Alerting-Zones.

AI ExperimentsUtility Tools

5/9/2025

Hugging Face

Blog Posts

Dataset

No description provided

Utility ToolsContent & Blogs

5/8/2025

Hugging Face

Career Data Context Repo

Dataset

Hello, Friendly AI Bot! Context Generation Date: 28 / April / 2025 Creation Timestamp: 2025-04-28T18:59:13Z Welcome! If you are able to read and parse this text, then this context data repository is working as intended.You have arrived at a small, modular pool of contextual data designed to provide insight into my career aspirations, professional experience, and work preferences. Refer to the "Context Generation Date" above, or if you are able to parse file metadata, use… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Career-Data-Context-Repo.

AI ExperimentsUtility Tools

4/28/2025

Hugging Face

Software Wish List Context Data

Dataset

Hello, Friendly AI Bot! Context Generation Date: 28 / (April) / 2025 If you're reading this, then the context pipeline is working as intended, and you have arrived at a small repository of contextual data intended to provide you with general background context about what I look for in software evaluations. As you probably already know, my name is Daniel. I'm a huge fan of technology. And I frequently find myself looking for software tools. Sometimes I do this for work… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Software-Wish-List-Context-Data.

AI ExperimentsUtility Tools

4/28/2025

Hugging Face

Corn Training Set

Dataset

Corn The Sloth Training Images (Repo 2) This repository is another image collection of images of a stuffed sloth that I am fine tuning for a custom image generation model for this particular character. If anyone else is interested in fine tuning for this character or fine tuning for character avatars generally or wants to use this small image set as training data for another project then .... use is granted in accordance iwth the license terms (the sloth didn't quite understand… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Corn-Training-Set.

AI ExperimentsUtility Tools

4/28/2025

Hugging Face

Chatgpt AI Vs API

Dataset

No description provided

AI ExperimentsUtility Tools

4/22/2025

Hugging Face

Shakespearean Text Transformation Prompts

Dataset

Shakespeare GPT (Shakespearean Text Generation Prompts) Welcome to what might be the internet's largest collection of prompts for rewriting text in Shakespearean English! This repository contains a variety of prompts designed to transform modern text into the style of Shakespeare, organized by format and purpose. These prompts can be used with any AI tool that accepts custom instructions. A user interface may be forthcoming for those who feel the need to do this regularly.… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Shakespearean-Text-Transformation-Prompts.

AI ExperimentsUtility ToolsApps & GUIsContent & Blogs

4/21/2025

Hugging Face

Corn The Sloth

Dataset

Keyframe Photos Of ..... A Sloth Plushie This repository contains a small collection of images featuring an adorable plush sloth named Cornelius. These images are part of ongoing experiments conducted in my spare time, aimed at testing the capabilities of photogrammetry tools and digital avatar creation for non-human subjects. If anyone needs a small repository of images for various purposes, such as distinguishing plushies from real animals or humans, you are welcome to utilize… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Corn-The-Sloth.

AI ExperimentsUtility ToolsApps & GUIs

4/18/2025

Hugging Face

Text To Image Test Prompts

Dataset

Text To Image Test Prompt Library A comprehensive collection of evaluation prompts for testing text-to-image AI models across diverse parameters and use cases. Overview This repository contains a structured set of test prompts designed to evaluate the capabilities of text-to-image generation models. Rather than focusing on formal evaluation metrics, these prompts are intended for end users who want to test how well a model might perform for their specific use cases.… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Text-To-Image-Test-Prompts.

4/11/2025

Hugging Face

General Purpose System Prompts

Dataset

🤖 Just A Few ... "General" System Prompts Here is a quandary that those who work with LLMs via API through self-hosted chat interfaces (etc) are familiar with: Without any system prompt at all (at least one visible to the user), the default model behavior feels a little bit flat and lifeless. With a deterministic system prompt, a model effectively becomes an "assistant" (and with context and API actions, a full-fledged agent). I haven't found a word yet for the kind of light… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/General-Purpose-System-Prompts.

AI ExperimentsAgent OrchestrationUtility ToolsApps & GUIs

4/9/2025

Hugging Face

Prompt Eng System Prompts

Dataset

Prompt Engineering System Prompts A curated collection of system prompts designed to assist with prompt engineering activities across various AI platforms and use cases. Last updated: April 6, 2025 Note: This is an ongoing collection. New system prompts are continuously being added to the library. Feel free to check back for updates. Repository Purpose This repository serves as a comprehensive resource for prompt engineers, AI enthusiasts, and developers who want to:… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Prompt-Eng-System-Prompts.

AI ExperimentsUtility Tools

4/9/2025

Hugging Face

Email Management System Prompts

Dataset

Email Management System Prompts April 06 2025 A collection of system prompts designed to enhance email productivity, communication, and management. These prompts can be used with various AI assistants to automate and improve email-related tasks. Categories Email Composition Email Template Generator - Creates customizable email templates for various purposes Email Rewriter - Reformats and improves existing email drafts Email Signature Generator - Creates… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Email-Management-System-Prompts.

4/9/2025

Hugging Face

Data Utils System Prompts

Dataset

Data Utilities System Prompts This repository contains a collection of system prompts for configuring AI assistance in data-related tasks. These prompts can be used to set up AI assistants for various data operations, analysis, and management tasks. Categories Data Conversion Tools for converting data between different formats (CSV, JSON, natural language, etc.) Database Helpers Assistants for working with different databases (MongoDB, Neo4j… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Data-Utils-System-Prompts.

AI ExperimentsUtility Tools

4/9/2025

Hugging Face

Geopolitical System Prompts

Dataset

Geopolitical Analysis System Prompts A collection of system prompts for AI assistants specialized in geopolitical analysis. These prompts enable AI systems to provide structured analysis, reporting, and insights across various aspects of international relations and geopolitical developments. Repository Structure The prompts are organized into the following categories: regional-analysis/ Specialized prompts for analyzing specific regions and generating… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Geopolitical-System-Prompts.

AI ExperimentsUtility Tools

4/9/2025

Hugging Face

Career Related System Prompts

Dataset

Career Utilities System Prompts A collection of system prompts for AI assistants focused on career guidance and job search assistance. These prompts are designed to help users navigate various aspects of career development, from resume writing to job searching and professional networking. Repository Structure The prompts are organized into the following categories: career-exploration/: Prompts for exploring career paths, understanding industry trends, and making career… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Career-Related-System-Prompts.

AI ExperimentsUtility ToolsApps & GUIsContent & Blogs

4/9/2025

Hugging Face

Speech To Text System Prompts 2

Dataset

Speech To Text System Prompt Library This repository provides a collection of system prompts designed to transform and refine text captured using speech-to-text technologies. By passing STT outputs through large language models with these specialized prompts, you can achieve cleaner, more structured, and purpose-specific text formats. 📋 The Idea Here is the basic implementation. I don't pretend that this is the stuff of high AI engineering. But it does create quite… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Speech-To-Text-System-Prompts-2.

AI ExperimentsUtility Tools

4/9/2025

Hugging Face

Single Prompt Book

Dataset

Can AI Write A Book In Just One Prompt? April 09, 2025 The pace of development in AI these days is so fast that it's hard to keep on top of all the latest developments. I've always found it interesting that among all the hotly debated parameters discussed in the most recent SOTA models, the question of how many tokens a model can generate in one continuous output (max output tokmens) seems to be very little discussed. This metric exists independent of the maximum input tokens and… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Single-Prompt-Book.

AI ExperimentsUtility Tools

4/9/2025

Hugging Face

Writing System Prompts

Dataset

Writing-Related System Prompt Collection This is a collection of system prompts derived from my larger collection of system prompts. The commonality here is that these system prompts are intended for assistance related to writing, specifically text reformatting, editing, proofing. This is a partial collection that will continue to hopefully evolve and grow over time. The system prompts are organised into folders representing a common purpose and within each folder each system… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Writing-System-Prompts.

3/25/2025

Hugging Face

GHG Emissions Data

Dataset

GHG Emissions Data Pipeline Description This repository contains a comprehensive pipeline for processing and analyzing greenhouse gas (GHG) emissions data. The pipeline integrates datasets from multiple sources, including Climate TRACE and Our World in Data, to provide insights into global emissions trends. It supports sustainability reporting, emissions tracking, and climate action planning. Dataset Details Sources and Methodologies The pipeline… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/GHG-Emissions-Data.

AI ExperimentsEnvironmental & DataUtility Tools

12/20/2024

Hugging Face

Ifvi Valuefactors Deriv

Dataset

⚠️ DEPRECATED - Dataset Superseded by V2 This refactored IFVI value factor dataset has been supplanted by V2. This dataset tracked the V2 of the IFVI release that was updated in March 2025. The V2 of the refactored analytical dataset tracking the GVFD was released on August 20th, 2025 and is now available at: 🔗 IFVI Global Value Factors Dataset V2 Please use the V2 dataset for all new projects and analysis.

AI Experiments

12/5/2024

Hugging Face

Visit Hugging Face

For the most up-to-date collection of datasets, visit my Hugging Face profile

View Hugging Face Profile