A research project combining Compressed Trie data structures, BERT-based semantic analysis, and Google Gemini AI to create an intelligent educational story generation system for elementary school children learning English.
- Project Overview
- Backend Story Generation Architecture
- Android Application
- Getting Started
- Performance Metrics
- Research Contributions
- Project Structure
- Development Status
This repository contains a comprehensive educational platform that generates contextually relevant, fill-in-the-blank stories for children aged 6-10. The system uniquely combines three key technologies:
- Compressed Tries - Efficient storage and retrieval of 5000+ vocabulary words
- BERT Neural Networks - Semantic word ranking for contextual relevance
- Google Gemini AI - Dynamic story generation with educational content
The platform includes both an Android mobile application and a command-line backend prototype that demonstrates the core story generation architecture.
```mermaid
graph TB
    A[Story Context] --> B[Compressed Trie]
    B --> C[Unused Words Filter]
    C --> D[BERT Model]
    D --> E[Top Relevant Words]
    E --> F[Gemini API]
    F --> G[Generated Story Template]
    G --> H[Word Placement Algorithm]
    H --> I[Final Story with Blanks]
```
- Memory Efficient: Stores 5K+ words with shared prefixes
- Fast Retrieval: O(m) search complexity where m = word length
- Auto-completion: Levenshtein distance-based suggestions
- Multiple Definitions: Supports words with multiple meanings
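To make these properties concrete, here is a minimal Python sketch of compressed-trie insertion and O(m) lookup. The repository's real implementation is the Kotlin `compressed_tries.kt`; the class and method names below are illustrative stand-ins, and the Levenshtein-based auto-completion is omitted for brevity.

```python
class Node:
    def __init__(self):
        self.children = {}     # first char of edge label -> (edge label, child Node)
        self.definitions = []  # non-empty only for complete words (multiple meanings allowed)

class CompressedTrie:
    def __init__(self):
        self.root = Node()

    def insert(self, word: str, definition: str) -> None:
        node, i = self.root, 0
        while i < len(word):
            first = word[i]
            if first not in node.children:
                # No shared prefix: one edge holds the entire remaining suffix
                leaf = Node()
                node.children[first] = (word[i:], leaf)
                node = leaf
                break
            label, child = node.children[first]
            # Length of the common prefix between the edge label and the rest of the word
            j = 0
            while j < len(label) and i + j < len(word) and label[j] == word[i + j]:
                j += 1
            if j < len(label):
                # Split the edge where the word diverges from the stored label
                mid = Node()
                mid.children[label[j]] = (label[j:], child)
                node.children[first] = (label[:j], mid)
                child = mid
            node, i = child, i + j
        node.definitions.append(definition)

    def search(self, word: str):
        """O(m) lookup where m = len(word); returns the word's definitions or None."""
        node, i = self.root, 0
        while i < len(word):
            entry = node.children.get(word[i])
            if entry is None:
                return None
            label, child = entry
            if word[i:i + len(label)] != label:
                return None
            node, i = child, i + len(label)
        return node.definitions or None

trie = CompressedTrie()
trie.insert("cat", "a small domesticated feline")
trie.insert("cart", "a wheeled vehicle pulled by hand")
trie.insert("cat", "(informal) a person")  # second meaning for the same word
print(trie.search("cart"))                 # ['a wheeled vehicle pulled by hand']
```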
- Model: `bert-base-uncased` for masked language modeling
- Context-Aware: Analyzes story context to predict relevant words
- Semantic Scoring: Uses transformer attention for word relevance
- Duplicate Prevention: Ensures no word repetition across story levels
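Conceptually, the ranking step asks BERT how probable each candidate word is at a `[MASK]` position and keeps the top scorers. The sketch below uses the Hugging Face transformers library; it illustrates the idea behind `rank_words.py` rather than reproducing its exact code, and `rank_candidates` is a hypothetical name.

```python
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

def rank_candidates(context: str, candidates: list[str], top_k: int = 5) -> list[tuple[str, float]]:
    """Score each candidate word by BERT's probability at the first [MASK] slot."""
    inputs = tokenizer(context, return_tensors="pt")
    mask_positions = (inputs.input_ids[0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
    with torch.no_grad():
        logits = model(**inputs).logits
    probs = logits[0, mask_positions[0]].softmax(dim=-1)
    scored = []
    for word in candidates:
        ids = tokenizer.convert_tokens_to_ids(tokenizer.tokenize(word))
        # Single-token words score directly; multi-token words get a rough averaged score
        scored.append((word, probs[ids].mean().item()))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_k]

print(rank_candidates(
    "The children walked to the [MASK] to borrow some books.",
    ["library", "banana", "school", "cloud"],
))
```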
- Model: `gemini-2.0-flash-lite` for natural language generation
- Educational Focus: Prompts optimized for grade-level vocabulary
- Contextual Continuity: Maintains story coherence across levels
- Controlled Output: Generates exactly 3 blanks per story segment
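A minimal sketch of the generation call, here using the google-generativeai Python client (the repository drives Gemini from `testing.main.kts`, and the exact prompt wording below is an assumption):

```python
import google.generativeai as genai

genai.configure(api_key="your-gemini-api-key-here")
model = genai.GenerativeModel("gemini-2.0-flash-lite")

context = "Mia found a tiny door behind the bookshelf."
prompt = (
    "Continue this story for children aged 6-10 in three or four simple sentences. "
    "Replace exactly three vocabulary words with the literal token [MASK]. "
    f"Story so far: {context}"
)

response = model.generate_content(prompt)
print(response.text)  # story template containing three [MASK] placeholders
```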
- **Initialization**
  - Load 5K+ words from the Oxford 5000 CSV
  - Initialize the BERT model and tokenizer
  - Set up the Gemini API connection
- **Story Level Generation**

  ```kotlin
  fun playNextLevel() {
      // Everything in the trie minus words used in earlier levels
      val unusedWords = getAllWordsFromTrie() - usedWords
      val story = generateDynamicStory(currentContext, unusedWords)
      displayStory(story)
  }
  ```
- **Dynamic Story Creation Process**
  - Context Analysis: Current story context is passed to BERT
  - Word Filtering: Unused words from the trie are filtered by relevance
  - Template Generation: Gemini creates a story with `[MASK]` placeholders
  - Word Placement: BERT selects the best word for each mask (see the sketch after this list)
  - Story Assembly: Final story with blanks and answers
- **Educational Game Mechanics**

  ```kotlin
  data class Story(
      val text: String,              // Complete story text
      val blankPositions: List<Int>, // Indices of words to blank out
      val answers: List<String>      // Correct answers for the blanks
  )
  ```
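Putting the last two steps together, the sketch below fills each `[MASK]` in a Gemini template with the highest-ranked unused word and records the blank positions and answers. It reuses the hypothetical `rank_candidates` scorer from the BERT sketch above, and the Python `Story` dataclass simply mirrors the Kotlin data class.

```python
from dataclasses import dataclass

@dataclass
class Story:
    text: str                   # complete story text with answers filled in
    blank_positions: list[int]  # word indices to blank out when displaying
    answers: list[str]          # correct answer for each blank

def assemble_story(template: str, unused_words: list[str]) -> Story:
    """Fill each [MASK] in the template with the best unused word (left to right)."""
    words = template.split()
    blank_positions, answers = [], []
    for i, token in enumerate(words):
        if token.strip(".,!?") != "[MASK]":
            continue
        # Earlier masks are already filled, so the current one is the first [MASK]
        # that rank_candidates (see the BERT sketch above) sees in this context.
        context = " ".join(words)
        best_word, _ = rank_candidates(context, unused_words)[0]
        words[i] = token.replace("[MASK]", best_word)  # keep trailing punctuation
        unused_words = [w for w in unused_words if w != best_word]
        blank_positions.append(i)
        answers.append(best_word)
    return Story(" ".join(words), blank_positions, answers)
```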
| Home Screen | Dictionary Lookup | Word Detail | Story World | Story Level Example |
|---|---|---|---|---|
| ![]() | ![]() | ![]() | ![]() | ![]() |
- Interactive Dictionary: Trie-based word lookup with auto-completion
- Story Mode: Dynamically generated fill-in-the-blank stories
- Educational Design: Child-friendly UI with Jetpack Compose
- Offline Capability: Local trie storage for fast performance
- UI: Jetpack Compose
- Database: Room with compressed trie storage
- Architecture: MVVM with Repository pattern
- Background: WorkManager for data processing
**For the Android App**
- Android Studio Arctic Fox or newer
- Kotlin 1.9+
- Android SDK 24+

**For the Backend Architecture**
- Python 3.8+
- Kotlin 1.9+
- transformers library
- torch library
- **Install Python Dependencies**

  ```bash
  pip install torch transformers numpy pandas
  ```

- **Set Up Gemini API**

  ```kotlin
  // Add your Gemini API key to testing.main.kts
  val apiKey = "your-gemini-api-key-here"
  ```

- **Run the Story Generator**

  ```bash
  cd architecture
  kotlin -script testing.main.kts
  ```
- **Clone and Build**

  ```bash
  git clone https://github.com/ahmedsilat44/Tries-Based-Dictionary.git
  cd Tries-Based-Dictionary
  ./gradlew assembleDebug
  ```

- **Install on Device**

  ```bash
  ./gradlew installDebug
  ```
- Scalable: Unlike most AI storytelling systems, our approach integrates classical data structures (compressed tries) with modern transformer models, enabling both memory efficiency and educationally relevant word selection.
- Contextual Learning: Words selected based on semantic relevance
- Engagement: AI-generated content keeps stories fresh and interesting
```
├── app/                            # Android application
│   ├── src/main/java/              # Kotlin source files
│   │   ├── compressed_tries.kt     # Compressed trie implementation
│   │   ├── MainActivity.kt         # Main app entry point
│   │   └── ui/                     # Compose UI components
│   └── build.gradle.kts            # Android build configuration
├── architecture/                   # Backend story generation system
│   ├── testing.main.kts            # Main backend application
│   ├── rank_words.py               # BERT-based word ranking
│   ├── words.txt                   # 5K word dictionary
│   └── The_Oxford_3000.txt         # Curated vocabulary list
├── trie-implement.kts              # Basic trie implementation
├── compressed-trie-implement.kts   # Advanced compressed trie
└── README.md                       # This documentation
```
- Compressed trie implementation with 5K+ words
- BERT-based contextual word ranking
- Gemini AI story template generation
- Android app with Jetpack Compose UI
- End-to-end story generation pipeline
- Educational game mechanics
Although active development has concluded, several potential extensions could further improve the system:
- Output Processing: More robust parsing of Gemini API responses for `[MASK]` tokens (see the sketch after this list)
- Model Fine-tuning: Better alignment of BERT word relevance with educational content
- Prompt Optimization: Refining Gemini prompts for consistent story quality
- Progressive Difficulty: Implementing intentional progression in story complexity
- Multi-language support for international users
- Advanced difficulty progression algorithms
- Teacher dashboard for progress tracking
- Voice narration and audio features
- Multiplayer collaborative story creation
- In-Game Story Generation: Option for learners to dynamically generate brand-new stories during gameplay
- Additional Minigames: Introduce vocabulary puzzles, matching games, and quizzes to boost gamification and long-term engagement
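As a sketch of the first extension above, a more tolerant parser could normalize the variant mask spellings Gemini sometimes emits before the word-placement step runs. The helper names below are hypothetical:

```python
import re

# Matches [MASK] with optional whitespace and any capitalization, e.g. [mask], [ MASK ]
MASK_PATTERN = re.compile(r"\[\s*mask\s*\]", re.IGNORECASE)

def normalize_masks(template: str) -> str:
    """Collapse variant mask spellings into the canonical [MASK] token."""
    return MASK_PATTERN.sub("[MASK]", template)

def count_masks(template: str) -> int:
    """Count placeholders so the pipeline can verify exactly three blanks."""
    return len(MASK_PATTERN.findall(template))

template = "Mia opened the [ mask ] and saw a [MASK] full of [mask]s."
print(normalize_masks(template))  # all three variants become [MASK]
print(count_masks(template))      # 3 -> matches the expected blank count
```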
This project is licensed under the MIT License - see the LICENSE file for details.
If you use this research in your work, please cite:
```bibtex
@misc{tries-dictionary-2025,
  title={Tries-Based Dictionary: AI-Powered Educational Story Generation},
  author={Ahmed Silat and Taha Zahid and Minhaj Ul Hasan and Maria Samad},
  year={2024},
  url={https://github.com/ahmedsilat44/Tries-Based-Dictionary}
}
```



