---
title: "Human-AI Interaction in Spatial Environments"
description: "A research proposal for studying conversational dynamics between humans and AI agents in a 2D multiplayer setting"
author: "Eric Zou"
date: "12/11/2025"
categories:
  - Agents
  - Models
  - Experiments
  - Research
---


# Human-AI Interaction in Spatial Environments: A Research Proposal

## Abstract

This proposal outlines a study of human-AI interaction dynamics in a 2D spatial multiplayer environment. In a controlled setting, participants interact with both human peers and AI agents without knowing which is which. We aim to produce a dataset of conversations grounded in spatial context and to answer questions about how AI agents influence, and are influenced by, human communication patterns in situated settings.

---

## Research Questions & Investigative Goals

### Primary Research Questions

1. **Detection & Perception**: Can humans reliably distinguish between AI agents and human participants in a spatial conversational setting? What behavioral or linguistic cues do they use?

2. **Influence Dynamics**: How does the presence of AI agents (at varying ratios) affect human communication patterns, including:
   - Conversational topics and depth
   - Turn-taking and response patterns
   - Spatial movement and coordination behaviors
   - Social dynamics and group formation

3. **AI Adaptation**: How do AI agents adapt their behavior when interacting with humans versus other agents? Do they converge toward human-like patterns or maintain distinct characteristics?

4. **Dataset Quality**: What makes this spatial conversational dataset unique compared to traditional chat datasets, and how can it be used for training future LLMs?

### Secondary Investigative Goals

- **Baseline Comparison**: Establish baseline communication patterns for pure human-human and pure AI-AI interactions
- **Model Benchmarking**: Create a standardized arena for evaluating different AI models' conversational capabilities
- **Longitudinal Effects**: Understand how interaction patterns evolve over time in mixed human-AI environments
- **Spatial Behavior**: Investigate how spatial proximity and movement influence conversational dynamics

---

## Study Design

### Overview

The study employs a multi-condition experimental design with approximately 100 participants interacting in a 2D spatial environment. Participants will be randomly assigned to different experimental conditions, and interactions will be recorded for analysis.
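Random assignment can be made reproducible by seeding the shuffle. The following is a minimal sketch rather than the study's actual procedure: the function name `assign_conditions`, the condition labels, and the choice of near-equal group sizes are all assumptions made for illustration.

```python
import random


def assign_conditions(participant_ids, conditions, seed):
    """Shuffle participants with a fixed seed, then deal them round-robin.

    Assumption: groups should be as equal in size as possible. The proposal
    does not state how many participants each condition receives.
    """
    rng = random.Random(seed)  # local RNG, so global random state is untouched
    shuffled = list(participant_ids)
    rng.shuffle(shuffled)
    assignment = {condition: [] for condition in conditions}
    for index, pid in enumerate(shuffled):
        assignment[conditions[index % len(conditions)]].append(pid)
    return assignment


# Hypothetical labels for the three experimental conditions.
CONDITIONS = ["mixed_arena", "segregated_rooms", "variable_ratio"]
participants = [f"P{n:03d}" for n in range(1, 101)]  # roughly 100 participants
groups = assign_conditions(participants, CONDITIONS, seed=2025)
print({name: len(members) for name, members in groups.items()})
```

Recording the seed alongside the session logs would let the assignment be audited or regenerated later.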

### Experimental Conditions

#### Condition 1: Mixed Arena (Baseline)
- **Composition**: ~50% human participants, ~50% AI agents
- **Purpose**: Establish baseline mixed-interaction patterns
- **Duration**: 2-hour sessions, multiple sessions over 2 weeks
- **Design Rationale**: Creates a "mosh pit" environment where natural interactions emerge without strict control

#### Condition 2: Segregated Rooms (Control)
- **Composition**: 
  - Room A: 100% human participants
  - Room B: 100% AI agents
- **Purpose**: Generate pure datasets for comparison and training
- **Duration**: 1-hour sessions
- **Design Rationale**: Provides clean baselines for human-only and AI-only communication patterns

#### Condition 3: Variable Ratio Mixes (Systematic Variation)
- **Compositions**: 
  - 10% AI / 90% Human
  - 25% AI / 75% Human
  - 50% AI / 50% Human (overlaps with Condition 1)
  - 75% AI / 25% Human
  - 90% AI / 10% Human
- **Purpose**: Systematically investigate how AI presence ratio affects conversational dynamics
- **Duration**: 1.5-hour sessions per ratio
- **Design Rationale**: Enables quantitative analysis of AI influence thresholds
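
The ratios above imply agent counts that grow quickly at the high-AI end. As a worked sketch, assume the number of humans in a session is fixed first and agents are added to reach the target share (the proposal does not say which side is fixed). The agent count is then `humans * r / (1 - r)`, rounded to a whole agent. The helper name `agents_needed` is hypothetical.

```python
from fractions import Fraction


def agents_needed(num_humans: int, ai_fraction: Fraction) -> int:
    """AI agents required so that agents / (agents + humans) is about ai_fraction."""
    if not 0 <= ai_fraction < 1:
        raise ValueError("ai_fraction must be in [0, 1)")
    # Exact rational arithmetic avoids float rounding surprises such as 0.1 / 0.9.
    return round(num_humans * ai_fraction / (1 - ai_fraction))


humans_per_session = 10  # assumption, chosen only to make the arithmetic concrete
for percent in (10, 25, 50, 75, 90):
    agents = agents_needed(humans_per_session, Fraction(percent, 100))
    achieved = agents / (agents + humans_per_session)
    print(f"target {percent}% AI: {agents} agents, achieved share {achieved:.0%}")
```

With 10 humans, the 90% condition requires 90 agents, so session size is dominated by the high-ratio conditions. Small human groups also cannot hit the low ratios exactly, which is worth recording as the achieved share rather than the target.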

---

## Methodology

### Participant Recruitment

**Target Sample**: ~100 participants
- **Recruitment Method**: Online platforms, university participant pools, social media
- **Inclusion Criteria**: 
  - Age 18+
  - English proficiency
  - Access to stable internet connection
  - Willingness to commit 2-4 hours of total session time
- **Compensation**: Monetary compensation or course credit
- **Ethical Considerations**: Informed consent (with AI presence disclosed at debriefing under the IRB-approved deception protocol), IRB approval, data anonymization

### AI Agent Configuration

**Model Selection**: Multiple LLM backends (GPT-4, Claude, Gemini) with consistent prompting
- **Persona Variability**: Each AI agent assigned distinct personality traits, goals, and communication styles
- **Consistency**: Same agent maintains consistent identity across sessions
- **Transparency**: Post-study debriefing will reveal which participants were AI agents

### Environment Setup

**Spatial Configuration**: 20x20 grid world (or similar scalable environment)
- **Features**: 
  - Visible agent positions
  - Proximity-based communication (with optional global chat)
  - Movement capabilities
  - Optional visual rendering for participants
- **Technical Infrastructure**: 
  - Real-time synchronization
  - Complete interaction logging (messages, movements, timestamps)
  - Session recording and replay capabilities
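
Proximity-based communication can be sketched as a distance check on the grid. This is an illustrative sketch only: the hearing radius, the use of Chebyshev distance (diagonal steps count as one), and the names `Entity` and `recipients` are assumptions, since the proposal does not specify them.

```python
from dataclasses import dataclass

GRID_SIZE = 20      # from the proposed 20x20 grid world
HEARING_RADIUS = 3  # assumption: the proposal does not give a radius


@dataclass(frozen=True)
class Entity:
    entity_id: str
    x: int
    y: int


def chebyshev(a: Entity, b: Entity) -> int:
    """Grid distance where a diagonal step costs the same as a straight one."""
    return max(abs(a.x - b.x), abs(a.y - b.y))


def recipients(speaker: Entity, entities: list, radius: int = HEARING_RADIUS) -> list:
    """IDs of everyone other than the speaker who is within hearing range."""
    return [
        e.entity_id
        for e in entities
        if e.entity_id != speaker.entity_id and chebyshev(speaker, e) <= radius
    ]


alice = Entity("alice", 0, 0)
room = [alice, Entity("bob", 2, 3), Entity("carol", 4, 0)]
print(recipients(alice, room))
```

Here `bob` is at distance 3 and hears the message, while `carol` at distance 4 does not. Logging the recipient list with each message would make proximity relationships recoverable without replaying positions.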

---

## Data Collection Strategy

### Data Types Collected

1. **Conversational Data**
   - All messages with timestamps
   - Speaker identification (human vs. AI, anonymized)
   - Message metadata (length, response time, etc.)

2. **Spatial Data**
   - Agent positions over time
   - Movement patterns
   - Proximity relationships

3. **Behavioral Data**
   - Turn-taking patterns
   - Topic shifts and conversation threads
   - Group formation and dissolution

4. **Post-Session Data**
   - Participant surveys (perceived human/AI identification)
   - Self-reported experience and engagement
   - Debriefing responses

### Data Management

- **Storage**: Secure, encrypted storage with access controls
- **Anonymization**: Participant identifiers removed, AI agents labeled
- **Format**: Structured JSON logs with standardized schema
- **Retention**: Long-term storage for research and potential public release (with consent)
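
One possible shape for a structured log record is sketched below, written as one JSON object per line. Every field name here is a hypothetical choice rather than the study's schema; the intent is only to show messages and movements sharing a single record type with an anonymized speaker label.

```python
import json
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass(frozen=True)
class LogEvent:
    session_id: str
    timestamp_ms: int            # milliseconds since session start
    entity_id: str               # anonymized identifier, never a real name
    entity_type: str             # "human" or "ai"
    event_type: str              # "message" or "move"
    x: int                       # grid position when the event occurred
    y: int
    text: Optional[str] = None   # message body; None for movement events

    def __post_init__(self):
        if self.entity_type not in {"human", "ai"}:
            raise ValueError(f"unknown entity_type: {self.entity_type!r}")
        if self.event_type not in {"message", "move"}:
            raise ValueError(f"unknown event_type: {self.event_type!r}")


def to_json_line(event: LogEvent) -> str:
    """Serialize one event as a single JSON line with stable key order."""
    return json.dumps(asdict(event), sort_keys=True)


print(to_json_line(LogEvent("s01", 1500, "e07", "ai", "message", 3, 4, "hello")))
print(to_json_line(LogEvent("s01", 2100, "e07", "ai", "move", 4, 4)))
```

Keeping position on every record means conversational and spatial data can be joined from a single file.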

---

## Analysis Plan

### Quantitative Analysis

1. **Detection Accuracy**: Statistical analysis of human ability to identify AI agents
2. **Linguistic Metrics**: 
   - Message length, vocabulary diversity, sentiment analysis
   - Comparison across conditions and ratios
3. **Spatial Metrics**:
   - Movement patterns, clustering, proximity-based interactions
4. **Temporal Analysis**:
   - Conversation evolution over time
   - Adaptation patterns
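
For the detection-accuracy analysis, one simple option is an exact one-sided binomial test of whether identification judgments beat chance. This is a sketch of one possible test, not a committed analysis plan. It treats judgments as independent, which repeated judgments from the same participant would violate, and the default chance level of 0.5 only fits the 50/50 condition.

```python
from math import comb


def binomial_p_value(correct: int, total: int, chance: float = 0.5) -> float:
    """One-sided P(X >= correct) for X ~ Binomial(total, chance)."""
    if not 0 <= correct <= total:
        raise ValueError("correct must be between 0 and total")
    return sum(
        comb(total, k) * chance**k * (1 - chance) ** (total - k)
        for k in range(correct, total + 1)
    )


# Hypothetical counts for illustration only; these are not study results.
correct, total = 62, 100
p_value = binomial_p_value(correct, total)
print(f"accuracy={correct / total:.2f}, one-sided p={p_value:.4f}")
```

In the unbalanced ratio conditions, a participant who always guesses the majority class scores above 50%, so `chance` would need to be set to the majority share (or a per-class measure used instead).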

### Qualitative Analysis

1. **Conversation Quality**: Thematic analysis of topics and depth
2. **Social Dynamics**: Emergent group behaviors and coordination
3. **Case Studies**: Detailed analysis of interesting interaction patterns

### Comparative Analysis

- Human-only vs. AI-only vs. Mixed conditions
- Effects of varying AI ratios
- Model-specific differences across the LLM backends

---

## Responsibilities & Roles

### Research Team Structure

#### Principal Investigator
- **Responsibilities**: Overall study design, IRB approval, final analysis oversight
- **Key Decisions**: Research questions, experimental conditions, publication strategy

#### Technical Lead
- **Responsibilities**: Environment development, data collection infrastructure, technical troubleshooting
- **Key Decisions**: Platform architecture, logging systems, real-time performance

#### Data Collection Coordinator
- **Responsibilities**: Participant recruitment, session scheduling, real-time monitoring
- **Key Decisions**: Recruitment strategy, session logistics, participant support

#### Data Analyst
- **Responsibilities**: Data cleaning, statistical analysis, visualization
- **Key Decisions**: Analysis methods, metric selection, interpretation frameworks

#### AI Agent Designer
- **Responsibilities**: AI agent configuration, persona design, prompt engineering
- **Key Decisions**: Agent diversity, behavioral parameters, consistency protocols

### Participant Responsibilities

- Active participation in assigned sessions
- Honest engagement without attempting to "game" the system
- Post-session survey completion
- Respectful interaction with other participants

---

## Ethical Considerations

### Informed Consent
- Clear explanation of study purpose (without revealing AI presence initially)
- Post-study debriefing revealing full study design
- Right to withdraw at any time

### Privacy & Anonymization
- All participant data anonymized
- No collection of personally identifiable information beyond necessary demographics
- Secure data storage and access controls

### Deception Management
- Temporary deception about AI presence is necessary for research validity
- Full debriefing after study completion
- IRB approval for deception protocol

### AI Agent Transparency
- Post-study disclosure of which participants were AI
- Discussion of implications and participant reactions

---

## Expected Outcomes & Impact

### Research Contributions

1. **Novel Dataset**: To our knowledge, one of the first datasets of human-AI interactions in spatial environments
2. **Behavioral Insights**: Understanding of how AI presence affects human communication
3. **Methodological Framework**: Reusable experimental design for future human-AI interaction studies
4. **Model Evaluation**: Benchmarking framework for conversational AI in situated contexts

### Practical Applications

1. **LLM Training**: Dataset for training future models on spatial conversational understanding
2. **AI Development**: Insights for improving AI agents' social and conversational capabilities
3. **Human-AI Collaboration**: Better design of mixed human-AI systems

### Long-term Vision

- **Open Arena**: Long-running platform where researchers can test new AI models
- **Community Resource**: Public dataset for the research community
- **Iterative Improvement**: Framework for continuous evaluation and refinement

---

## Timeline & Milestones

### Phase 1: Preparation (Months 1-2)
- Finalize study design and IRB approval
- Develop technical infrastructure
- Recruit initial participant pool
- Configure AI agents

### Phase 2: Pilot Study (Month 3)
- Small-scale pilot (10-20 participants)
- Refine protocols and technical systems
- Initial data analysis and methodology validation

### Phase 3: Main Data Collection (Months 4-6)
- Full participant recruitment
- Execute all experimental conditions
- Ongoing data quality monitoring

### Phase 4: Analysis & Publication (Months 7-9)
- Data cleaning and analysis
- Paper writing and submission
- Dataset preparation for public release

---

## Success Metrics

- **Data Quality**: Complete interaction logs from all conditions with <5% data loss
- **Participant Engagement**: >80% session completion rate
- **Research Output**: 1-2 peer-reviewed publications
- **Dataset Release**: Publicly available dataset with documentation
- **Community Impact**: Dataset used by at least 3 other research groups within 1 year

---

## Conclusion

This study represents a unique opportunity to understand human-AI interaction in situated, spatial contexts. By systematically varying conditions and collecting comprehensive data, we can generate both valuable insights and a novel dataset that advances the field of conversational AI and human-AI collaboration.