
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers

Python 3.11 · License: MIT · arXiv

A research framework for evaluating social intelligence in LLM agents through communication barriers.

🌟 Introduction

SocialVeil simulates realistic social interactions in which agents must navigate various communication barriers:

  • 🗣️ Semantic Barriers: Ambiguous language and unclear expressions
  • 🌍 Cultural Barriers: Different communication styles and norms
  • 💭 Emotional Barriers: Emotional states affecting communication clarity

🚀 Quick Start

Installation

# Clone the repository
git clone https://github.com/ulab-uiuc/socialveil.git
cd socialveil

# Create environment
conda create -n socialveil python=3.11
conda activate socialveil

# Install dependencies
pip install poetry
poetry install

# Install yq (for config parsing)
pip install yq  # or: brew install yq (macOS)

Configuration

Edit configs/config.yaml:

models:
  model_a: "gpt-4o-mini"              # Barrier agent
  model_b: "Qwen/Qwen2.5-7B-Instruct" # Partner agent
  vllm_port: 7900
  gpu: "0,1,2,3"

AGENT_OPENAI_API_KEY: "your-key-here"
EVALUATOR_OPENAI_API_KEY: "your-key-here"
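Rather than committing keys to the config file, they can also be resolved from the environment. A minimal sketch of a config-first, environment-fallback policy (the `load_api_keys` helper is hypothetical, not part of the repo):

```python
import os

def load_api_keys(config: dict) -> dict:
    """Resolve each API key from the config, falling back to the environment."""
    keys = {}
    for name in ("AGENT_OPENAI_API_KEY", "EVALUATOR_OPENAI_API_KEY"):
        # Prefer an explicit config value; otherwise consult the environment.
        keys[name] = config.get(name) or os.environ.get(name, "")
    return keys

config = {"AGENT_OPENAI_API_KEY": "your-key-here"}
os.environ.setdefault("EVALUATOR_OPENAI_API_KEY", "sk-from-env")
print(load_api_keys(config))
```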

Running Experiments

# Start vLLM server (for local models)
bash scripts/start_vllm_server.sh

# Run evaluation
bash scripts/run.sh

# With custom settings
CONCURRENCY=16 bash scripts/run.sh
PARTNER_REPAIR_MODE=true bash scripts/run.sh
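These overrides follow the usual environment-variable pattern: unset variables fall back to defaults. A sketch of how a runner might read them, in Python (the default values here are illustrative, not taken from scripts/run.sh):

```python
def read_overrides(env: dict) -> dict:
    """Read experiment overrides with fallback defaults.
    The defaults (8, false) are illustrative, not the repo's actual values."""
    return {
        "concurrency": int(env.get("CONCURRENCY", "8")),
        "repair_mode": env.get("PARTNER_REPAIR_MODE", "false").lower() == "true",
    }

print(read_overrides({"CONCURRENCY": "16"}))         # concurrency 16, repair off
print(read_overrides({"PARTNER_REPAIR_MODE": "true"}))  # concurrency 8, repair on
```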

Analyze Results

python results/compare_modes.py \
    --base_dir results/exp_qwen2.5-7b-instruct_episode_all_neutralized \
    --out_csv results/comparison.csv
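The resulting CSV can be post-processed with the standard library alone. A hypothetical sketch of aggregating per-mode scores (the column names `episode`, `mode`, `goal_score` are illustrative, not the script's actual schema):

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Hypothetical per-episode scores of the kind compare_modes.py might emit.
csv_text = """episode,mode,goal_score
ep_001,baseline,6.0
ep_001,repair,7.5
ep_002,baseline,5.5
ep_002,repair,7.0
"""

by_mode = defaultdict(list)
for row in csv.DictReader(io.StringIO(csv_text)):
    by_mode[row["mode"]].append(float(row["goal_score"]))

for mode, scores in sorted(by_mode.items()):
    print(f"{mode}: mean goal_score = {mean(scores):.2f}")
```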

📁 Project Structure

socialveil/
├── configs/          # Configuration files
├── data/             # Episode datasets
├── scripts/          # Experiment runners
├── socialveil/       # Core package
│   ├── agent/        # Agent implementations
│   ├── environment/  # Scenario management
│   └── evaluate.py   # Evaluation logic
├── results/          # Experiment outputs
└── analysis/         # Analysis tools

🛠️ Advanced Usage

Custom Experiment Settings

# High concurrency
CONCURRENCY=32 bash scripts/run.sh

# Chain-of-Thought prompting
PARTNER_COT_MODE=true bash scripts/run.sh

# Custom results directory
RESULTS_DIR="results/my_exp" bash scripts/run.sh

Compare Different Evaluators

CONCURRENCY=32 python analysis/compare_evaluators.py \
    --results_dir results/exp_qwen2.5-7b-instruct_episode_all_neutralized \
    --evaluator1 gpt-4o \
    --evaluator2 qwen2.5-7b-instruct \
    --use_vllm_for_evaluator2 \
    --output results/evaluator_comparison.csv

📊 Key Features

  • Multi-Barrier Evaluation: Test agents across semantic, cultural, and emotional barriers
  • Flexible Model Support: OpenAI API or local models via vLLM
  • High Concurrency: Parallel scenario execution for faster experiments
  • Statistical Analysis: Built-in significance testing and correlation analysis
  • Extensible Framework: Easy to add new barrier types or evaluation metrics
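As one concrete instance of the significance testing mentioned above, a paired permutation test can be sketched in pure Python (a generic illustration with made-up scores, not the framework's built-in implementation):

```python
import random
from statistics import mean

def paired_permutation_test(a, b, n_permutations=10000, seed=0):
    """Two-sided paired permutation test on the mean difference.

    Randomly flips the sign of each paired difference and counts how often
    the permuted mean difference is at least as extreme as the observed one.
    """
    rng = random.Random(seed)
    diffs = [x - y for x, y in zip(a, b)]
    observed = abs(mean(diffs))
    hits = 0
    for _ in range(n_permutations):
        permuted = [d if rng.random() < 0.5 else -d for d in diffs]
        if abs(mean(permuted)) >= observed:
            hits += 1
    return hits / n_permutations

# Hypothetical per-episode scores under two conditions.
with_repair = [7.5, 7.0, 8.0, 6.5, 7.2, 7.8]
baseline = [6.0, 5.5, 7.1, 5.9, 6.4, 6.8]
print(f"p = {paired_permutation_test(with_repair, baseline):.4f}")
```

With only six paired episodes, the smallest attainable two-sided p-value is 2/64, so small samples bound how much significance such a test can report.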

🔧 Development

Running Tests

poetry run pytest
poetry run mypy --config-file pyproject.toml .

Code Style

# Install pre-commit hooks
pre-commit install

# Run all checks
pre-commit run --all-files

📝 Citation

If you use this code in your research, please cite:

@article{xuan2026socialveil,
  title={SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers},
  author={Xuan, Keyang and Wang, Pengda and Ye, Chongrui and Yu, Haofei and August, Tal and You, Jiaxuan},
  journal={arXiv preprint arXiv:2602.05115},
  year={2026}
}

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🤝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.
