Zhizhe Liu

I am an incoming PhD student in Accounting at Columbia Business School. My research interests are in disclosure, information processing, and AI in accounting.

I also maintain a collection of pipelines, scripts, and replication packages for capital-market researchers in the Research Toolkits section.

Ph.D. Accounting Columbia UniversityExpected 2031
M.S. Data Science University of Wisconsin-Madison2026
B.B.A. Supply Chain Management Wuhan University2024

Research

How is GenAI transforming information generation, processing, and dissemination in capital markets?

Working Papers

2026

Can AI Do Financial Research? LLM-Guided Hypothesis Discovery in Asset Pricing

with Huan Liu, Miao Liu and Danqing Mei

Updated Version in Preparation

SSRN

2026

Human-AI Collaboration in Corporate Valuation: Experimental Evidence with a Valuation AI Agent

with Huan Liu, Miao Liu and Danqing Mei

Under Review at Journal of Accounting and Economics

SSRN AlphaVal Agent

2025

When Charts Clarify and Mislead: Visual Disclosure in Earnings Call Presentations

with Yang Cao and Miao Liu

Revising for First Resubmit to Journal of Accounting Research

Coming Soon

Conference Papers

2024

Controllable Traffic Simulation through LLM-Guided Hierarchical Chain-of-Thought Reasoning

with Zhiyuan Liu, Leheng Li, Yuning Wang, Haotian Lin, Lei He and Jianqiang Wang

Accepted by IROS 2025

arXiv:2409.15135

Selected Work in Progress

Risk Factor Padding with Kalash Jain and Shivaram Rajgopal
Functional Fixation in AI Analysts Sole-authored
Detecting, Quantifying, and Analyzing Accounting Standards Adoption Effects with Miao Liu and Andrea Tillet
Making Tacit Knowledge Explicit: Recovering Human Judgment in Sell-Side Equity Research with AI with Miao Liu and Danqing Mei

Teaching

Teaching Assistant

Wisconsin School of Business, University of Wisconsin-Madison

Spring 2026 AIS 301/701 Financial Reporting I
Spring 2025 AIS 301/701 Financial Reporting I

Teaching Assistant

Economics and Management School, Wuhan University

Spring 2024 Data Analysis using R: Methods and Applications
Spring 2023 Data Analysis using R: Methods and Applications

Research Toolkits

Open-source pipelines and replication packages for empirical accounting and finance research.

WRDS Event Study (Polars + DuckDB)

High-performance event study framework replicating the WRDS Python app, enhanced with Polars and DuckDB for ~240× acceleration.

Abnormal return models: market-adjusted, CAPM, FF3, Carhart4
Supports CAR, BHAR, abret with flexible event windows
Efficient processing of large-scale samples

GPT-Disclosure Analyzer

Modular pipeline for analyzing SEC filings using OpenAI GPT models via keyword and item extraction.

Automatic extraction of selected SEC items (e.g., Item 1, 7 of 10-K)
Custom keyword filtering (unigrams and bigrams)
Batch annotation via OpenAI Chat API

Image-Text Inference with LLaVA

Multimodal inference tool using LLaVA v1.5 for interpreting image + prompt input.

Supports remote and local image input
Runs efficiently on GPU with float16
Returns vision-language model responses

RavenPack News Processor

Pipeline for processing RavenPack news event data with keyword filtering, accelerated by Polars and DuckDB.

Supports unigram and bigram keyword filtering
Polars + DuckDB for high performance
Outputs .parquet files with compression

FF Industry Parser

Lightweight parser for mapping SIC codes to Fama-French industry categories.

Parses 49-industry (or 12) structure
No external dependencies (pure re module)

Replication of Donelson et al. (2011, TAR)

Replicates and extends Donelson et al. (2011) using Compustat data, validated on 1967–2005 and extended to 2023.

Key financial ratios and rolling volatility metrics
Replicates Table 1 and Table 2
Extends analysis to 2006–2023

Replication of Daniel et al. (1997, JF)

Constructs 125 characteristic-sorted portfolios (5×5×5) using size, book-to-market, and momentum quintiles.

NYSE breakpoints for size, BM, and momentum quintiles
12-2 month momentum window with minimum 8 valid observations
Multiple share class handling

Replication of Call et al. (2023, RAST)

Replicates tone calculation from earnings call transcripts using the Loughran-McDonald sentiment dictionary.

Separates manager (type 4) and analyst (type 3) statements
Loughran-McDonald sentiment scoring
Outputs tone_manager and tone_analyst measures

Zhizhe Liu

Research

Working Papers

Can AI Do Financial Research? LLM-Guided Hypothesis Discovery in Asset Pricing

Human-AI Collaboration in Corporate Valuation: Experimental Evidence with a Valuation AI Agent

When Charts Clarify and Mislead: Visual Disclosure in Earnings Call Presentations

Conference Papers

Controllable Traffic Simulation through LLM-Guided Hierarchical Chain-of-Thought Reasoning

Selected Work in Progress

Teaching

Teaching Assistant

Teaching Assistant

Research Toolkits

WRDS Event Study (Polars + DuckDB)

GPT-Disclosure Analyzer

Image-Text Inference with LLaVA

RavenPack News Processor

FF Industry Parser

Replication of Donelson et al. (2011, TAR)

Replication of Daniel et al. (1997, JF)

Replication of Call et al. (2023, RAST)

Replication of García et al. (2023, JFE)

High-Frequency Event Study TAQ