Replication Data for the paper ‘Authenticity and personal signals increase hiring chances in online labor markets: large-scale analysis of online impression management’ (hdl:21.15109/ARP/0XAKYA)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Replication Data for the paper ‘Authenticity and personal signals increase hiring chances in online labor markets: large-scale analysis of online impression management’

Identification Number:

hdl:21.15109/ARP/0XAKYA

Distributor:

ARP

Date of Distribution:

2026-03-30

Version:

1

Bibliographic Citation:

Ilyés, Virág; O’Szabo, Rebeka; Császár, Olga; László, Lőrincz, 2026, "Replication Data for the paper ‘Authenticity and personal signals increase hiring chances in online labor markets: large-scale analysis of online impression management’", https://hdl.handle.net/21.15109/ARP/0XAKYA, ARP, V1

Study Description

Citation

Title:

Replication Data for the paper ‘Authenticity and personal signals increase hiring chances in online labor markets: large-scale analysis of online impression management’

Identification Number:

hdl:21.15109/ARP/0XAKYA

Authoring Entity:

Ilyés, Virág (HUN-REN Centre for Economic and Regional Studies)

O’Szabo, Rebeka (Corvinus University of Budapest)

Császár, Olga (Corvinus University of Budapest)

László, Lőrincz (Corvinus University of Budapest)

Producer:

Ilyés, Virág

Distributor:

ARP

Access Authority:

Ilyés, Virág

Depositor:

Ilyés, Virág

Date of Deposit:

2026-03-25

Holdings Information:

https://hdl.handle.net/21.15109/ARP/0XAKYA

Study Scope

Keywords:

Social Sciences, online labor markets, hiring outcomes, self-presentation, impression management, job application, skills, text analysis

Abstract:

The dataset was collected from Freelancer.com to examine how online impression management shapes hiring outcomes in gig-economy environments. It combines detailed information from project pages and freelancer profile pages to construct a linked project–applicant–level dataset. The data include rich information on project requirements, applicant bids, skills, reputation, and demographic proxies, as well as extensive textual features derived from application messages and personal introductions. These textual measures capture multiple dimensions of self-presentation, including structure, readability, formality, politeness strategies, sentiment, emotional tone, and persuasive language, alongside indicators of authenticity such as text uniqueness and introduction style. The primary goal of data collection is to provide large-scale empirical evidence on whether and how online self-presentation—beyond traditional “hard” signals like skills and experience—affects hiring decisions, and to identify which impression management strategies are rewarded or penalized across different types of work in online labor markets.

Methodology and Processing

Sources Statement

Data Access

Confidentiality Declaration:

The dataset does not contain personal or confidential information to the best of the authors’ knowledge.

Special Permissions:

No special permissions are required beyond compliance with the stated terms of use.

Restrictions:

Use of the dataset is limited to research purposes. Commercial use is not permitted without prior permission from the data provider.

Citation Requirement:

Users must cite the dataset in any publication or output using it. Proper citation should include the dataset title, authors, year, and repository.

Deposit Requirement:

Users must not misrepresent the data or its source.

Conditions:

By using this dataset, users agree to comply with these terms and conditions.

Disclaimer:

The dataset is provided "as is" without warranty of any kind. The authors are not responsible for any errors or for any consequences arising from the use of the data.

Notes:

This dataset is made available for research purposes only. Users are free to use, share, and adapt the data for non-commercial research activities, provided that proper citation is given.

Contact for Access: ilyes.virag@krtk.elte.hu ilyesvirag@gmail.com Terms of Access for Restricted Files: Access to the data is granted for research purposes upon request. Users must agree to cite the dataset appropriately.

Other Study Description Materials

Other Study-Related Materials

Label:

codebook.xlsx

Text:

Variable-level documentation with names, descriptions, and variable types

Notes:

application/vnd.openxmlformats-officedocument.spreadsheetml.sheet

Other Study-Related Materials

Label:

custom_dictionaries.docx

Text:

Detailed description of the custom dictionaries used for feature extraction

Notes:

application/vnd.openxmlformats-officedocument.wordprocessingml.document

Other Study-Related Materials

Label:

data_full.csv

Text:

Cleaned dataset used for the analysis, including all variables described in the codebook

Notes:

text/csv

Other Study-Related Materials

Label:

documentation.docx

Text:

Comprehensive documentation covering data collection, methodology, and dataset structure

Notes:

application/vnd.openxmlformats-officedocument.wordprocessingml.document