<p><br></p><p dir="ltr">The Find an Expert (FaE) resource provides a comprehensive collection of data describing academic expertise and real-world search interactions at the University of Melbourne. It comprises the following components:</p><h2><b>Profile Data</b></h2><p dir="ltr">A set of 8,984 academic profiles containing rich contextual information, including:</p><ul><li>Biographical descriptions</li><li>Research interests</li><li>Academic positions and appointments</li><li>Authored publications</li><li>Externally funded research projects</li></ul><p dir="ltr">Each profile is represented as a structured JSON record with multiple nested fields (see schema section below).<br>The collection captures 474,576 unique publications and 25,064 unique projects, providing extensive coverage of scholarly activity.</p><h4><b>Profile-level statistics</b></h4><ul><li><b>Publications per profile:</b> min = 0 | median = 33 | mean = 80 | max = 2,250</li><li><b>Projects per profile:</b> min = 0 | median = 1 | mean = 4 | max = 188</li><li><b>Profile size (kB):</b> min = 2 | median = 77 | mean = 199 | max = 5,573</li></ul><h4><b>Field-level characteristics (in characters)</b></h4><ul><li><b>preferred_name:</b> min = 4 | median = 13 | mean = 13.4 | max = 46</li><li><b>title:</b> min = 2 | median = 2 | mean = 3.0 | max = 8</li><li><b>bio:</b> min = 18 | median = 1,233 | mean = 1,396 | max = 3,998</li><li><b>primary_interest:</b> min = 1 | median = 23 | mean = 30.6 | max = 225</li><li><b>publication_title:</b> min = 1 | median = 95 | mean = 98.3 | max = 1,384</li><li><b>project_name:</b> min = 1 | median = 31 | mean = 40.3 | max = 255</li></ul><h4><b>Availability indicators</b></h4><ul><li><b>supervisor_avail:</b> 31.9%</li><li><b>industry_avail:</b> 7.0%</li><li><b>media_avail:</b> 6.8%</li><li><b>ext_news_source_agree:</b> 1.9%</li></ul><p dir="ltr">These statistics illustrate the diversity and completeness of the profiles, with substantial variation in text field richness and engagement attributes.</p><h2><b>Interaction Logs</b></h2><p dir="ltr">A 239-day log dataset (January–September 2025) comprising 712,937 interaction records from 89,582 users.<br>The logs capture search queries, result clicks, and temporal sequences of actions, enabling the analysis of authentic expert-seeking behaviour at scale.</p><h2><b>Search Results (SERPs)</b></h2><p dir="ltr">A collection of 530 fifty-item search result pages (SERPs) corresponding to queries that resulted in at least one profile click.<br>Each SERP includes ranked profile identifiers and click metadata, allowing fine-grained analysis of ranking positions and user engagement.</p><h2><b>FaE Profile Schema</b></h2><p dir="ltr">Each profile record contains the following key categories of fields:</p><ul><li><b>Identifiers:</b> <code>id</code>, <code>fae_profile_url</code>, <code>orc_id</code></li><li><b>Personal and professional details:</b> <code>title</code>, <code>first_name</code>, <code>last_name</code>, <code>preferred_name</code></li><li><b>Positions:</b> <code>primary_fac_position</code> (faculty, school, department, role)</li><li><b>Research interests:</b> <code>primary_interest</code>, <code>bio</code></li><li><b>Supervision and industry engagement:</b> <code>supervisor_avail</code>, <code>supervision_statement</code>, <code>industry_avail</code>, <code>industry_statement</code></li><li><b>Academic outputs:</b> <code>authorship_objects</code>, <code>editorship_objects</code>, <code>translatorship_objects</code></li><li><b>Projects:</b> <code>project_objects</code> (project ID, funding type, scheme, sponsor, value, start/end dates, description)</li><li><b>Keywords:</b> <code>user_keywords</code>, <code>wordcloud_keywords</code>, <code>publication_keywords</code></li><li><b>Metadata:</b> <code>last_updated</code></li></ul><h2><b>Data Availability</b></h2><p dir="ltr">The FaE resource will become available upon publication of the corresponding paper. <br>Access will require a signed license agreement, after which the full dataset will be shared upon request via direct contact with the authors.</p><h2><b>Public Sample</b></h2><p dir="ltr">To facilitate reproducibility, we provide a sample package containing:</p><ol><li>A small subset of the interaction log—approximately ten sessions that include user interactions and click-throughs to one of two sample profiles.</li><li>The corresponding two example profiles.</li><li>All 530 search result pages (SERPs) in their original HTML format, each corresponding to a query that resulted in a profile click.</li></ol><p dir="ltr">This sample provides a transparent view of how user interactions, retrieved profiles, and query data are structured within the full FaE resource. It is designed to help researchers replicate key analyses and understand the data organization without requiring access to the full dataset.</p><h2><b>Citation</b></h2><p dir="ltr">To cite this resource, please reference the forthcoming paper describing the dataset.<br>A BibTeX entry will be provided upon publication.<br>Users of this dataset are expected to acknowledge and cite the paper in any derived work.</p><p><br></p>