FactMiners.org - Conference Papers
http://www.factminers.org/tags/conference-papers
FactMiners References to our #DATeCH2017 Poster Session
http://www.factminers.org/datech2017
<div class="field field-name-field-tags field-type-taxonomy-term-reference field-label-hidden view-mode-rss view-mode-rss"><ul class="field-items"><li class="field-item even" rel="dc:subject"><a href="/tags/datech2017" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">#DATeCH2017</a></li><li class="field-item odd" rel="dc:subject"><a href="/tags/magazine-research" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Magazine Research</a></li><li class="field-item even" rel="dc:subject"><a href="/tags/conference-papers" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Conference Papers</a></li></ul></div><div class="field field-name-body field-type-text-with-summary field-label-hidden view-mode-rss view-mode-rss"><div class="field-items"><div class="field-item even" property="content:encoded"><p>This article is currently a placeholder. It will become a supplementary resource in support of our poster session at #DATeCH2017.</p>
</div></div></div>
Tue, 23 May 2017 18:06:50 +0000
Jim Salmons
Softalk Apple Is First Collection at Internet Archive with FactMiners' MAGAZINE Ground Truth Storage Metadata
http://www.factminers.org/content/softalk-apple-first-collection-internet-archive-factminers-magazine-ground-truth-storage
<div class="field field-name-field-tags field-type-taxonomy-term-reference field-label-hidden view-mode-rss view-mode-rss"><ul class="field-items"><li class="field-item even" rel="dc:subject"><a href="/tags/news" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">News</a></li><li class="field-item odd" rel="dc:subject"><a href="/tags/datech2017" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">#DATeCH2017</a></li><li class="field-item even" rel="dc:subject"><a href="/tags/conference-papers" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Conference Papers</a></li><li class="field-item odd" rel="dc:subject"><a href="/tags/internet-archive" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Internet Archive</a></li><li class="field-item even" rel="dc:subject"><a href="/tags/ground-truth" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Ground Truth</a></li><li class="field-item odd" rel="dc:subject"><a href="/tags/magazine-research" typeof="skos:Concept" property="rdfs:label skos:prefLabel" datatype="">Magazine Research</a></li></ul></div><div class="field field-name-body field-type-text-with-summary field-label-hidden view-mode-rss view-mode-rss"><div class="field-items"><div class="field-item even" property="content:encoded"><p><strong>FactMiners</strong> and <strong>The Softalk Apple Project</strong> are excited to announce that The Softalk Apple Project's digital collection of the Apple edition of Softalk magazine is now included in both <a href="https://archive.org/details/computermagazines"><em>"The Computer Magazine Archives"</em></a> and <em>"Magazine Rack"</em> collections at the <strong>Internet Archive</strong>. Our projects also were granted full admin rights to the Softalk Apple collection at the Archive in support of our applied research.</p>
<p>And the first BIG NEWS made possible by our having full admin access is that The Softalk Apple Project collection is the <strong>FIRST</strong> (and so far ONLY) <strong>digital magazine, newspaper, or serial publication at the Archive to provide XML-based FactMiners' MAGAZINE #GTS (Ground Truth Storage) metadata files</strong> for each issue of the magazine as well as a "master file" for the entire publication/collection!</p>
<p>"Ground Truth Storage" is a term that image-analysis and text-mining researchers use for metadata files that are human-curated and validated to be as close to 100% accurate as possible. The #GTS format that we developed at FactMiners is based on an 'ontological stack' of the #cidocCRM (the International Council of Museums' Conceptual Reference Model for Cultural Heritage), FRBRoo (the object-oriented version of the IFLA's Functional Requirements for Bibliographic Records) and PRESSoo (the IFLA model for serial publications). Rather than focus on the within-page ground truth of individual page layout and text recognition, the FactMiners' MAGAZINE metadata format incorporates a comprehensive, publication-wide metadata model that integrates the complex Document Structure and Content Depiction models.</p>
<p>In an effort to keep our project collaborators and supporters informed, we made two short demo screencast video updates about our progress developing the Python-based ppg2leaf_ferret metadata discovery and validation tool:</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/ei1YoSgNL6w" frameborder="0" allowfullscreen=""></iframe><p>
with the second update showcasing our generalization of the ferret to handle bottom-margin page number spotting. The issue we quickly explore is the famous August 1981 issue of Byte magazine all about Smalltalk:</p>
<iframe width="560" height="315" src="https://www.youtube.com/embed/mttUby8NRpw" frameborder="0" allowfullscreen=""></iframe><p>
To take a look at the initial iteration of the FactMiners MAGAZINE #GTS (Ground Truth Storage) format metadata files at the Internet Archive, see here:</p>
<p> <a href="https://archive.org/download/softalkapple/softalkapple_publication.xml">https://archive.org/download/softalkapple/softalkapple_publication.xml</a></p>
<p>for the <strong>publication level</strong> MAGAZINE #GTS file, and here:</p>
<p> <a href="https://archive.org/download/softalkv1n01sep1980/softalkv1n01sep1980_magazine.xml">https://archive.org/download/softalkv1n01sep1980/softalkv1n01sep1980_mag...</a></p>
<p>for an example of the <strong>issue level</strong> MAGAZINE #GTS metadata file.</p>
<p>Keep in mind that this initial publication of our MAGAZINE metadata files is very "thin" at the moment, with mostly empty placeholder tags that will be filled in with full models and associated datasets. The XML Schemas for the MAGAZINE format are published and available to all researchers via the FactMiners website. See the XML header of the above metadata files for the standard XML schema location reference to these files.</p>
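<p>As a rough illustration of reading the schema location reference mentioned above, the following Python sketch pulls the <code>xsi:schemaLocation</code> pairs out of an XML document's root element. The sample document, its <code>magazine</code> root element, and the <code>example.org</code> namespace and schema URLs are hypothetical stand-ins, not the actual MAGAZINE #GTS header.</p>

```python
import xml.etree.ElementTree as ET

# Standard XML Schema instance namespace (defined by the W3C).
XSI_NS = "http://www.w3.org/2001/XMLSchema-instance"

# Hypothetical sample: element name, namespace, and schema URL are assumptions
# made for illustration, not copied from the real MAGAZINE #GTS files.
SAMPLE = """<magazine xmlns="http://example.org/magazine"
          xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
          xsi:schemaLocation="http://example.org/magazine http://example.org/schemas/magazine.xsd">
</magazine>"""


def schema_locations(xml_text):
    """Return {namespace: schema URL} pairs from the root's xsi:schemaLocation."""
    root = ET.fromstring(xml_text)
    raw = root.get("{%s}schemaLocation" % XSI_NS, "")
    # The attribute value is a whitespace-separated list of namespace/URL pairs.
    tokens = raw.split()
    return dict(zip(tokens[0::2], tokens[1::2]))


if __name__ == "__main__":
    for namespace, url in schema_locations(SAMPLE).items():
        print(namespace, "->", url)
```

<p>To try this against one of the published files, download it first and pass its text to <code>schema_locations</code>; a document without the attribute simply yields an empty dictionary.</p>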
<p>The individual issue level MAGAZINE #GTS metadata files include a "ppg2leaf_map" that guarantees the relationship between Softalk's print page numbers and their respective "leaf" images in the digital copies at the Archive. While our MAGAZINE files are admittedly lean at the moment, The Softalk Apple Project already has extensive data "in the can": curated Advertisers Indexes, mastheads, Tables of Contents, and lists of Companies, People, Products, etc. who made, or were covered editorially in, the magazine. We are currently writing the Python scripts to generate the XML metadata that will begin populating the publication level model and dataset metadata. Issue-specific subsets of our data will also be included in the issue level metadata files.</p>
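<p>To make the idea of a print-page-to-leaf mapping concrete, here is a minimal Python sketch. The XML layout it parses (a <code>ppg2leaf_map</code> element containing <code>leaf</code> elements with <code>leafnum</code> and <code>ppg</code> attributes) is an assumption made for illustration; consult the published issue level files and schemas for the real structure.</p>

```python
import xml.etree.ElementTree as ET

# Hypothetical map fragment: the child element and attribute names are
# assumptions, not the actual MAGAZINE #GTS schema.
SAMPLE = """
<ppg2leaf_map>
  <leaf leafnum="0" ppg="cover"/>
  <leaf leafnum="1" ppg="ii"/>
  <leaf leafnum="2" ppg="1"/>
  <leaf leafnum="3" ppg="2"/>
</ppg2leaf_map>
"""


def load_ppg2leaf(xml_text):
    """Build a {print page label: leaf number} dictionary from the map XML."""
    root = ET.fromstring(xml_text)
    # Page labels stay strings because print pages are not always numeric
    # (covers, roman-numeral front matter, and so on).
    return {leaf.get("ppg"): int(leaf.get("leafnum")) for leaf in root.iter("leaf")}


def leaf_for_page(mapping, page):
    """Look up the leaf number for a print page label; raises KeyError if absent."""
    return mapping[str(page)]


if __name__ == "__main__":
    mapping = load_ppg2leaf(SAMPLE)
    print(leaf_for_page(mapping, 1))
    print(leaf_for_page(mapping, "cover"))
```

<p>Keeping the lookup as a plain dictionary means a missing page fails loudly rather than silently pointing at the wrong leaf image, which suits a mapping that is meant to be validated.</p>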
<p>The publication of these MAGAZINE #GTS files is the subject of FactMiners' first paper submitted to <a href="http://ddays.digitisation.eu/datech-2017/"><strong>#DATeCH2017</strong></a>, and the ppg2leaf mapping found in the issue level files is the subject of our second paper to this EuropeanaTech Digital Humanities research conference, which is scheduled to take place in Germany in early June.</p>
<p>The FactMiners #GTS format is evolving as a resource to support eResearch and machine learning at the Internet Archive. As always, comments and questions are welcome. Even better, we welcome volunteers who would like to become involved in our #CitizenScience and #CitizenHistory projects, FactMiners and The Softalk Apple Project. To express your interest, feel free to contact us through this website or via social media channels.</p>
<p>For those interested in reading pre-review PDFs of our #DATeCH2017 submissions, they are available to those with ResearchGate.net access at <a href="https://www.researchgate.net/publication/305720742_Ground_Truth_Softalk_Magazine_Using_Aletheia_Web_Edition_to_do_FactMiners%27_Text-mining"><em>"Ground Truth & Softalk Magazine: Using Aletheia Web Edition to do FactMiners’ Text-mining"</em></a> and <a href="https://www.researchgate.net/publication/313046838_Print-Page_Number_to_Leaf_ID_Mapping_in_Support_of_eResearch_and_Machine-Learning_at_the_Internet_Archive"><em>"Print-Page Number to "Leaf" ID Mapping in Support of eResearch and Machine-Learning at the Internet Archive"</em></a>. Others interested in our applied research may request personal communication copies via the contact form on this website or through any of the social media channels in which we are active.</p>
</div></div></div>
Mon, 06 Mar 2017 19:48:23 +0000
Jim Salmons