Khemeia TDW Live Presentation NOV16
-
Upload
maria-shiao -
Category
Documents
-
view
8 -
download
1
Transcript of Khemeia TDW Live Presentation NOV16
1
Generating value from your legacy content – you are not aloneTDW Live – Nov 16 2016
Content Transformation Software: Khemeia™
Maria ShiaoVP Business DevelopmentStelae Technologies
2
Big Data and its myths…
21/11/2016 Stelae Technologies – TDW Live 2
3
Paper = unstructuredDigital ≠ structured
Also … unstructured ≠ old
• Reusable• Searchable• Compliant
• Industry/RegulatoryStandards
• Internal data and meta data standard – enables big data analytics21/11/2016 Stelae Technologies – TDW Live 3
4
What’s happening in other industries…
LegalLegislationRegulation
FinancialAccountingRiskCompliance
Publishing
Scale and automationStandardizationSkills optimizationEnterprise-wide insights (faster, better)Managing compliance and risk
21/11/2016 Stelae Technologies – TDW Live 4
5
What makes unstructured data valuable?
21/11/2016 Stelae Technologies – TDW Live 5
6
The value chain of (unstructured) data
StructuredContent(20%)
UnstructuredContent (80%)
Unified Standardised
Structured Content
Management Search
Analytics
ManagementEnd Users(devices,
mobile, AR)
21/11/2016 Stelae Technologies – TDW Live 6
7
What are the options today?
Input: PDF Word Text
OCR Scans
Output: XML DITA S1000D XBRL/iXBRL html
Outsourcing or semi-automated scripts• Cost• Speed• Quality• Bias
Text and semantic meta-tagging• Size of dictionaries• Visual structures• Language• Bias
21/11/2016 Stelae Technologies – TDW Live 7
8
Typical Editor-based Transformation Workflow
Source Documents: Multi Format searchable PDF/ Word
Create DM Ref
OCR images
Start Workflow
Glossary of s1000D tags
Manual copy paste from text into s1000D editor
Extract images from PDF –TIFF/JPEG
Bring it together into s1000D editor
Draft and Quality Check/ Approval
Publish Final Copy
End
21/11/2016 Stelae Technologies – TDW Live 8
9
Challenges
TIME
TRAINING ON EDITOR
TRAINING ON TAGS – PROCEDURES, DESCRIPTIVE, IPC …
COPY PASTE OPERATION
SKILL
KNOWLEDGE OF S1000D
HIGH COST DUE TO
TRAINED RESOURCES
IN-ABILITY TO SCALE
PRODUCTIVITY CONSTRAINTS
TIME FOR RESOURCE RAMP-UP
21/11/2016 Stelae Technologies – TDW Live 9
10
Khemeia based Transformation Workflow
Source Documents: Multi Format searchable PDF/ Word
Create DM Ref
OCR images
Start Workflow
Glossary of s1000D tags
Manual copy paste from text into s1000D editor
Extract images from PDF –TIFF/JPEG
Bring it together into s1000D editor
Draft and Quality Check/ Approval
Publisher of Editor
End
Import to CSDB/ Publisher
21/11/2016 Stelae Technologies – TDW Live 10
11
Easy Conversion with Khemeia™
Minimal Change Management
Drag file
Drop file
Hot Folder Configurations
identify Document Type and Layout
Identify tables and content layout
Uses Glossary to Tag words
Automatable to directly drop into the folders
Feedback to existing process
21/11/2016 Stelae Technologies – TDW Live 11
12
Khemeia overcomes Challenges
TIME
TRAINING ON EDITOR
TRAINING ON TAGS – PROCEDURES, DESCRIPTIVE, IPC …
COPY PASTE OPERATION
SKILL
EVERY OPERATOR DOES
NOT NEED KNOWLEDGE OF
S1000D
HIGH COST DUE TO
TRAINED RESOURCES
IN-ABILITY TO SCALE
PRODUCTIVITY CONSTRAINTS
TIME FOR RESOURCE
RAMP-UP
21/11/2016 Stelae Technologies – TDW Live 12
13
Conclusions
Unstructured legacy data has value Why
Which data/meta-data sets have the most relevance and impact downstream
This value has to be extracted upfront Direct cost
Operational, lifecycle cost
Time
Technology is available to largely automate the process Proven in other industries
Significant early adopters in the A&D sector
21/11/2016 Stelae Technologies – TDW Live 13
14
Q&AThank You!
Contact:Maria Shiao, VP Business Development+44 7779 77 89 [email protected]@mariashiao
www.stelae-technologies.com@stelaetech