Solved

Entity Extractor

Posted on 2008-06-12
7
407 Views
Last Modified: 2013-11-23
I need to extract entities from text. I wondered if there is a entity extraction library, vcl, api, etc that can be used and incorporated into a Delphi program? I am using Delphi 2006 VCL.
0
Comment
Question by:mcmahling
7 Comments
 
LVL 28

Expert Comment

by:ciuly
ID: 21768987
depends on what entities you are talking about. there are designated, specialized "extractors" named parsers like html, xml, csv, etc which will parse out the data.

for generic data you have at least one option: regular expression. but still, you will have to write the regex in order to use it (hwich i snot that hard once you get familiar with it).
0
 

Author Comment

by:mcmahling
ID: 21769299
Thanks for your response. I would be extracting entities from .txt documents. Using regular expressions still would require me to have an ontology that must be the accessed by the regular expression to determine what words were entities. I don't readily see how to use regular expressions without that sort of semantic library. I was hoping something already existed that I would not have to do all that work.  
0
 
LVL 28

Expert Comment

by:ciuly
ID: 21769378
you canot have a computer guess what you want to extract. you need to tell it somehow. like ti or not, that's the way things work. the computer doens'tknow that a text file contains text and words and numbers and whatever. for the computer all files are the same: a serie of 0's and 1s.

not even I know what you mean by "entities". how do you want a computer to know?

regular expressions are the most egneric way of extracting arbitrary data from something. if that doesn't help, nothing does.
0
How to run any project with ease

Manage projects of all sizes how you want. Great for personal to-do lists, project milestones, team priorities and launch plans.
- Combine task lists, docs, spreadsheets, and chat in one
- View and edit from mobile/offline
- Cut down on emails

 
LVL 17

Expert Comment

by:TheRealLoki
ID: 21809345
can you explain more about these entities.
What is the source text? is it just random text, or is it in an exisiting structure, such as XML, CSV?
Are theses "entities" going to be consistent within your app? Or do you really need a solution that gives you an extreme amount of flexibility?
I've written commercial EDI engines, so let me know your full requirements
0
 

Author Comment

by:mcmahling
ID: 21811874
The entities are people, places, or things. The text is free unstructured text coming from news articles. I have heard of the open source product Annie but I wondered was their another product especially made for integration with Delphi.
0
 
LVL 36

Expert Comment

by:Geert Gruwez
ID: 21822106
It looks like you only need an AI to read the news articles and let the AI extract the entities ...
AI = probably human brain instead of Artificial Intelligence
0
 
LVL 3

Accepted Solution

by:
Mamouri earned 500 total points
ID: 21855941
Hi

This issue related to field of Data Mining and Text Extraction. Take a look to this library: http://www.dewresearch.com/data-miner.html
The component is a general Data Mining library and could be used for a variety of Data mining usage.
0

Featured Post

Free Trending Threat Insights Every Day

Enhance your security with threat intelligence from the web. Get trending threat insights on hackers, exploits, and suspicious IP addresses delivered to your inbox with our free Cyber Daily.

Join & Write a Comment

Objective: - This article will help user in how to convert their numeric value become words. How to use 1. You can copy this code in your Unit as function 2. than you can perform your function by type this code The Code   (CODE) The Im…
Creating an auto free TStringList The TStringList is a basic and frequently used object in Delphi. On many occasions, you may want to create a temporary list, process some items in the list and be done with the list. In such cases, you have to…
Get a first impression of how PRTG looks and learn how it works.   This video is a short introduction to PRTG, as an initial overview or as a quick start for new PRTG users.
Polish reports in Access so they look terrific. Take yourself to another level. Equations, Back Color, Alternate Back Color. Write easy VBA Code. Tighten space to use less pages. Launch report from a menu, considering criteria only when it is filled…

708 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

12 Experts available now in Live!

Get 1:1 Help Now