Implicit Information Extraction

4:26:00 PM
by Sujan Perera

People communicate their ideas, opinions, and facts using natural language. It is one of the most powerful tools that we as humans have collectively developed over hundreds of thousands of years, and we will continue to develop it in years to come. While the debate on the evolution of language has not reached a consensus, theories estimate that it first evolved around 150,000 to 350,000 years ago, which is roughly the time frame accepted for the evolution of modern Homo sapiens.1 English is one of the most widely used descendants of this evolution, and it has evolved over 1,400 years on its own.2 The evolution of a language reflects the social, cultural, and economic changes that have taken place in society. For example, the industrial revolution took place in the 18th and 19th centuries and had a great impact on all three of these dimensions. It added new words to the language such as condenser, vacuum, reservoir, taxonomy, sodium, and platinum [1].

The evolution of the language has enriched it with many features. One such feature is its ability to express facts, ideas, and opinions in an implicit manner. As humans, we seamlessly use implicit constructs in our daily conversations and rarely find it difficult to decode the content of the messages. Consider the following tweet:

Are we not going to talk about how ridiculous the new space movie with Sandra Bullock is going to be?
— ☯Mac (@mfpokesmot) September 15, 2013

This tweet contains an implicit mention of the movie 'Gravity'. A human with up-to-date knowledge of movies would instantly understand which movie the tweet is talking about. However, the field of information extraction, whose objective is to automatically extract structured information from unstructured and/or semi-structured data, has focused almost exclusively on extracting explicit information from text. Consider the following text snippet extracted from a clinical narrative.

"Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac catheterization because of a positive exercise tolerance test. Recently, he started to have left shoulder twinges and tingling in his hands. A stress test done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith is comfortably breathing in room air. He also showed accumulation of fluid in his extremities. He does not have any chest pain."

State-of-the-art information extraction algorithms would extract the following information from this snippet: 'Bob Smith' and 'Dr. Davis' are entities of type person; 'cardiac catheterization' and 'chest pain' refer to the entities identified by the concept unique identifiers (CUIs) C0018795 and C0008031 in the Unified Medical Language System (UMLS); and there is a cause-effect relationship between exercising and 'fatigue'. Algorithms developed for named entity recognition, entity linking, and relationship extraction can extract this structured information. However, the two sentences "Mr. Smith is comfortably breathing in room air" and "He also showed accumulation of fluid in his extremities" implicitly indicate that the patient does not have the clinical condition 'shortness of breath' but does have 'edema'. This is very important information for assessing the health status of the patient, and a medical professional reading this snippet would easily decode the mentions of these two clinical conditions. An automatic entity linking technique should identify these mentions as referring to the entities with CUIs C0013404 ('shortness of breath') and C0013604 ('edema') in UMLS. Commercially, this capability has applications such as Computer Assisted Coding and Computerized Document Improvement. Unfortunately, current information extraction algorithms are not able to extract this implicit information.
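To make the gap concrete, here is a minimal sketch of a purely explicit, dictionary-based linker run over the snippet above. The lexicon, the function name, and the substring matching are assumptions made for illustration (real entity linkers are far more sophisticated); only the CUIs and the clinical text come from this post.

```python
# Toy lexicon: surface form -> UMLS CUI (CUIs as quoted in the post).
LEXICON = {
    "cardiac catheterization": "C0018795",
    "chest pain": "C0008031",
    "shortness of breath": "C0013404",
    "edema": "C0013604",
}

NOTE = (
    "Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient "
    "cardiac catheterization because of a positive exercise tolerance test. "
    "Recently, he started to have left shoulder twinges and tingling in his "
    "hands. A stress test done on 2013-06-02 revealed that the patient "
    "exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith "
    "is comfortably breathing in room air. He also showed accumulation of "
    "fluid in his extremities. He does not have any chest pain."
)


def link_explicit_mentions(text, lexicon):
    """Return the lexicon terms that literally appear in the text.

    Hypothetical helper: plain case-insensitive substring matching,
    with no handling of negation or paraphrase.
    """
    lowered = text.lower()
    return {term: cui for term, cui in lexicon.items() if term in lowered}


found = link_explicit_mentions(NOTE, LEXICON)
missed = sorted(set(LEXICON) - set(found))

print("explicitly matched:", found)
print("never matched:", missed)
```

The two conditions that are only described ("comfortably breathing in room air", "accumulation of fluid in his extremities") never match, because their names do not occur in the text. The sketch also ignores negation, so it cannot tell that 'chest pain' is reported as absent.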

Implicit constructs are not a rare occurrence. Our studies found that 21% of movie mentions and 40% of book mentions in tweets are implicit, and that about 35% of 'edema' mentions and 40% of 'shortness of breath' mentions in clinical narratives are implicit. There are genuine reasons why people tend to use implicit mentions in daily conversations. Here are a few reasons that we have observed:

To express sentiment and sarcasm: The following tweet has an element of sarcasm and a negative sentiment towards the movie 'Transformers: Age of Extinction.' Both are expressed implicitly. It has been noted that people rely heavily on implicit constructs to express sarcasm [2].

I'm striving to be positive in what I say on Twitter. So I'll refrain from making a comment about the latest Michael Bay movie.
— Darren Currin (@darrencurrin) July 22, 2014
To provide descriptive information: For example, it is common practice in clinical narratives to describe the features of an entity rather than simply list its name. Consider the sentence "small fluid adjacent to the gallbladder with gallstones which may represent inflammation." This sentence contains an implicit mention of the clinical condition 'cholecystitis' and provides important information about the patient's health status that would be missing if the author had listed only the name of the condition. 'Cholecystitis' means "inflammation in gallbladder" and has multiple causes; the sentence provides a detailed description of the condition along with its possible cause. This descriptive information is critical for understanding the patient's health status and treating the patient.
To emphasize the features of an entity: Sometimes we replace the name of an entity with its special characteristics in order to give importance to those characteristics. For example, the text snippet "Mason Evans 12 year long shoot won big in golden globe" has an implicit mention of the movie 'Boyhood.' There is a difference between this snippet and its alternative form "Boyhood won big in golden globe." The speaker wants to emphasize a distinctive feature of the movie, which would have been lost if the speaker had used the movie's name as in the second phrase.
To communicate shared understanding: We do not bother spelling out everything when we know that the other person has enough background knowledge to understand the message. A good example is that clinical narratives rarely mention the relationships between entities explicitly (e.g., relationships between symptoms and disorders, or between medications and disorders); rather, it is understood that the other professionals reading the document have the expertise to infer such implicit relationships.

The above examples show the value that implicit constructs add to daily communication. Another important observation is the role of world knowledge in interpreting implicit constructs. A human reader can decode implicit information only with relevant knowledge of the domain. A reader who does not know about Michael Bay's movie release would have no clue which movie the sarcastic tweet refers to; a reader who does not know the characteristics of the clinical conditions 'shortness of breath' and 'edema' would not be able to decode their mentions in the clinical text snippet shown above; and a reader who is not a medical expert would not be able to connect the diseases and symptoms mentioned in a clinical narrative.

The implicit information extraction task demands comprehensive and up-to-date world knowledge. Individuals resort to a diverse set of entity characteristics to make implicit references (also see [3]). For example, the implicit references to the movie 'Boyhood' use phrases like "Richard Linklater movie", "Ellar Coltrane on his 12-year movie role", "12-year long movie shoot", "latest movie shot in my city Houston", and "Mason Evan's childhood movie." Hence, it is important to have comprehensive knowledge about the entities to decode their implicit mentions. Another complexity is the temporal relevancy of the knowledge. The same phrase can be used to implicitly refer to different entities at different time intervals. For instance, the phrase "space movie" could refer to the movie 'Gravity' in fall 2013 while the same phrase in fall 2015 would likely refer to the movie 'The Martian.' On the flip side, the most salient characteristics of the movies may change over time, and so will the phrases used to refer to them. The movie 'Furious 7' was frequently referred to with the phrase "Paul Walker's last movie" in November 2014. This was due to the actor's death around that time. However, after the movie release in April 2015 the same entity was often mentioned through the phrase "fastest film to reach the $1 billion."
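The temporal aspect can be pictured with a small sketch in which every phrase-to-entity association carries a validity window. Everything here is an assumption made for illustration: the `Fact` record, the `resolve` helper, and especially the date windows, which are rough stand-ins for "fall 2013", "fall 2015", and "November 2014 until the April 2015 release" rather than measured values.

```python
from datetime import date
from typing import NamedTuple, Optional


class Fact(NamedTuple):
    """Hypothetical record: a phrase refers to an entity during a time window."""
    phrase: str
    entity: str
    valid_from: date
    valid_to: date


# Illustrative windows only; the exact boundaries are assumptions.
KNOWLEDGE = [
    Fact("space movie", "Gravity", date(2013, 9, 1), date(2013, 12, 31)),
    Fact("space movie", "The Martian", date(2015, 9, 1), date(2015, 12, 31)),
    Fact("paul walker's last movie", "Furious 7", date(2014, 11, 1), date(2015, 3, 31)),
]


def resolve(phrase: str, when: date, knowledge=KNOWLEDGE) -> Optional[str]:
    """Return the entity a phrase most plausibly refers to on a given date."""
    key = phrase.lower()
    for fact in knowledge:
        if fact.phrase == key and fact.valid_from <= when <= fact.valid_to:
            return fact.entity
    return None  # no time-relevant knowledge for this phrase


print(resolve("space movie", date(2013, 9, 15)))
print(resolve("space movie", date(2015, 10, 10)))
print(resolve("space movie", date(2014, 6, 1)))
```

The same phrase resolves to different movies depending on the date, and to nothing at all when the knowledge base holds no association that is current for that date.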

At Kno.e.sis, we have developed a knowledge-driven solution to perform implicit information extraction. This solution acquires relevant domain knowledge from a diverse set of structured and unstructured knowledge sources, processes the acquired knowledge to represent it in a machine-readable manner, and applies information extraction techniques that use these knowledge sources to decode the implicit information in text. We have successfully applied this solution to extract implicit entities and relationships in clinical narratives [4] [6] and implicit entities in tweets [5].
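As a rough intuition for what "using knowledge to decode implicit mentions" can mean, the sketch below ranks candidate entities by how many of their known descriptor words appear in a text. This is not the Kno.e.sis method described in [4], [5], or [6]; the descriptor sets, the function names, and the word-overlap scoring are all assumptions chosen to keep the example small.

```python
import re

# Hypothetical knowledge: a few descriptor words per entity,
# taken from the examples in this post.
ENTITY_DESCRIPTORS = {
    "Gravity": {"space", "sandra", "bullock"},
    "Boyhood": {"richard", "linklater", "ellar", "coltrane"},
    "Transformers: Age of Extinction": {"michael", "bay"},
}


def tokenize(text):
    """Lowercase the text and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def rank_candidates(text, descriptors=ENTITY_DESCRIPTORS):
    """Rank entities by descriptor-word overlap; drop entities with no overlap."""
    tokens = tokenize(text)
    scored = [(len(tokens & words), entity) for entity, words in descriptors.items()]
    ranked = sorted(scored, key=lambda pair: -pair[0])
    return [entity for score, entity in ranked if score > 0]


tweet = ("Are we not going to talk about how ridiculous the new space movie "
         "with Sandra Bullock is going to be?")
print(rank_candidates(tweet))
```

Even this toy version shows why the knowledge has to be comprehensive: a tweet that refers to 'Boyhood' as "12-year long movie shoot" would score zero here, because that characteristic is missing from the descriptor set.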

1 https://en.wikipedia.org/wiki/Origin_of_language

2 https://en.wikipedia.org/wiki/English_language

References:

[1] Bragg, Melvyn. The adventure of English: The biography of a language. Arcade Publishing, 2006.

[2] Davidov, Dmitry, Oren Tsur, and Ari Rappoport. "Semi-supervised recognition of sarcastic sentences in twitter and amazon." Proceedings of the fourteenth conference on computational natural language learning. Association for Computational Linguistics, 2010.

[3] "Help For HealthCare: Mapping Unstructured Clinical Notes To ICD-10 Coding Schemes." Http://www.dataversity.net/. N.p., 26 Nov. 2013. Web. 19 Aug. 2016.

[4] Sujan Perera, Pablo N. Mendes, Amit Sheth, Krishnaprasad Thirunarayan, Adarsh Alex, Christopher Heid, and Greg Mott. "Implicit entity recognition in clinical documents." In Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics (*SEM), pp. 228-238. 2015.

[5] Sujan Perera, Pablo N. Mendes, Adarsh Alex, Amit P. Sheth, and Krishnaprasad Thirunarayan. "Implicit Entity Linking in Tweets." In Extended Semantic Web Conference, pp. 118-132. Springer International Publishing, 2016.

[6] Sujan Perera, Cory Henson, Krishnaprasad Thirunarayan, Amit Sheth, and Suhas Nair. "Semantics driven approach for knowledge acquisition from EMRs." IEEE journal of biomedical and health informatics 18, no. 2 (2014): 515-524.
