<?xml version="1.0" encoding="UTF-8"?>
<mods xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" version="3.1" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
  <titleInfo>
    <title>Improving the automatic summarization of Arabic text depending on rhetorical structure theory</title>
  </titleInfo>
  <titleInfo type="alternative">
    <title>تحسين التلخيص الآلي للنصوص العربية اعتمادًا على نظرية التركيب البياني</title>
  </titleInfo>
  <name type="personal">
    <namePart>Ahmed Ibrahim Moussa Hussein Ali</namePart>
    <role>
      <roleTerm authority="marcrelator" type="text">creator</roleTerm>
    </role>
  </name>
  <name type="personal">
    <namePart>Laila Nassef</namePart>
    <role>
      <roleTerm type="text">Supervisor</roleTerm>
    </role>
  </name>
  <name type="personal">
    <namePart>Mervat Gheith</namePart>
    <role>
      <roleTerm type="text">Supervisor</roleTerm>
    </role>
  </name>
  <name type="personal">
    <namePart>Tarek Elghazaly</namePart>
    <role>
      <roleTerm type="text">Supervisor</roleTerm>
    </role>
  </name>
  <typeOfResource>text</typeOfResource>
  <genre authority="marc">theses</genre>
  <originInfo>
    <place>
      <placeTerm type="code" authority="marccountry">ua</placeTerm>
    </place>
    <place>
      <placeTerm type="text">Cairo</placeTerm>
    </place>
    <publisher>Ahmed Ibrahim Moussa Hussein Ali</publisher>
    <dateIssued>2014</dateIssued>
    <issuance>monographic</issuance>
  </originInfo>
  <language>
    <languageTerm authority="iso639-2b" type="code">eng</languageTerm>
  </language>
  <physicalDescription>
    <form authority="marcform">print</form>
    <extent>114 leaves : charts ; 25 cm</extent>
  </physicalDescription>
  <abstract>Numerous documents, reports and articles are now available in digital form, so search engines retrieve an abundance of information, and users and agencies are flooded with an overwhelming number of emails and documents. Such retrieved documents therefore need to be summarized, and automatic text summarization has become an essential tool. The key problem with automatic text summarization, however, is that the resulting summary is often incoherent and deviates from the context of the original text. This problem emerges when statistical techniques are used for summarization. This thesis instead uses a semantic technique by adopting Rhetorical Structure Theory (RST). RST is a descriptive theory of a major aspect of the organization of natural texts. It extracts the semantics behind a text by identifying its most significant parts. The thesis introduces an infrastructure for applying RST to Arabic by collecting the Arabic rhetorical relations from different resources and using them to build the rhetorical structure. However, the quality of RST summarization suffers when dealing with large documents.</abstract>
  <targetAudience authority="marctarget">specialized</targetAudience>
  <note type="statement of responsibility">Ahmed Ibrahim Moussa Hussein Ali ; Supervised by Mervat Gheith, Laila Nassef, Tarek Elghazaly</note>
  <note>Thesis (M.Sc.) - Cairo University - Institute of Statistical Studies and Research - Department of Computer and Information Sciences</note>
  <note>Issued also as CD</note>
  <subject>
    <topic>Arabic text</topic>
  </subject>
  <subject>
    <topic>Rhetorical structure theory</topic>
  </subject>
  <subject>
    <topic>RST</topic>
  </subject>
  <identifier type="uri">http://172.23.153.220/th.pdf</identifier>
  <location>
    <url>http://172.23.153.220/th.pdf</url>
  </location>
  <recordInfo>
    <recordContentSource authority="marcorg">EG-GiCUC</recordContentSource>
    <recordCreationDate encoding="marc">141113</recordCreationDate>
    <recordChangeDate encoding="iso8601">20250223031106.0</recordChangeDate>
    <languageOfCataloging>
      <languageTerm authority="iso639-2b" type="code">eng</languageTerm>
    </languageOfCataloging>
  </recordInfo>
</mods>
