Class EmblRecord


  • public class EmblRecord
    extends java.lang.Object
    this class contains the parsed data of embl file
    Since:
    5.0.0
    Author:
    Noor Aldeen Al Mbaidin
    • Constructor Summary

      Constructors 
      Constructor Description
      EmblRecord()  
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      java.util.List<java.lang.String> getAccessionNumber()
      The AC (Accession number) line lists the accession numbers associated with the entry.
      java.lang.String getAssemblyHeader()
      The AH (Assembly Header) line provides column headings for the assembly information.
      java.lang.String getAssemblyInformation()
      The AS (Assembly Information) lines provide information on the composition of a TPA or TSA sequence.
      java.lang.String getConstructedSequence()
      Con(structed) sequences in the CON data classes represent complete chromosomes, genomes and other long sequences constructed from segment entries.
      java.lang.String getCreatedDate()
      The DT line shows when an entry first appeared in the database
      java.lang.String getDatabaseCrossReference()
      The DR (Database Cross-reference) line cross-references other databases which contain information related to the entry in which the DR line appears.
      EmblId getEmblId()
      The ID (IDentification) line The tokens represent: 1.
      java.util.List<EmblReference> getEmblReference()
      The Reference (RN, RC, RP, RX, RG, RA, RT, RL) Lines These lines comprise the literature citations within the database.
      java.lang.String getFeatureHeader()
      The FH (Feature Header) lines are present only to improve readability of an entry when it is printed or displayed on a terminal screen.
      java.lang.String getFeatureTable()
      The FT (Feature Table) lines provide a mechanism for the annotation of the sequence data.
      java.util.List<java.lang.String> getKeyword()
      The KW (KeyWord) lines provide information which can be used to generate cross-reference indexes of the sequence entries based on functional, structural, or other categories deemed important.
      java.lang.String getLastUpdatedDate()
      The DT (DaTe) line shows when an entry was last updated in the database.
      java.lang.String getOrGanelle()
      The OG (OrGanelle) linetype indicates the sub-cellular location of non-nuclear sequences.
      java.lang.String getOrganismClassification()
      The OC (Organism Classification) lines contain the taxonomic classification Of the source organism
      java.lang.String getOrganismSpecies()
      The OS (Organism Species) line specifies the preferred scientific name of the organism which was the source of the stored sequence.
      java.lang.String getProjectIdentifier()  
      java.lang.String getSequence()
      The Sequence Data Line
      java.lang.String getSequenceDescription()
      The DE (Description) lines contain general descriptive information about the sequence stored.
      java.lang.String getSequenceHeader()
      The SQ (SeQuence header) line marks the beginning of the sequence data and Gives a summary of its content.
      void setAccessionNumber​(java.util.List<java.lang.String> accessionNumber)  
      void setAssemblyHeader​(java.lang.String assemblyHeader)  
      void setAssemblyInformation​(java.lang.String assemblyInformation)  
      void setConstructedSequence​(java.lang.String constructedSequence)  
      void setCreatedDate​(java.lang.String createdDate)  
      void setDatabaseCrossReference​(java.lang.String databaseCrossReference)  
      void setEmblId​(EmblId emblId)  
      void setEmblReference​(java.util.List<EmblReference> emblReference)  
      void setFeatureHeader​(java.lang.String featureHeader)  
      void setFeatureTable​(java.lang.String featureTable)  
      void setKeyword​(java.util.List<java.lang.String> keyword)  
      void setLastUpdatedDate​(java.lang.String lastUpdatedDate)  
      void setOrGanelle​(java.lang.String orGanelle)  
      void setOrganismClassification​(java.lang.String organismClassification)  
      void setOrganismSpecies​(java.lang.String organismSpecies)  
      void setProjectIdentifier​(java.lang.String projectIdentifier)  
      void setSequence​(java.lang.String sequence)  
      void setSequenceDescription​(java.lang.String sequenceDescription)  
      void setSequenceHeader​(java.lang.String sequenceHeader)  
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • EmblRecord

        public EmblRecord()
    • Method Detail

      • getEmblId

        public EmblId getEmblId()
        The ID (IDentification) line The tokens represent: 1. Primary accession number 2. Sequence version number 3. Topology: 'circular' or 'linear' 4. Molecule type 5. Data class 6. Taxonomic division 7. Sequence length
        Returns:
        EmblId
      • setEmblId

        public void setEmblId​(EmblId emblId)
      • getEmblReference

        public java.util.List<EmblReference> getEmblReference()
        The Reference (RN, RC, RP, RX, RG, RA, RT, RL) Lines These lines comprise the literature citations within the database. The citations provide access to the papers from which the data has been abstracted.
        Returns:
        EmblReference
      • setEmblReference

        public void setEmblReference​(java.util.List<EmblReference> emblReference)
      • getAccessionNumber

        public java.util.List<java.lang.String> getAccessionNumber()
        The AC (Accession number) line lists the accession numbers associated with the entry.
        Returns:
        List
      • setAccessionNumber

        public void setAccessionNumber​(java.util.List<java.lang.String> accessionNumber)
      • getProjectIdentifier

        public java.lang.String getProjectIdentifier()
        Returns:
        String
      • setProjectIdentifier

        public void setProjectIdentifier​(java.lang.String projectIdentifier)
      • getOrGanelle

        public java.lang.String getOrGanelle()
        The OG (OrGanelle) linetype indicates the sub-cellular location of non-nuclear sequences.
        Returns:
        String
      • setOrGanelle

        public void setOrGanelle​(java.lang.String orGanelle)
      • getCreatedDate

        public java.lang.String getCreatedDate()
        The DT line shows when an entry first appeared in the database
        Returns:
        String
      • setCreatedDate

        public void setCreatedDate​(java.lang.String createdDate)
      • getFeatureHeader

        public java.lang.String getFeatureHeader()
        The FH (Feature Header) lines are present only to improve readability of an entry when it is printed or displayed on a terminal screen.
        Returns:
        String
      • setFeatureHeader

        public void setFeatureHeader​(java.lang.String featureHeader)
      • getFeatureTable

        public java.lang.String getFeatureTable()
        The FT (Feature Table) lines provide a mechanism for the annotation of the sequence data. Regions or sites in the sequence which are of interest are listed in the table.
        Returns:
        String
      • setFeatureTable

        public void setFeatureTable​(java.lang.String featureTable)
      • getLastUpdatedDate

        public java.lang.String getLastUpdatedDate()
        The DT (DaTe) line shows when an entry was last updated in the database.
        Returns:
        String
      • setLastUpdatedDate

        public void setLastUpdatedDate​(java.lang.String lastUpdatedDate)
      • getSequenceDescription

        public java.lang.String getSequenceDescription()
        The DE (Description) lines contain general descriptive information about the sequence stored. This may include the designations of genes for which the sequence codes, the region of the genome from which it is derived, or other information which helps to identify the sequence.
        Returns:
        String
      • setSequenceDescription

        public void setSequenceDescription​(java.lang.String sequenceDescription)
      • getKeyword

        public java.util.List<java.lang.String> getKeyword()
        The KW (KeyWord) lines provide information which can be used to generate cross-reference indexes of the sequence entries based on functional, structural, or other categories deemed important.
        Returns:
        List
      • setKeyword

        public void setKeyword​(java.util.List<java.lang.String> keyword)
      • getOrganismSpecies

        public java.lang.String getOrganismSpecies()
        The OS (Organism Species) line specifies the preferred scientific name of the organism which was the source of the stored sequence. In most cases this is done by giving the Latin genus and species designations, followed (in parentheses) by the preferred common name in English where known.
        Returns:
        String
      • setOrganismSpecies

        public void setOrganismSpecies​(java.lang.String organismSpecies)
      • getOrganismClassification

        public java.lang.String getOrganismClassification()
        The OC (Organism Classification) lines contain the taxonomic classification Of the source organism
        Returns:
        String
      • setOrganismClassification

        public void setOrganismClassification​(java.lang.String organismClassification)
      • getDatabaseCrossReference

        public java.lang.String getDatabaseCrossReference()
        The DR (Database Cross-reference) line cross-references other databases which contain information related to the entry in which the DR line appears.
        Returns:
        String
      • setDatabaseCrossReference

        public void setDatabaseCrossReference​(java.lang.String databaseCrossReference)
      • getAssemblyHeader

        public java.lang.String getAssemblyHeader()
        The AH (Assembly Header) line provides column headings for the assembly information.
        Returns:
        String
      • setAssemblyHeader

        public void setAssemblyHeader​(java.lang.String assemblyHeader)
      • getAssemblyInformation

        public java.lang.String getAssemblyInformation()
        The AS (Assembly Information) lines provide information on the composition of a TPA or TSA sequence.
        Returns:
        String
      • setAssemblyInformation

        public void setAssemblyInformation​(java.lang.String assemblyInformation)
      • getConstructedSequence

        public java.lang.String getConstructedSequence()
        Con(structed) sequences in the CON data classes represent complete chromosomes, genomes and other long sequences constructed from segment entries.
        Returns:
        String
      • setConstructedSequence

        public void setConstructedSequence​(java.lang.String constructedSequence)
      • getSequenceHeader

        public java.lang.String getSequenceHeader()
        The SQ (SeQuence header) line marks the beginning of the sequence data and Gives a summary of its content.
        Returns:
        String
      • setSequenceHeader

        public void setSequenceHeader​(java.lang.String sequenceHeader)
      • getSequence

        public java.lang.String getSequence()
        The Sequence Data Line
        Returns:
        String
      • setSequence

        public void setSequence​(java.lang.String sequence)