Amino acid dipepetide frequency for Gorilla smacovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.263AlaAla: 5.263 ± 3.218
0.0AlaCys: 0.0 ± 0.0
7.018AlaAsp: 7.018 ± 1.759
0.0AlaGlu: 0.0 ± 0.0
3.509AlaPhe: 3.509 ± 2.145
3.509AlaGly: 3.509 ± 2.145
1.754AlaHis: 1.754 ± 1.459
3.509AlaIle: 3.509 ± 0.386
1.754AlaLys: 1.754 ± 1.073
0.0AlaLeu: 0.0 ± 0.0
1.754AlaMet: 1.754 ± 1.459
1.754AlaAsn: 1.754 ± 1.073
1.754AlaPro: 1.754 ± 1.073
3.509AlaGln: 3.509 ± 0.386
1.754AlaArg: 1.754 ± 1.073
8.772AlaSer: 8.772 ± 0.3
1.754AlaThr: 1.754 ± 1.073
7.018AlaVal: 7.018 ± 1.759
1.754AlaTrp: 1.754 ± 1.459
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
3.509CysAla: 3.509 ± 0.386
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.754CysAsn: 1.754 ± 1.459
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.754CysArg: 1.754 ± 1.459
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.754AspAla: 1.754 ± 1.073
3.509AspCys: 3.509 ± 0.386
3.509AspAsp: 3.509 ± 0.386
1.754AspGlu: 1.754 ± 1.073
0.0AspPhe: 0.0 ± 0.0
8.772AspGly: 8.772 ± 0.3
1.754AspHis: 1.754 ± 1.459
1.754AspIle: 1.754 ± 1.459
1.754AspLys: 1.754 ± 1.459
5.263AspLeu: 5.263 ± 0.686
0.0AspMet: 0.0 ± 0.0
1.754AspAsn: 1.754 ± 1.459
8.772AspPro: 8.772 ± 0.3
5.263AspGln: 5.263 ± 3.218
5.263AspArg: 5.263 ± 1.846
1.754AspSer: 1.754 ± 1.073
8.772AspThr: 8.772 ± 4.764
5.263AspVal: 5.263 ± 0.686
0.0AspTrp: 0.0 ± 0.0
1.754AspTyr: 1.754 ± 1.073
0.0AspXaa: 0.0 ± 0.0
Glu
5.263GluAla: 5.263 ± 0.686
0.0GluCys: 0.0 ± 0.0
3.509GluAsp: 3.509 ± 2.145
1.754GluGlu: 1.754 ± 1.459
1.754GluPhe: 1.754 ± 1.459
0.0GluGly: 0.0 ± 0.0
0.0GluHis: 0.0 ± 0.0
1.754GluIle: 1.754 ± 1.459
1.754GluLys: 1.754 ± 1.459
3.509GluLeu: 3.509 ± 0.386
1.754GluMet: 1.754 ± 1.073
1.754GluAsn: 1.754 ± 1.073
0.0GluPro: 0.0 ± 0.0
1.754GluGln: 1.754 ± 1.073
3.509GluArg: 3.509 ± 2.918
1.754GluSer: 1.754 ± 1.073
3.509GluThr: 3.509 ± 2.918
5.263GluVal: 5.263 ± 0.686
3.509GluTrp: 3.509 ± 2.918
1.754GluTyr: 1.754 ± 1.459
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.754PheAsp: 1.754 ± 1.073
0.0PheGlu: 0.0 ± 0.0
5.263PhePhe: 5.263 ± 0.686
1.754PheGly: 1.754 ± 1.459
0.0PheHis: 0.0 ± 0.0
1.754PheIle: 1.754 ± 1.459
7.018PheLys: 7.018 ± 1.759
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
3.509PhePro: 3.509 ± 2.145
5.263PheGln: 5.263 ± 0.686
1.754PheArg: 1.754 ± 1.073
3.509PheSer: 3.509 ± 0.386
1.754PheThr: 1.754 ± 1.073
3.509PheVal: 3.509 ± 2.145
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.754GlyAla: 1.754 ± 1.073
0.0GlyCys: 0.0 ± 0.0
1.754GlyAsp: 1.754 ± 1.073
5.263GlyGlu: 5.263 ± 3.218
0.0GlyPhe: 0.0 ± 0.0
5.263GlyGly: 5.263 ± 3.218
3.509GlyHis: 3.509 ± 0.386
3.509GlyIle: 3.509 ± 2.918
3.509GlyLys: 3.509 ± 2.918
10.526GlyLeu: 10.526 ± 3.904
1.754GlyMet: 1.754 ± 1.459
1.754GlyAsn: 1.754 ± 1.459
3.509GlyPro: 3.509 ± 0.386
3.509GlyGln: 3.509 ± 0.386
3.509GlyArg: 3.509 ± 0.386
5.263GlySer: 5.263 ± 0.686
5.263GlyThr: 5.263 ± 3.218
7.018GlyVal: 7.018 ± 1.759
3.509GlyTrp: 3.509 ± 0.386
5.263GlyTyr: 5.263 ± 0.686
0.0GlyXaa: 0.0 ± 0.0
His
1.754HisAla: 1.754 ± 1.073
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.509HisGly: 3.509 ± 0.386
0.0HisHis: 0.0 ± 0.0
3.509HisIle: 3.509 ± 2.918
3.509HisLys: 3.509 ± 0.386
1.754HisLeu: 1.754 ± 1.459
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
3.509HisGln: 3.509 ± 2.145
1.754HisArg: 1.754 ± 1.073
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
1.754HisTrp: 1.754 ± 1.459
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.0IleCys: 0.0 ± 0.0
5.263IleAsp: 5.263 ± 0.686
3.509IleGlu: 3.509 ± 2.918
0.0IlePhe: 0.0 ± 0.0
1.754IleGly: 1.754 ± 1.073
0.0IleHis: 0.0 ± 0.0
1.754IleIle: 1.754 ± 1.459
3.509IleLys: 3.509 ± 2.918
0.0IleLeu: 0.0 ± 0.0
3.509IleMet: 3.509 ± 1.719
1.754IleAsn: 1.754 ± 1.459
5.263IlePro: 5.263 ± 0.686
3.509IleGln: 3.509 ± 2.918
3.509IleArg: 3.509 ± 2.918
0.0IleSer: 0.0 ± 0.0
7.018IleThr: 7.018 ± 1.759
7.018IleVal: 7.018 ± 5.836
0.0IleTrp: 0.0 ± 0.0
1.754IleTyr: 1.754 ± 1.459
0.0IleXaa: 0.0 ± 0.0
Lys
3.509LysAla: 3.509 ± 0.386
1.754LysCys: 1.754 ± 1.459
3.509LysAsp: 3.509 ± 2.918
3.509LysGlu: 3.509 ± 2.918
1.754LysPhe: 1.754 ± 1.073
5.263LysGly: 5.263 ± 1.846
0.0LysHis: 0.0 ± 0.0
0.0LysIle: 0.0 ± 0.0
1.754LysLys: 1.754 ± 1.459
5.263LysLeu: 5.263 ± 0.686
3.509LysMet: 3.509 ± 2.145
0.0LysAsn: 0.0 ± 0.0
1.754LysPro: 1.754 ± 1.073
1.754LysGln: 1.754 ± 1.459
5.263LysArg: 5.263 ± 0.686
5.263LysSer: 5.263 ± 1.846
7.018LysThr: 7.018 ± 3.305
3.509LysVal: 3.509 ± 0.386
7.018LysTrp: 7.018 ± 3.305
1.754LysTyr: 1.754 ± 1.073
0.0LysXaa: 0.0 ± 0.0
Leu
1.754LeuAla: 1.754 ± 1.073
0.0LeuCys: 0.0 ± 0.0
0.0LeuAsp: 0.0 ± 0.0
0.0LeuGlu: 0.0 ± 0.0
1.754LeuPhe: 1.754 ± 1.073
5.263LeuGly: 5.263 ± 0.686
3.509LeuHis: 3.509 ± 2.145
5.263LeuIle: 5.263 ± 1.846
3.509LeuLys: 3.509 ± 0.386
3.509LeuLeu: 3.509 ± 0.386
0.0LeuMet: 0.0 ± 0.955
1.754LeuAsn: 1.754 ± 1.459
7.018LeuPro: 7.018 ± 4.291
3.509LeuGln: 3.509 ± 2.145
1.754LeuArg: 1.754 ± 1.459
5.263LeuSer: 5.263 ± 0.686
10.526LeuThr: 10.526 ± 3.904
0.0LeuVal: 0.0 ± 0.0
0.0LeuTrp: 0.0 ± 0.0
7.018LeuTyr: 7.018 ± 0.773
0.0LeuXaa: 0.0 ± 0.0
Met
1.754MetAla: 1.754 ± 1.073
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
3.509MetGlu: 3.509 ± 0.386
1.754MetPhe: 1.754 ± 1.459
1.754MetGly: 1.754 ± 1.459
1.754MetHis: 1.754 ± 1.073
3.509MetIle: 3.509 ± 2.145
1.754MetLys: 1.754 ± 1.073
1.754MetLeu: 1.754 ± 1.459
3.509MetMet: 3.509 ± 2.145
0.0MetAsn: 0.0 ± 0.0
3.509MetPro: 3.509 ± 0.386
1.754MetGln: 1.754 ± 1.073
0.0MetArg: 0.0 ± 0.0
0.0MetSer: 0.0 ± 0.0
1.754MetThr: 1.754 ± 1.459
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.754MetTyr: 1.754 ± 1.073
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
5.263AsnAsp: 5.263 ± 1.846
0.0AsnGlu: 0.0 ± 0.0
0.0AsnPhe: 0.0 ± 0.0
3.509AsnGly: 3.509 ± 0.386
1.754AsnHis: 1.754 ± 1.459
0.0AsnIle: 0.0 ± 0.0
3.509AsnLys: 3.509 ± 0.386
1.754AsnLeu: 1.754 ± 1.073
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
0.0AsnPro: 0.0 ± 0.0
0.0AsnGln: 0.0 ± 0.0
0.0AsnArg: 0.0 ± 0.0
1.754AsnSer: 1.754 ± 1.459
5.263AsnThr: 5.263 ± 0.686
1.754AsnVal: 1.754 ± 1.073
1.754AsnTrp: 1.754 ± 1.459
3.509AsnTyr: 3.509 ± 2.145
0.0AsnXaa: 0.0 ± 0.0
Pro
5.263ProAla: 5.263 ± 3.218
0.0ProCys: 0.0 ± 0.0
1.754ProAsp: 1.754 ± 1.459
3.509ProGlu: 3.509 ± 0.386
3.509ProPhe: 3.509 ± 2.145
3.509ProGly: 3.509 ± 2.145
0.0ProHis: 0.0 ± 0.0
3.509ProIle: 3.509 ± 0.386
3.509ProLys: 3.509 ± 2.918
0.0ProLeu: 0.0 ± 0.0
3.509ProMet: 3.509 ± 2.145
0.0ProAsn: 0.0 ± 0.0
1.754ProPro: 1.754 ± 1.073
1.754ProGln: 1.754 ± 1.073
8.772ProArg: 8.772 ± 2.232
1.754ProSer: 1.754 ± 1.073
7.018ProThr: 7.018 ± 4.291
5.263ProVal: 5.263 ± 3.218
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
7.018GlnAla: 7.018 ± 0.773
0.0GlnCys: 0.0 ± 0.0
7.018GlnAsp: 7.018 ± 3.305
0.0GlnGlu: 0.0 ± 0.0
1.754GlnPhe: 1.754 ± 1.073
0.0GlnGly: 0.0 ± 0.0
0.0GlnHis: 0.0 ± 0.0
5.263GlnIle: 5.263 ± 0.686
1.754GlnLys: 1.754 ± 1.073
8.772GlnLeu: 8.772 ± 2.832
0.0GlnMet: 0.0 ± 0.0
1.754GlnAsn: 1.754 ± 1.073
1.754GlnPro: 1.754 ± 1.073
1.754GlnGln: 1.754 ± 1.073
5.263GlnArg: 5.263 ± 0.686
1.754GlnSer: 1.754 ± 1.073
3.509GlnThr: 3.509 ± 0.386
3.509GlnVal: 3.509 ± 0.386
1.754GlnTrp: 1.754 ± 1.073
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.754ArgAla: 1.754 ± 1.459
0.0ArgCys: 0.0 ± 0.0
5.263ArgAsp: 5.263 ± 1.846
3.509ArgGlu: 3.509 ± 0.386
5.263ArgPhe: 5.263 ± 1.846
5.263ArgGly: 5.263 ± 0.686
1.754ArgHis: 1.754 ± 1.073
3.509ArgIle: 3.509 ± 0.386
7.018ArgLys: 7.018 ± 3.305
5.263ArgLeu: 5.263 ± 0.686
0.0ArgMet: 0.0 ± 0.0
1.754ArgAsn: 1.754 ± 1.073
3.509ArgPro: 3.509 ± 0.386
1.754ArgGln: 1.754 ± 1.459
1.754ArgArg: 1.754 ± 1.459
1.754ArgSer: 1.754 ± 1.073
3.509ArgThr: 3.509 ± 2.918
1.754ArgVal: 1.754 ± 1.073
3.509ArgTrp: 3.509 ± 0.386
3.509ArgTyr: 3.509 ± 0.386
0.0ArgXaa: 0.0 ± 0.0
Ser
5.263SerAla: 5.263 ± 0.686
0.0SerCys: 0.0 ± 0.0
7.018SerAsp: 7.018 ± 3.305
1.754SerGlu: 1.754 ± 1.459
0.0SerPhe: 0.0 ± 0.0
7.018SerGly: 7.018 ± 1.759
0.0SerHis: 0.0 ± 0.0
3.509SerIle: 3.509 ± 2.145
3.509SerLys: 3.509 ± 0.386
1.754SerLeu: 1.754 ± 1.073
1.754SerMet: 1.754 ± 1.073
3.509SerAsn: 3.509 ± 0.386
3.509SerPro: 3.509 ± 2.145
0.0SerGln: 0.0 ± 0.0
1.754SerArg: 1.754 ± 1.073
3.509SerSer: 3.509 ± 0.386
1.754SerThr: 1.754 ± 1.073
1.754SerVal: 1.754 ± 1.073
1.754SerTrp: 1.754 ± 1.459
1.754SerTyr: 1.754 ± 1.073
0.0SerXaa: 0.0 ± 0.0
Thr
1.754ThrAla: 1.754 ± 1.073
1.754ThrCys: 1.754 ± 1.459
7.018ThrAsp: 7.018 ± 4.291
3.509ThrGlu: 3.509 ± 0.386
1.754ThrPhe: 1.754 ± 1.459
10.526ThrGly: 10.526 ± 1.372
0.0ThrHis: 0.0 ± 0.0
1.754ThrIle: 1.754 ± 1.459
1.754ThrLys: 1.754 ± 1.073
1.754ThrLeu: 1.754 ± 1.073
1.754ThrMet: 1.754 ± 1.459
7.018ThrAsn: 7.018 ± 1.759
5.263ThrPro: 5.263 ± 0.686
1.754ThrGln: 1.754 ± 1.459
1.754ThrArg: 1.754 ± 1.073
7.018ThrSer: 7.018 ± 1.759
0.0ThrThr: 0.0 ± 0.0
15.789ThrVal: 15.789 ± 0.473
5.263ThrTrp: 5.263 ± 4.377
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
7.018ValAla: 7.018 ± 0.773
0.0ValCys: 0.0 ± 0.0
3.509ValAsp: 3.509 ± 2.145
3.509ValGlu: 3.509 ± 0.386
3.509ValPhe: 3.509 ± 2.145
5.263ValGly: 5.263 ± 0.686
5.263ValHis: 5.263 ± 1.846
5.263ValIle: 5.263 ± 4.377
5.263ValLys: 5.263 ± 1.846
7.018ValLeu: 7.018 ± 1.759
0.0ValMet: 0.0 ± 0.0
1.754ValAsn: 1.754 ± 1.073
1.754ValPro: 1.754 ± 1.459
5.263ValGln: 5.263 ± 0.686
5.263ValArg: 5.263 ± 1.846
1.754ValSer: 1.754 ± 1.073
3.509ValThr: 3.509 ± 2.145
8.772ValVal: 8.772 ± 0.3
1.754ValTrp: 1.754 ± 1.459
7.018ValTyr: 7.018 ± 4.291
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.754TrpGlu: 1.754 ± 1.459
3.509TrpPhe: 3.509 ± 2.145
1.754TrpGly: 1.754 ± 1.459
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.754TrpLys: 1.754 ± 1.459
3.509TrpLeu: 3.509 ± 2.918
5.263TrpMet: 5.263 ± 1.846
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
5.263TrpGln: 5.263 ± 0.686
3.509TrpArg: 3.509 ± 2.918
0.0TrpSer: 0.0 ± 0.0
3.509TrpThr: 3.509 ± 2.918
3.509TrpVal: 3.509 ± 2.918
0.0TrpTrp: 0.0 ± 0.0
1.754TrpTyr: 1.754 ± 1.459
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.754TyrAla: 1.754 ± 1.073
0.0TyrCys: 0.0 ± 0.0
5.263TyrAsp: 5.263 ± 1.846
7.018TyrGlu: 7.018 ± 0.773
1.754TyrPhe: 1.754 ± 1.073
1.754TyrGly: 1.754 ± 1.073
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
5.263TyrLys: 5.263 ± 0.686
1.754TyrLeu: 1.754 ± 1.073
0.0TyrMet: 0.0 ± 0.0
1.754TyrAsn: 1.754 ± 1.459
1.754TyrPro: 1.754 ± 1.073
1.754TyrGln: 1.754 ± 1.073
3.509TyrArg: 3.509 ± 2.145
0.0TyrSer: 0.0 ± 0.0
1.754TyrThr: 1.754 ± 1.073
1.754TyrVal: 1.754 ± 1.459
1.754TyrTrp: 1.754 ± 1.073
5.263TyrTyr: 5.263 ± 3.218
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (571 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski