Amino acid dipeptide frequency for Hubei tombus-like virus 38

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
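As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping adjacent residue pairs and reports them in per mille. The function name `dipeptide_per_mille`, the choice to count pairs within each protein only (not across protein boundaries), and the mapping of every non-standard letter to `X` are assumptions made for illustration, not a description of how the database computes its values.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard residues


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs, counted within a single protein only.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not the proteins of this proteome).
freqs = dipeptide_per_mille(["MAAK", "AAG"])
```

With the toy input, five pairs are counted in total (MA, AA, AK, AA, AG), so `freqs["AA"]` is two fifths of 1000.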

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
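The colouring rule can be sketched as a comparison against the uniform expectation. The helper names below are hypothetical, and the choice of the 400-dipeptide baseline (rather than 441) as the default threshold is an assumption based on the 0.25% figure quoted above.

```python
def uniform_expectation_per_mille(include_xaa=False):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    n = 21 if include_xaa else 20
    return 1000.0 / (n * n)


def classify(value_per_mille, expected=None):
    """Return 'red' (overrepresented), 'blue' (underrepresented) or 'none'."""
    if expected is None:
        expected = uniform_expectation_per_mille()  # 1000 / 400
    if value_per_mille > expected:
        return "red"
    if value_per_mille < expected:
        return "blue"
    return "none"


# Two values taken from the table: AlaAla and AlaCys.
ala_ala = classify(11.523)
ala_cys = classify(0.823)
```

Here `ala_ala` comes out as overrepresented and `ala_cys` as underrepresented relative to the 2.5‰ baseline.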

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
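As a short worked conversion, using the AlaAla value from the table and the uniform expectation for 400 dipeptides:

```latex
\begin{align*}
11.523\,\text{\textperthousand} &= 11.523 \times 10^{-3} = 0.011523 = 1.1523\,\% \\
\frac{1}{400} &= 0.0025 = 0.25\,\% = 2.5\,\text{\textperthousand}
\end{align*}
```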

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.523AlaAla: 11.523 ± 2.264
0.823AlaCys: 0.823 ± 0.604
5.761AlaAsp: 5.761 ± 2.638
3.292AlaGlu: 3.292 ± 1.524
2.469AlaPhe: 2.469 ± 1.811
3.292AlaGly: 3.292 ± 0.229
0.0AlaHis: 0.0 ± 0.0
3.292AlaIle: 3.292 ± 1.236
4.938AlaLys: 4.938 ± 2.553
7.407AlaLeu: 7.407 ± 4.199
3.292AlaMet: 3.292 ± 1.411
3.292AlaAsn: 3.292 ± 1.228
3.292AlaPro: 3.292 ± 2.533
3.292AlaGln: 3.292 ± 2.093
6.584AlaArg: 6.584 ± 1.205
5.761AlaSer: 5.761 ± 3.458
7.407AlaThr: 7.407 ± 3.26
1.646AlaVal: 1.646 ± 1.023
0.823AlaTrp: 0.823 ± 0.814
2.469AlaTyr: 2.469 ± 1.037
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.646CysGly: 1.646 ± 1.207
0.823CysHis: 0.823 ± 0.604
1.646CysIle: 1.646 ± 1.207
0.0CysLys: 0.0 ± 0.0
1.646CysLeu: 1.646 ± 0.614
0.823CysMet: 0.823 ± 0.604
0.0CysAsn: 0.0 ± 0.0
0.823CysPro: 0.823 ± 0.814
0.823CysGln: 0.823 ± 0.604
0.0CysArg: 0.0 ± 0.0
4.115CysSer: 4.115 ± 2.073
0.0CysThr: 0.0 ± 0.0
0.823CysVal: 0.823 ± 0.814
1.646CysTrp: 1.646 ± 1.207
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.469AspAla: 2.469 ± 1.628
0.0AspCys: 0.0 ± 0.0
0.823AspAsp: 0.823 ± 0.604
0.823AspGlu: 0.823 ± 0.876
4.115AspPhe: 4.115 ± 1.972
3.292AspGly: 3.292 ± 0.229
0.0AspHis: 0.0 ± 0.0
3.292AspIle: 3.292 ± 2.093
2.469AspLys: 2.469 ± 1.31
1.646AspLeu: 1.646 ± 1.207
3.292AspMet: 3.292 ± 0.229
2.469AspAsn: 2.469 ± 1.511
7.407AspPro: 7.407 ± 3.11
2.469AspGln: 2.469 ± 1.31
4.115AspArg: 4.115 ± 2.217
1.646AspSer: 1.646 ± 1.023
3.292AspThr: 3.292 ± 1.411
1.646AspVal: 1.646 ± 1.207
0.0AspTrp: 0.0 ± 0.0
1.646AspTyr: 1.646 ± 1.023
0.0AspXaa: 0.0 ± 0.0
Glu
4.938GluAla: 4.938 ± 2.646
0.823GluCys: 0.823 ± 0.604
1.646GluAsp: 1.646 ± 1.628
4.115GluGlu: 4.115 ± 3.018
0.823GluPhe: 0.823 ± 0.604
0.823GluGly: 0.823 ± 0.604
3.292GluHis: 3.292 ± 1.524
0.823GluIle: 0.823 ± 0.814
0.0GluLys: 0.0 ± 0.0
3.292GluLeu: 3.292 ± 0.229
0.823GluMet: 0.823 ± 0.814
0.823GluAsn: 0.823 ± 0.604
2.469GluPro: 2.469 ± 0.43
3.292GluGln: 3.292 ± 1.411
2.469GluArg: 2.469 ± 1.037
3.292GluSer: 3.292 ± 1.067
2.469GluThr: 2.469 ± 1.037
4.938GluVal: 4.938 ± 2.238
0.823GluTrp: 0.823 ± 0.604
0.823GluTyr: 0.823 ± 0.604
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
2.469PheCys: 2.469 ± 1.811
3.292PheAsp: 3.292 ± 0.229
4.115PheGlu: 4.115 ± 3.018
0.823PhePhe: 0.823 ± 0.604
1.646PheGly: 1.646 ± 0.614
0.823PheHis: 0.823 ± 0.604
1.646PheIle: 1.646 ± 1.207
0.823PheLys: 0.823 ± 0.604
2.469PheLeu: 2.469 ± 1.31
0.823PheMet: 0.823 ± 0.604
1.646PheAsn: 1.646 ± 0.746
2.469PhePro: 2.469 ± 0.905
1.646PheGln: 1.646 ± 1.023
2.469PheArg: 2.469 ± 1.511
4.938PheSer: 4.938 ± 2.619
0.0PheThr: 0.0 ± 0.0
1.646PheVal: 1.646 ± 1.207
0.0PheTrp: 0.0 ± 0.0
2.469PheTyr: 2.469 ± 1.31
0.0PheXaa: 0.0 ± 0.0
Gly
4.115GlyAla: 4.115 ± 2.693
0.0GlyCys: 0.0 ± 0.0
4.938GlyAsp: 4.938 ± 1.811
1.646GlyGlu: 1.646 ± 0.614
1.646GlyPhe: 1.646 ± 0.614
0.823GlyGly: 0.823 ± 0.604
0.0GlyHis: 0.0 ± 0.0
0.823GlyIle: 0.823 ± 0.604
1.646GlyLys: 1.646 ± 1.628
4.115GlyLeu: 4.115 ± 0.805
0.823GlyMet: 0.823 ± 0.658
4.115GlyAsn: 4.115 ± 1.448
2.469GlyPro: 2.469 ± 1.811
0.823GlyGln: 0.823 ± 0.876
8.23GlyArg: 8.23 ± 4.118
4.938GlySer: 4.938 ± 1.998
6.584GlyThr: 6.584 ± 4.09
5.761GlyVal: 5.761 ± 3.395
2.469GlyTrp: 2.469 ± 1.511
4.115GlyTyr: 4.115 ± 1.972
0.0GlyXaa: 0.0 ± 0.0
His
0.823HisAla: 0.823 ± 0.604
0.823HisCys: 0.823 ± 0.876
1.646HisAsp: 1.646 ± 1.207
1.646HisGlu: 1.646 ± 1.207
0.823HisPhe: 0.823 ± 0.876
0.823HisGly: 0.823 ± 0.876
0.0HisHis: 0.0 ± 0.0
0.823HisIle: 0.823 ± 0.604
0.0HisLys: 0.0 ± 0.0
2.469HisLeu: 2.469 ± 1.811
0.823HisMet: 0.823 ± 0.604
0.823HisAsn: 0.823 ± 0.814
0.823HisPro: 0.823 ± 0.814
0.0HisGln: 0.0 ± 0.0
0.823HisArg: 0.823 ± 0.876
0.823HisSer: 0.823 ± 0.876
0.823HisThr: 0.823 ± 0.604
2.469HisVal: 2.469 ± 1.037
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.115IleAla: 4.115 ± 1.849
0.0IleCys: 0.0 ± 0.0
6.584IleAsp: 6.584 ± 0.458
3.292IleGlu: 3.292 ± 1.411
0.0IlePhe: 0.0 ± 0.0
2.469IleGly: 2.469 ± 0.905
0.823IleHis: 0.823 ± 0.876
2.469IleIle: 2.469 ± 0.905
4.115IleLys: 4.115 ± 1.972
4.115IleLeu: 4.115 ± 2.099
2.469IleMet: 2.469 ± 1.037
1.646IleAsn: 1.646 ± 0.746
4.938IlePro: 4.938 ± 0.55
2.469IleGln: 2.469 ± 1.811
2.469IleArg: 2.469 ± 2.628
4.938IleSer: 4.938 ± 1.842
2.469IleThr: 2.469 ± 1.037
1.646IleVal: 1.646 ± 1.628
1.646IleTrp: 1.646 ± 0.614
0.823IleTyr: 0.823 ± 0.604
0.0IleXaa: 0.0 ± 0.0
Lys
5.761LysAla: 5.761 ± 2.466
1.646LysCys: 1.646 ± 0.614
0.823LysAsp: 0.823 ± 0.604
0.823LysGlu: 0.823 ± 0.604
1.646LysPhe: 1.646 ± 1.207
3.292LysGly: 3.292 ± 1.524
0.823LysHis: 0.823 ± 0.604
0.823LysIle: 0.823 ± 0.814
0.0LysLys: 0.0 ± 0.0
3.292LysLeu: 3.292 ± 1.411
1.646LysMet: 1.646 ± 0.746
0.823LysAsn: 0.823 ± 0.604
3.292LysPro: 3.292 ± 2.093
2.469LysGln: 2.469 ± 0.905
1.646LysArg: 1.646 ± 1.628
3.292LysSer: 3.292 ± 2.093
0.823LysThr: 0.823 ± 0.876
3.292LysVal: 3.292 ± 1.228
0.823LysTrp: 0.823 ± 0.814
2.469LysTyr: 2.469 ± 0.905
0.0LysXaa: 0.0 ± 0.0
Leu
6.584LeuAla: 6.584 ± 1.543
3.292LeuCys: 3.292 ± 1.411
3.292LeuAsp: 3.292 ± 0.229
2.469LeuGlu: 2.469 ± 1.811
0.823LeuPhe: 0.823 ± 0.604
5.761LeuGly: 5.761 ± 1.132
0.823LeuHis: 0.823 ± 0.814
6.584LeuIle: 6.584 ± 1.543
4.938LeuLys: 4.938 ± 2.619
8.23LeuLeu: 8.23 ± 2.849
2.469LeuMet: 2.469 ± 0.443
2.469LeuAsn: 2.469 ± 0.43
9.053LeuPro: 9.053 ± 1.113
5.761LeuGln: 5.761 ± 0.332
7.407LeuArg: 7.407 ± 1.019
2.469LeuSer: 2.469 ± 2.628
3.292LeuThr: 3.292 ± 0.229
5.761LeuVal: 5.761 ± 0.332
1.646LeuTrp: 1.646 ± 0.614
2.469LeuTyr: 2.469 ± 0.905
0.0LeuXaa: 0.0 ± 0.0
Met
4.115MetAla: 4.115 ± 0.597
0.823MetCys: 0.823 ± 0.604
1.646MetAsp: 1.646 ± 1.023
0.823MetGlu: 0.823 ± 0.876
0.823MetPhe: 0.823 ± 0.604
1.646MetGly: 1.646 ± 0.614
0.823MetHis: 0.823 ± 0.604
0.0MetIle: 0.0 ± 0.0
0.823MetLys: 0.823 ± 0.604
2.469MetLeu: 2.469 ± 0.43
0.823MetMet: 0.823 ± 0.604
0.823MetAsn: 0.823 ± 0.814
4.115MetPro: 4.115 ± 2.073
1.646MetGln: 1.646 ± 0.614
0.823MetArg: 0.823 ± 0.604
5.761MetSer: 5.761 ± 2.223
1.646MetThr: 1.646 ± 1.752
1.646MetVal: 1.646 ± 1.207
0.0MetTrp: 0.0 ± 0.0
0.823MetTyr: 0.823 ± 0.604
0.0MetXaa: 0.0 ± 0.0
Asn
4.115AsnAla: 4.115 ± 0.597
0.823AsnCys: 0.823 ± 0.604
0.823AsnAsp: 0.823 ± 0.604
2.469AsnGlu: 2.469 ± 2.628
3.292AsnPhe: 3.292 ± 2.362
0.823AsnGly: 0.823 ± 0.814
0.0AsnHis: 0.0 ± 0.0
3.292AsnIle: 3.292 ± 1.236
0.823AsnLys: 0.823 ± 0.876
0.823AsnLeu: 0.823 ± 0.604
0.0AsnMet: 0.0 ± 0.0
1.646AsnAsn: 1.646 ± 1.023
2.469AsnPro: 2.469 ± 1.721
1.646AsnGln: 1.646 ± 1.023
7.407AsnArg: 7.407 ± 4.386
2.469AsnSer: 2.469 ± 0.43
1.646AsnThr: 1.646 ± 0.614
4.115AsnVal: 4.115 ± 1.877
1.646AsnTrp: 1.646 ± 1.207
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.646ProAla: 1.646 ± 0.746
0.823ProCys: 0.823 ± 0.814
2.469ProAsp: 2.469 ± 1.31
4.115ProGlu: 4.115 ± 1.972
3.292ProPhe: 3.292 ± 1.524
7.407ProGly: 7.407 ± 1.29
0.823ProHis: 0.823 ± 0.876
4.938ProIle: 4.938 ± 0.55
4.938ProLys: 4.938 ± 3.699
5.761ProLeu: 5.761 ± 2.007
0.823ProMet: 0.823 ± 0.604
5.761ProAsn: 5.761 ± 2.935
1.646ProPro: 1.646 ± 0.746
2.469ProGln: 2.469 ± 1.037
6.584ProArg: 6.584 ± 2.985
2.469ProSer: 2.469 ± 1.511
4.115ProThr: 4.115 ± 1.424
3.292ProVal: 3.292 ± 1.228
0.0ProTrp: 0.0 ± 0.0
1.646ProTyr: 1.646 ± 1.207
0.0ProXaa: 0.0 ± 0.0
Gln
2.469GlnAla: 2.469 ± 1.721
0.0GlnCys: 0.0 ± 0.0
0.823GlnAsp: 0.823 ± 0.604
3.292GlnGlu: 3.292 ± 0.229
4.938GlnPhe: 4.938 ± 1.842
2.469GlnGly: 2.469 ± 0.43
1.646GlnHis: 1.646 ± 0.614
3.292GlnIle: 3.292 ± 0.229
1.646GlnLys: 1.646 ± 0.614
4.938GlnLeu: 4.938 ± 2.646
2.469GlnMet: 2.469 ± 0.43
1.646GlnAsn: 1.646 ± 1.023
2.469GlnPro: 2.469 ± 1.037
1.646GlnGln: 1.646 ± 1.628
4.115GlnArg: 4.115 ± 0.906
0.823GlnSer: 0.823 ± 0.876
4.938GlnThr: 4.938 ± 2.142
2.469GlnVal: 2.469 ± 1.31
0.0GlnTrp: 0.0 ± 0.0
1.646GlnTyr: 1.646 ± 0.614
0.0GlnXaa: 0.0 ± 0.0
Arg
6.584ArgAla: 6.584 ± 0.458
0.823ArgCys: 0.823 ± 0.604
3.292ArgAsp: 3.292 ± 1.492
1.646ArgGlu: 1.646 ± 1.752
3.292ArgPhe: 3.292 ± 1.524
9.877ArgGly: 9.877 ± 7.196
2.469ArgHis: 2.469 ± 1.037
4.938ArgIle: 4.938 ± 2.969
0.823ArgLys: 0.823 ± 0.604
4.115ArgLeu: 4.115 ± 1.703
1.646ArgMet: 1.646 ± 0.746
4.115ArgAsn: 4.115 ± 2.217
4.938ArgPro: 4.938 ± 0.917
6.584ArgGln: 6.584 ± 3.72
4.938ArgArg: 4.938 ± 4.234
4.115ArgSer: 4.115 ± 3.216
5.761ArgThr: 5.761 ± 3.143
5.761ArgVal: 5.761 ± 1.262
1.646ArgTrp: 1.646 ± 1.023
0.823ArgTyr: 0.823 ± 0.604
0.0ArgXaa: 0.0 ± 0.0
Ser
5.761SerAla: 5.761 ± 2.47
0.0SerCys: 0.0 ± 0.0
2.469SerAsp: 2.469 ± 1.721
2.469SerGlu: 2.469 ± 1.511
2.469SerPhe: 2.469 ± 1.31
5.761SerGly: 5.761 ± 2.223
0.0SerHis: 0.0 ± 0.0
4.115SerIle: 4.115 ± 1.972
4.115SerLys: 4.115 ± 1.448
4.938SerLeu: 4.938 ± 2.619
3.292SerMet: 3.292 ± 3.257
3.292SerAsn: 3.292 ± 2.045
4.938SerPro: 4.938 ± 0.55
3.292SerGln: 3.292 ± 2.045
6.584SerArg: 6.584 ± 2.985
8.23SerSer: 8.23 ± 3.036
5.761SerThr: 5.761 ± 2.686
5.761SerVal: 5.761 ± 2.223
0.823SerTrp: 0.823 ± 0.604
3.292SerTyr: 3.292 ± 1.228
0.0SerXaa: 0.0 ± 0.0
Thr
5.761ThrAla: 5.761 ± 1.603
0.0ThrCys: 0.0 ± 0.0
2.469ThrAsp: 2.469 ± 0.43
1.646ThrGlu: 1.646 ± 1.628
1.646ThrPhe: 1.646 ± 0.614
3.292ThrGly: 3.292 ± 2.093
1.646ThrHis: 1.646 ± 1.752
4.938ThrIle: 4.938 ± 1.767
3.292ThrLys: 3.292 ± 1.411
9.053ThrLeu: 9.053 ± 2.209
0.0ThrMet: 0.0 ± 0.0
2.469ThrAsn: 2.469 ± 1.628
0.823ThrPro: 0.823 ± 0.814
1.646ThrGln: 1.646 ± 1.207
6.584ThrArg: 6.584 ± 2.718
4.938ThrSer: 4.938 ± 0.86
3.292ThrThr: 3.292 ± 2.045
4.938ThrVal: 4.938 ± 1.409
0.823ThrTrp: 0.823 ± 0.876
1.646ThrTyr: 1.646 ± 1.023
0.0ThrXaa: 0.0 ± 0.0
Val
8.23ValAla: 8.23 ± 1.11
0.823ValCys: 0.823 ± 0.604
1.646ValAsp: 1.646 ± 1.207
1.646ValGlu: 1.646 ± 0.614
1.646ValPhe: 1.646 ± 0.614
2.469ValGly: 2.469 ± 1.628
0.823ValHis: 0.823 ± 0.604
1.646ValIle: 1.646 ± 0.614
2.469ValLys: 2.469 ± 0.905
7.407ValLeu: 7.407 ± 1.62
3.292ValMet: 3.292 ± 1.228
1.646ValAsn: 1.646 ± 0.614
4.115ValPro: 4.115 ± 0.597
3.292ValGln: 3.292 ± 1.236
4.115ValArg: 4.115 ± 2.073
8.23ValSer: 8.23 ± 3.489
4.938ValThr: 4.938 ± 1.842
9.053ValVal: 9.053 ± 2.209
1.646ValTrp: 1.646 ± 1.752
1.646ValTyr: 1.646 ± 1.628
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.823TrpCys: 0.823 ± 0.604
0.0TrpAsp: 0.0 ± 0.0
2.469TrpGlu: 2.469 ± 0.905
0.0TrpPhe: 0.0 ± 0.0
2.469TrpGly: 2.469 ± 1.628
0.0TrpHis: 0.0 ± 0.0
1.646TrpIle: 1.646 ± 1.207
0.0TrpLys: 0.0 ± 0.0
2.469TrpLeu: 2.469 ± 0.43
0.823TrpMet: 0.823 ± 0.876
0.0TrpAsn: 0.0 ± 0.0
0.823TrpPro: 0.823 ± 0.876
0.0TrpGln: 0.0 ± 0.0
0.823TrpArg: 0.823 ± 0.604
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.646TrpVal: 1.646 ± 1.752
1.646TrpTrp: 1.646 ± 1.207
2.469TrpTyr: 2.469 ± 0.905
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.646TyrAla: 1.646 ± 1.023
0.0TyrCys: 0.0 ± 0.0
2.469TyrAsp: 2.469 ± 0.905
0.823TyrGlu: 0.823 ± 0.604
1.646TyrPhe: 1.646 ± 0.746
0.0TyrGly: 0.0 ± 0.0
1.646TyrHis: 1.646 ± 1.207
2.469TyrIle: 2.469 ± 0.905
1.646TyrLys: 1.646 ± 1.207
4.938TyrLeu: 4.938 ± 0.55
0.823TyrMet: 0.823 ± 0.604
0.823TyrAsn: 0.823 ± 0.604
1.646TyrPro: 1.646 ± 0.614
2.469TyrGln: 2.469 ± 1.31
0.0TyrArg: 0.0 ± 0.0
4.115TyrSer: 4.115 ± 1.877
1.646TyrThr: 1.646 ± 0.614
2.469TyrVal: 2.469 ± 0.905
0.0TyrTrp: 0.0 ± 0.0
0.823TyrTyr: 0.823 ± 0.876
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1216 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
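A minimal sketch of what protein-level bootstrapping could look like: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions for illustration; the exact procedure behind the values in the table is not specified here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per-mille frequency of one
    dipeptide over n_boot protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteins (hypothetical), standing in for the proteins of a proteome.
mean_aa, err_aa = bootstrap_error(["AAAA", "GGGG", "AGAG"], "AA")
```

With only a handful of proteins, each replicate is drawn from very few distinct combinations, which is one reason the estimated errors can be large relative to the values themselves.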

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski