Amino acid dipeptide frequency for Niminivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred randomly and with equal probability in proteins, each one would be present with a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
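As a rough illustration of how such frequencies could be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the database's actual pipeline: the function name `dipeptide_per_mille`, the toy sequences, and the decision not to count pairs that span two proteins are assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    alphabet = STANDARD + "X"
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (assumption for this sketch).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs inside one protein only.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25 %).
UNIFORM_PER_MILLE = 1000.0 / 400

toy = ["MKKLA", "AKKR"]  # made-up sequences, not the Niminivirus proteome
freqs = dipeptide_per_mille(toy)
over = sorted(d for d, f in freqs.items() if f > UNIFORM_PER_MILLE)
```

In this toy input there are 7 dipeptides in total, and `KK` occurs twice, so it ends up well above the uniform expectation; with a real proteome the same comparison would separate over- and underrepresented pairs.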

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
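The unit conversion can be sketched as follows, using the AlaArg value from the table as the worked example. The helper names are hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_arg = 13.415  # AlaArg frequency in per mille, taken from the table
fraction = per_mille_to_fraction(ala_arg)
percent = per_mille_to_percent(ala_arg)

# Ratio to the uniform expectation of 1/400 for a standard dipeptide.
fold_over_uniform = fraction / (1 / 400)
```

Under the uniform assumption, AlaArg would therefore be roughly five times more frequent than expected in this proteome.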

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.22 ± 1.929
AlaCys: 1.22 ± 1.929
AlaAsp: 6.098 ± 1.867
AlaGlu: 0.0 ± 0.0
AlaPhe: 1.22 ± 2.116
AlaGly: 1.22 ± 0.886
AlaHis: 1.22 ± 0.89
AlaIle: 1.22 ± 0.89
AlaLys: 2.439 ± 0.748
AlaLeu: 7.317 ± 1.467
AlaMet: 3.659 ± 1.985
AlaAsn: 3.659 ± 1.377
AlaPro: 2.439 ± 1.771
AlaGln: 1.22 ± 0.89
AlaArg: 13.415 ± 5.509
AlaSer: 1.22 ± 0.886
AlaThr: 0.0 ± 0.0
AlaVal: 2.439 ± 1.771
AlaTrp: 1.22 ± 0.89
AlaTyr: 4.878 ± 1.497
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.22 ± 0.886
CysCys: 0.0 ± 0.0
CysAsp: 1.22 ± 0.89
CysGlu: 0.0 ± 0.0
CysPhe: 3.659 ± 2.321
CysGly: 0.0 ± 0.0
CysHis: 1.22 ± 0.89
CysIle: 1.22 ± 0.89
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 1.22 ± 0.886
CysAsn: 0.0 ± 0.0
CysPro: 1.22 ± 1.929
CysGln: 2.439 ± 0.748
CysArg: 2.439 ± 0.748
CysSer: 1.22 ± 0.89
CysThr: 1.22 ± 1.929
CysVal: 1.22 ± 0.89
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.659 ± 1.377
AspCys: 0.0 ± 0.0
AspAsp: 4.878 ± 3.56
AspGlu: 4.878 ± 2.206
AspPhe: 2.439 ± 0.748
AspGly: 7.317 ± 3.589
AspHis: 0.0 ± 0.0
AspIle: 6.098 ± 2.032
AspLys: 3.659 ± 1.386
AspLeu: 7.317 ± 2.245
AspMet: 0.0 ± 0.0
AspAsn: 2.439 ± 1.771
AspPro: 1.22 ± 0.89
AspGln: 1.22 ± 0.886
AspArg: 2.439 ± 2.961
AspSer: 1.22 ± 0.886
AspThr: 3.659 ± 1.471
AspVal: 9.756 ± 1.989
AspTrp: 1.22 ± 0.89
AspTyr: 1.22 ± 0.886
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.439 ± 1.964
GluCys: 0.0 ± 0.0
GluAsp: 2.439 ± 0.748
GluGlu: 1.22 ± 0.886
GluPhe: 0.0 ± 0.0
GluGly: 0.0 ± 0.0
GluHis: 1.22 ± 0.89
GluIle: 1.22 ± 0.89
GluLys: 2.439 ± 1.78
GluLeu: 3.659 ± 2.67
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 0.0 ± 0.0
GluGln: 0.0 ± 0.0
GluArg: 1.22 ± 0.89
GluSer: 4.878 ± 5.469
GluThr: 2.439 ± 0.748
GluVal: 0.0 ± 0.0
GluTrp: 0.0 ± 0.0
GluTyr: 1.22 ± 0.89
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.659 ± 1.386
PheCys: 2.439 ± 1.745
PheAsp: 2.439 ± 1.78
PheGlu: 2.439 ± 0.748
PhePhe: 7.317 ± 7.943
PheGly: 2.439 ± 0.748
PheHis: 1.22 ± 0.89
PheIle: 1.22 ± 2.116
PheLys: 2.439 ± 1.771
PheLeu: 4.878 ± 6.141
PheMet: 0.0 ± 0.0
PheAsn: 3.659 ± 2.67
PhePro: 0.0 ± 0.0
PheGln: 2.439 ± 1.771
PheArg: 3.659 ± 1.66
PheSer: 9.756 ± 7.624
PheThr: 4.878 ± 1.727
PheVal: 1.22 ± 0.886
PheTrp: 0.0 ± 0.0
PheTyr: 1.22 ± 0.886
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.659 ± 2.657
GlyCys: 1.22 ± 0.89
GlyAsp: 3.659 ± 1.985
GlyGlu: 1.22 ± 0.89
GlyPhe: 3.659 ± 2.657
GlyGly: 0.0 ± 0.0
GlyHis: 2.439 ± 2.035
GlyIle: 1.22 ± 0.886
GlyLys: 7.317 ± 2.245
GlyLeu: 6.098 ± 3.631
GlyMet: 1.22 ± 0.886
GlyAsn: 4.878 ± 1.428
GlyPro: 2.439 ± 0.748
GlyGln: 0.0 ± 0.0
GlyArg: 1.22 ± 0.89
GlySer: 3.659 ± 1.377
GlyThr: 1.22 ± 1.929
GlyVal: 2.439 ± 1.771
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.659 ± 1.377
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.659 ± 1.386
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 2.439 ± 1.78
HisPhe: 2.439 ± 1.78
HisGly: 1.22 ± 0.886
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 4.878 ± 3.56
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 3.659 ± 2.321
HisGln: 2.439 ± 1.78
HisArg: 1.22 ± 2.116
HisSer: 2.439 ± 1.745
HisThr: 1.22 ± 1.929
HisVal: 1.22 ± 2.116
HisTrp: 0.0 ± 0.0
HisTyr: 1.22 ± 0.89
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.659 ± 1.377
IleCys: 1.22 ± 0.886
IleAsp: 3.659 ± 1.386
IleGlu: 1.22 ± 0.89
IlePhe: 6.098 ± 3.598
IleGly: 2.439 ± 1.964
IleHis: 1.22 ± 0.89
IleIle: 1.22 ± 0.886
IleLys: 2.439 ± 2.035
IleLeu: 7.317 ± 1.985
IleMet: 2.439 ± 3.425
IleAsn: 2.439 ± 0.748
IlePro: 1.22 ± 0.886
IleGln: 3.659 ± 1.386
IleArg: 1.22 ± 0.886
IleSer: 1.22 ± 0.89
IleThr: 2.439 ± 0.748
IleVal: 4.878 ± 1.497
IleTrp: 1.22 ± 0.89
IleTyr: 2.439 ± 2.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.22 ± 0.886
LysCys: 0.0 ± 0.0
LysAsp: 7.317 ± 2.771
LysGlu: 1.22 ± 0.89
LysPhe: 1.22 ± 1.929
LysGly: 0.0 ± 0.0
LysHis: 3.659 ± 1.66
LysIle: 2.439 ± 0.748
LysLys: 7.317 ± 2.754
LysLeu: 3.659 ± 1.66
LysMet: 2.439 ± 1.78
LysAsn: 1.22 ± 0.89
LysPro: 4.878 ± 2.979
LysGln: 2.439 ± 1.745
LysArg: 9.756 ± 5.667
LysSer: 7.317 ± 3.003
LysThr: 3.659 ± 1.377
LysVal: 6.098 ± 3.046
LysTrp: 1.22 ± 0.89
LysTyr: 3.659 ± 1.386
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.878 ± 1.727
LeuCys: 1.22 ± 0.89
LeuAsp: 2.439 ± 1.78
LeuGlu: 2.439 ± 1.964
LeuPhe: 6.098 ± 1.467
LeuGly: 4.878 ± 2.191
LeuHis: 3.659 ± 2.321
LeuIle: 3.659 ± 2.361
LeuLys: 7.317 ± 1.467
LeuLeu: 3.659 ± 2.787
LeuMet: 3.659 ± 1.299
LeuAsn: 3.659 ± 1.471
LeuPro: 7.317 ± 7.186
LeuGln: 0.0 ± 0.0
LeuArg: 3.659 ± 2.787
LeuSer: 10.976 ± 2.783
LeuThr: 8.537 ± 3.551
LeuVal: 3.659 ± 1.66
LeuTrp: 0.0 ± 0.0
LeuTyr: 8.537 ± 2.279
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.22 ± 2.116
MetCys: 0.0 ± 0.0
MetAsp: 4.878 ± 1.428
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 2.439 ± 1.745
MetLys: 2.439 ± 0.748
MetLeu: 2.439 ± 0.748
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 3.659 ± 1.386
MetGln: 3.659 ± 2.361
MetArg: 1.22 ± 0.886
MetSer: 2.439 ± 1.78
MetThr: 1.22 ± 0.886
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.878 ± 3.491
AsnCys: 1.22 ± 0.89
AsnAsp: 2.439 ± 1.771
AsnGlu: 2.439 ± 1.78
AsnPhe: 3.659 ± 2.67
AsnGly: 2.439 ± 1.745
AsnHis: 1.22 ± 0.89
AsnIle: 6.098 ± 1.467
AsnLys: 2.439 ± 0.748
AsnLeu: 1.22 ± 0.886
AsnMet: 3.659 ± 1.386
AsnAsn: 2.439 ± 0.748
AsnPro: 1.22 ± 0.886
AsnGln: 0.0 ± 0.0
AsnArg: 3.659 ± 2.305
AsnSer: 1.22 ± 0.89
AsnThr: 4.878 ± 2.191
AsnVal: 2.439 ± 1.771
AsnTrp: 1.22 ± 0.89
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.22 ± 1.929
ProCys: 1.22 ± 0.89
ProAsp: 2.439 ± 2.028
ProGlu: 0.0 ± 0.0
ProPhe: 2.439 ± 0.748
ProGly: 2.439 ± 1.771
ProHis: 1.22 ± 0.89
ProIle: 1.22 ± 2.116
ProLys: 1.22 ± 0.886
ProLeu: 2.439 ± 1.771
ProMet: 0.0 ± 0.0
ProAsn: 2.439 ± 1.78
ProPro: 0.0 ± 0.0
ProGln: 1.22 ± 0.89
ProArg: 2.439 ± 1.964
ProSer: 7.317 ± 5.178
ProThr: 4.878 ± 1.692
ProVal: 8.537 ± 1.55
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.439 ± 2.961
GlnCys: 2.439 ± 0.748
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.0 ± 0.0
GlnPhe: 1.22 ± 0.886
GlnGly: 6.098 ± 3.046
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 4.878 ± 1.692
GlnLeu: 3.659 ± 1.471
GlnMet: 0.0 ± 0.0
GlnAsn: 2.439 ± 0.748
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 1.22 ± 0.89
GlnSer: 2.439 ± 1.771
GlnThr: 2.439 ± 1.771
GlnVal: 1.22 ± 0.89
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.659 ± 2.67
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.439 ± 0.748
ArgCys: 1.22 ± 0.886
ArgAsp: 3.659 ± 1.66
ArgGlu: 1.22 ± 0.886
ArgPhe: 4.878 ± 1.715
ArgGly: 2.439 ± 2.961
ArgHis: 2.439 ± 1.78
ArgIle: 7.317 ± 1.699
ArgLys: 6.098 ± 3.631
ArgLeu: 2.439 ± 1.745
ArgMet: 1.22 ± 0.886
ArgAsn: 4.878 ± 2.843
ArgPro: 3.659 ± 1.66
ArgGln: 2.439 ± 1.745
ArgArg: 7.317 ± 2.754
ArgSer: 7.317 ± 3.667
ArgThr: 1.22 ± 0.89
ArgVal: 2.439 ± 2.028
ArgTrp: 1.22 ± 0.886
ArgTyr: 6.098 ± 2.167
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.317 ± 1.985
SerCys: 1.22 ± 2.116
SerAsp: 4.878 ± 1.497
SerGlu: 1.22 ± 1.929
SerPhe: 6.098 ± 5.833
SerGly: 3.659 ± 1.377
SerHis: 1.22 ± 1.929
SerIle: 4.878 ± 2.866
SerLys: 3.659 ± 1.386
SerLeu: 8.537 ± 5.234
SerMet: 1.22 ± 0.886
SerAsn: 7.317 ± 1.699
SerPro: 0.0 ± 0.0
SerGln: 2.439 ± 1.771
SerArg: 3.659 ± 1.66
SerSer: 8.537 ± 6.676
SerThr: 8.537 ± 6.866
SerVal: 6.098 ± 2.041
SerTrp: 2.439 ± 0.748
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.659 ± 1.377
ThrCys: 0.0 ± 0.0
ThrAsp: 3.659 ± 1.985
ThrGlu: 2.439 ± 3.859
ThrPhe: 1.22 ± 0.886
ThrGly: 3.659 ± 1.386
ThrHis: 3.659 ± 1.471
ThrIle: 6.098 ± 3.103
ThrLys: 4.878 ± 2.206
ThrLeu: 4.878 ± 1.692
ThrMet: 0.0 ± 0.0
ThrAsn: 1.22 ± 1.929
ThrPro: 3.659 ± 2.305
ThrGln: 2.439 ± 1.771
ThrArg: 3.659 ± 1.985
ThrSer: 2.439 ± 0.748
ThrThr: 0.0 ± 0.0
ThrVal: 4.878 ± 2.191
ThrTrp: 2.439 ± 1.78
ThrTyr: 3.659 ± 1.377
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.659 ± 1.377
ValCys: 3.659 ± 2.67
ValAsp: 4.878 ± 1.497
ValGlu: 0.0 ± 0.0
ValPhe: 2.439 ± 0.748
ValGly: 6.098 ± 3.046
ValHis: 0.0 ± 0.0
ValIle: 6.098 ± 2.032
ValLys: 3.659 ± 1.66
ValLeu: 7.317 ± 3.7
ValMet: 0.0 ± 0.0
ValAsn: 3.659 ± 1.377
ValPro: 3.659 ± 2.305
ValGln: 2.439 ± 1.771
ValArg: 3.659 ± 1.985
ValSer: 6.098 ± 3.046
ValThr: 3.659 ± 1.377
ValVal: 3.659 ± 1.377
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.22 ± 0.89
TrpAsp: 1.22 ± 0.89
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 2.439 ± 0.748
TrpLeu: 2.439 ± 1.78
TrpMet: 1.22 ± 0.89
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.22 ± 0.89
TrpArg: 1.22 ± 0.886
TrpSer: 0.0 ± 0.0
TrpThr: 1.22 ± 0.89
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.22 ± 0.89
TyrCys: 0.0 ± 0.0
TyrAsp: 2.439 ± 0.748
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.22 ± 2.116
TyrGly: 4.878 ± 3.543
TyrHis: 2.439 ± 0.748
TyrIle: 0.0 ± 0.0
TyrLys: 3.659 ± 1.386
TyrLeu: 6.098 ± 1.467
TyrMet: 2.439 ± 2.534
TyrAsn: 2.439 ± 0.748
TyrPro: 2.439 ± 1.78
TyrGln: 2.439 ± 2.035
TyrArg: 3.659 ± 2.657
TyrSer: 1.22 ± 0.886
TyrThr: 1.22 ± 0.886
TyrVal: 2.439 ± 0.748
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.22 ± 0.886
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (821 amino acids)

Note: The error has been estimated with bootstrapping (×100) at the protein level.
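A minimal sketch of a protein-level bootstrap is given below. It resamples whole proteins with replacement 100 times and recomputes the frequency of one dipeptide each time. The source does not say which statistic is reported as the error, so taking the sample standard deviation of the replicates is an assumption here, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.stdev(replicates)


toy = ["MKKLA", "AKKR", "GGSG"]  # made-up sequences
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")  # absent dipeptide, so no spread
```

A dipeptide that never occurs has frequency 0 in every replicate, which matches the `0.0 ± 0.0` entries in the table. With only 4 proteins in the proteome, resampling at the protein level yields few distinct replicates, which may explain why many cells share the same error values.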

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski