Amino acid dipepetide frequency for Beihai sobemo-like virus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.056AlaAla: 12.056 ± 1.064
3.173AlaCys: 3.173 ± 0.154
3.173AlaAsp: 3.173 ± 0.154
5.711AlaGlu: 5.711 ± 1.377
1.904AlaPhe: 1.904 ± 0.457
9.518AlaGly: 9.518 ± 0.913
3.807AlaHis: 3.807 ± 0.46
1.904AlaIle: 1.904 ± 0.917
4.442AlaLys: 4.442 ± 0.765
10.787AlaLeu: 10.787 ± 3.05
2.538AlaMet: 2.538 ± 0.733
1.904AlaAsn: 1.904 ± 1.832
3.807AlaPro: 3.807 ± 0.915
4.442AlaGln: 4.442 ± 0.765
4.442AlaArg: 4.442 ± 0.765
8.249AlaSer: 8.249 ± 4.273
5.711AlaThr: 5.711 ± 5.495
6.345AlaVal: 6.345 ± 1.067
1.269AlaTrp: 1.269 ± 0.763
5.711AlaTyr: 5.711 ± 2.751
0.0AlaXaa: 0.0 ± 0.0
Cys
1.269CysAla: 1.269 ± 0.611
0.0CysCys: 0.0 ± 0.0
1.269CysAsp: 1.269 ± 0.611
1.269CysGlu: 1.269 ± 0.611
1.269CysPhe: 1.269 ± 0.611
1.269CysGly: 1.269 ± 0.611
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.635CysLeu: 0.635 ± 0.306
0.0CysMet: 0.0 ± 0.0
0.635CysAsn: 0.635 ± 0.306
1.269CysPro: 1.269 ± 0.763
1.904CysGln: 1.904 ± 0.917
0.635CysArg: 0.635 ± 0.306
0.0CysSer: 0.0 ± 0.0
1.269CysThr: 1.269 ± 0.763
1.269CysVal: 1.269 ± 0.611
0.635CysTrp: 0.635 ± 1.069
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.98AspAla: 6.98 ± 1.988
0.635AspCys: 0.635 ± 0.306
1.904AspAsp: 1.904 ± 0.917
3.173AspGlu: 3.173 ± 1.528
2.538AspPhe: 2.538 ± 0.152
5.076AspGly: 5.076 ± 0.303
1.904AspHis: 1.904 ± 0.457
3.173AspIle: 3.173 ± 0.154
3.807AspLys: 3.807 ± 0.915
5.076AspLeu: 5.076 ± 0.303
2.538AspMet: 2.538 ± 1.223
1.904AspAsn: 1.904 ± 0.917
1.269AspPro: 1.269 ± 0.611
1.269AspGln: 1.269 ± 0.611
3.807AspArg: 3.807 ± 0.46
5.076AspSer: 5.076 ± 0.303
0.0AspThr: 0.0 ± 0.0
4.442AspVal: 4.442 ± 1.984
1.904AspTrp: 1.904 ± 0.917
1.269AspTyr: 1.269 ± 0.611
0.0AspXaa: 0.0 ± 0.0
Glu
5.076GluAla: 5.076 ± 2.445
0.0GluCys: 0.0 ± 0.0
4.442GluAsp: 4.442 ± 2.14
8.249GluGlu: 8.249 ± 3.974
1.269GluPhe: 1.269 ± 0.611
5.076GluGly: 5.076 ± 2.445
0.635GluHis: 0.635 ± 0.306
1.904GluIle: 1.904 ± 0.917
2.538GluLys: 2.538 ± 1.223
5.076GluLeu: 5.076 ± 1.071
1.904GluMet: 1.904 ± 0.808
0.0GluAsn: 0.0 ± 0.0
2.538GluPro: 2.538 ± 1.223
1.269GluGln: 1.269 ± 0.611
5.711GluArg: 5.711 ± 2.751
3.807GluSer: 3.807 ± 0.46
1.904GluThr: 1.904 ± 0.457
3.807GluVal: 3.807 ± 0.915
1.269GluTrp: 1.269 ± 0.611
1.904GluTyr: 1.904 ± 0.457
0.0GluXaa: 0.0 ± 0.0
Phe
1.904PheAla: 1.904 ± 0.917
0.635PheCys: 0.635 ± 0.306
1.269PheAsp: 1.269 ± 0.763
3.173PheGlu: 3.173 ± 1.528
1.269PhePhe: 1.269 ± 0.611
3.173PheGly: 3.173 ± 0.154
1.904PheHis: 1.904 ± 0.917
1.904PheIle: 1.904 ± 0.917
1.269PheLys: 1.269 ± 0.611
1.904PheLeu: 1.904 ± 0.917
0.635PheMet: 0.635 ± 0.306
0.635PheAsn: 0.635 ± 0.306
0.635PhePro: 0.635 ± 0.306
1.269PheGln: 1.269 ± 0.763
1.269PheArg: 1.269 ± 0.611
1.269PheSer: 1.269 ± 0.763
4.442PheThr: 4.442 ± 0.765
2.538PheVal: 2.538 ± 0.152
0.635PheTrp: 0.635 ± 0.306
0.635PheTyr: 0.635 ± 0.306
0.0PheXaa: 0.0 ± 0.0
Gly
9.518GlyAla: 9.518 ± 0.913
1.269GlyCys: 1.269 ± 0.763
7.614GlyAsp: 7.614 ± 0.919
3.173GlyGlu: 3.173 ± 1.528
1.904GlyPhe: 1.904 ± 0.917
4.442GlyGly: 4.442 ± 0.765
2.538GlyHis: 2.538 ± 1.526
2.538GlyIle: 2.538 ± 0.152
1.904GlyLys: 1.904 ± 0.917
4.442GlyLeu: 4.442 ± 0.765
3.807GlyMet: 3.807 ± 2.289
2.538GlyAsn: 2.538 ± 0.152
5.711GlyPro: 5.711 ± 1.377
3.173GlyGln: 3.173 ± 1.528
5.076GlyArg: 5.076 ± 3.052
3.807GlySer: 3.807 ± 1.834
5.711GlyThr: 5.711 ± 0.002
6.98GlyVal: 6.98 ± 0.761
3.173GlyTrp: 3.173 ± 0.154
1.269GlyTyr: 1.269 ± 0.611
0.0GlyXaa: 0.0 ± 0.0
His
1.904HisAla: 1.904 ± 0.457
0.0HisCys: 0.0 ± 0.0
1.269HisAsp: 1.269 ± 0.611
0.635HisGlu: 0.635 ± 0.306
0.0HisPhe: 0.0 ± 0.0
1.269HisGly: 1.269 ± 0.763
1.904HisHis: 1.904 ± 1.832
0.635HisIle: 0.635 ± 0.306
0.635HisLys: 0.635 ± 1.069
3.173HisLeu: 3.173 ± 0.154
0.0HisMet: 0.0 ± 0.0
1.269HisAsn: 1.269 ± 0.611
2.538HisPro: 2.538 ± 2.901
1.269HisGln: 1.269 ± 0.763
0.635HisArg: 0.635 ± 0.306
2.538HisSer: 2.538 ± 0.152
3.173HisThr: 3.173 ± 0.154
1.904HisVal: 1.904 ± 0.917
0.635HisTrp: 0.635 ± 0.306
0.635HisTyr: 0.635 ± 0.306
0.0HisXaa: 0.0 ± 0.0
Ile
5.711IleAla: 5.711 ± 0.002
1.269IleCys: 1.269 ± 0.611
1.904IleAsp: 1.904 ± 0.917
1.904IleGlu: 1.904 ± 0.917
1.269IlePhe: 1.269 ± 0.611
5.711IleGly: 5.711 ± 1.377
0.635IleHis: 0.635 ± 0.306
0.635IleIle: 0.635 ± 1.069
0.0IleLys: 0.0 ± 0.0
1.269IleLeu: 1.269 ± 0.611
1.904IleMet: 1.904 ± 1.832
1.904IleAsn: 1.904 ± 1.832
0.635IlePro: 0.635 ± 0.306
1.269IleGln: 1.269 ± 0.611
1.904IleArg: 1.904 ± 0.457
1.904IleSer: 1.904 ± 0.457
1.904IleThr: 1.904 ± 0.457
0.635IleVal: 0.635 ± 1.069
1.904IleTrp: 1.904 ± 0.917
2.538IleTyr: 2.538 ± 1.526
0.0IleXaa: 0.0 ± 0.0
Lys
3.173LysAla: 3.173 ± 1.528
0.0LysCys: 0.0 ± 0.0
3.807LysAsp: 3.807 ± 1.834
2.538LysGlu: 2.538 ± 1.223
1.904LysPhe: 1.904 ± 1.832
1.904LysGly: 1.904 ± 0.917
1.904LysHis: 1.904 ± 1.832
1.269LysIle: 1.269 ± 0.611
3.807LysLys: 3.807 ± 1.834
5.076LysLeu: 5.076 ± 0.303
1.904LysMet: 1.904 ± 0.457
0.635LysAsn: 0.635 ± 0.306
1.269LysPro: 1.269 ± 0.611
1.269LysGln: 1.269 ± 0.611
3.807LysArg: 3.807 ± 0.915
1.904LysSer: 1.904 ± 0.917
3.807LysThr: 3.807 ± 1.834
3.807LysVal: 3.807 ± 1.834
0.0LysTrp: 0.0 ± 0.0
1.904LysTyr: 1.904 ± 0.457
0.0LysXaa: 0.0 ± 0.0
Leu
8.883LeuAla: 8.883 ± 3.967
0.0LeuCys: 0.0 ± 0.0
6.345LeuAsp: 6.345 ± 1.067
2.538LeuGlu: 2.538 ± 0.152
2.538LeuPhe: 2.538 ± 1.223
6.98LeuGly: 6.98 ± 0.761
0.635LeuHis: 0.635 ± 0.306
3.173LeuIle: 3.173 ± 2.595
4.442LeuLys: 4.442 ± 2.14
6.345LeuLeu: 6.345 ± 0.308
2.538LeuMet: 2.538 ± 0.152
1.904LeuAsn: 1.904 ± 0.457
3.807LeuPro: 3.807 ± 2.289
2.538LeuGln: 2.538 ± 0.152
6.345LeuArg: 6.345 ± 1.067
6.98LeuSer: 6.98 ± 1.988
1.904LeuThr: 1.904 ± 0.917
5.076LeuVal: 5.076 ± 1.071
2.538LeuTrp: 2.538 ± 1.223
1.269LeuTyr: 1.269 ± 0.611
0.0LeuXaa: 0.0 ± 0.0
Met
3.173MetAla: 3.173 ± 1.22
0.635MetCys: 0.635 ± 1.069
2.538MetAsp: 2.538 ± 0.152
2.538MetGlu: 2.538 ± 1.223
0.635MetPhe: 0.635 ± 0.306
1.904MetGly: 1.904 ± 1.832
0.635MetHis: 0.635 ± 0.306
0.635MetIle: 0.635 ± 0.306
0.635MetLys: 0.635 ± 0.306
2.538MetLeu: 2.538 ± 0.152
1.904MetMet: 1.904 ± 0.917
1.269MetAsn: 1.269 ± 2.137
0.635MetPro: 0.635 ± 0.306
1.269MetGln: 1.269 ± 0.611
3.173MetArg: 3.173 ± 0.154
1.269MetSer: 1.269 ± 0.611
2.538MetThr: 2.538 ± 0.152
3.807MetVal: 3.807 ± 0.915
1.269MetTrp: 1.269 ± 0.763
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.904AsnAla: 1.904 ± 1.832
0.635AsnCys: 0.635 ± 0.306
1.904AsnAsp: 1.904 ± 0.457
1.904AsnGlu: 1.904 ± 0.917
0.635AsnPhe: 0.635 ± 0.306
2.538AsnGly: 2.538 ± 1.223
0.0AsnHis: 0.0 ± 0.0
0.635AsnIle: 0.635 ± 0.306
2.538AsnLys: 2.538 ± 1.526
2.538AsnLeu: 2.538 ± 1.526
1.269AsnMet: 1.269 ± 0.763
1.904AsnAsn: 1.904 ± 1.832
1.904AsnPro: 1.904 ± 0.457
0.0AsnGln: 0.0 ± 0.0
1.904AsnArg: 1.904 ± 0.457
3.807AsnSer: 3.807 ± 0.915
0.635AsnThr: 0.635 ± 1.069
2.538AsnVal: 2.538 ± 0.152
0.635AsnTrp: 0.635 ± 0.306
0.635AsnTyr: 0.635 ± 0.306
0.0AsnXaa: 0.0 ± 0.0
Pro
1.269ProAla: 1.269 ± 0.763
0.635ProCys: 0.635 ± 0.306
1.269ProAsp: 1.269 ± 0.611
1.269ProGlu: 1.269 ± 0.611
1.904ProPhe: 1.904 ± 0.917
4.442ProGly: 4.442 ± 2.14
1.904ProHis: 1.904 ± 1.832
2.538ProIle: 2.538 ± 1.526
1.269ProLys: 1.269 ± 0.763
5.076ProLeu: 5.076 ± 0.303
0.0ProMet: 0.0 ± 0.0
0.635ProAsn: 0.635 ± 0.306
4.442ProPro: 4.442 ± 2.14
2.538ProGln: 2.538 ± 0.152
6.345ProArg: 6.345 ± 1.067
5.076ProSer: 5.076 ± 1.071
1.269ProThr: 1.269 ± 2.137
2.538ProVal: 2.538 ± 1.223
0.0ProTrp: 0.0 ± 0.0
1.269ProTyr: 1.269 ± 0.763
0.0ProXaa: 0.0 ± 0.0
Gln
2.538GlnAla: 2.538 ± 1.526
1.269GlnCys: 1.269 ± 0.611
2.538GlnAsp: 2.538 ± 1.223
3.807GlnGlu: 3.807 ± 1.834
1.269GlnPhe: 1.269 ± 0.763
3.807GlnGly: 3.807 ± 0.915
0.635GlnHis: 0.635 ± 0.306
2.538GlnIle: 2.538 ± 1.526
0.635GlnLys: 0.635 ± 0.306
3.807GlnLeu: 3.807 ± 0.915
2.538GlnMet: 2.538 ± 1.223
1.269GlnAsn: 1.269 ± 0.611
1.904GlnPro: 1.904 ± 0.917
1.269GlnGln: 1.269 ± 0.611
1.269GlnArg: 1.269 ± 0.763
1.904GlnSer: 1.904 ± 0.457
1.269GlnThr: 1.269 ± 0.763
3.173GlnVal: 3.173 ± 0.154
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
9.518ArgAla: 9.518 ± 6.41
0.635ArgCys: 0.635 ± 0.306
1.269ArgAsp: 1.269 ± 0.611
3.173ArgGlu: 3.173 ± 1.528
1.904ArgPhe: 1.904 ± 0.457
3.173ArgGly: 3.173 ± 0.154
1.904ArgHis: 1.904 ± 0.917
5.076ArgIle: 5.076 ± 2.445
0.635ArgLys: 0.635 ± 0.306
4.442ArgLeu: 4.442 ± 0.765
1.904ArgMet: 1.904 ± 0.917
1.269ArgAsn: 1.269 ± 0.611
3.807ArgPro: 3.807 ± 0.46
4.442ArgGln: 4.442 ± 0.609
10.152ArgArg: 10.152 ± 6.105
8.249ArgSer: 8.249 ± 1.225
3.173ArgThr: 3.173 ± 1.22
4.442ArgVal: 4.442 ± 1.984
1.904ArgTrp: 1.904 ± 0.917
0.635ArgTyr: 0.635 ± 0.306
0.0ArgXaa: 0.0 ± 0.0
Ser
6.345SerAla: 6.345 ± 1.067
0.635SerCys: 0.635 ± 0.306
5.076SerAsp: 5.076 ± 0.303
3.173SerGlu: 3.173 ± 1.22
3.807SerPhe: 3.807 ± 1.834
6.98SerGly: 6.98 ± 1.988
2.538SerHis: 2.538 ± 0.152
3.807SerIle: 3.807 ± 0.46
4.442SerLys: 4.442 ± 0.609
5.076SerLeu: 5.076 ± 0.303
1.904SerMet: 1.904 ± 0.457
1.904SerAsn: 1.904 ± 0.457
4.442SerPro: 4.442 ± 0.609
1.904SerGln: 1.904 ± 0.917
5.711SerArg: 5.711 ± 2.751
9.518SerSer: 9.518 ± 1.836
5.076SerThr: 5.076 ± 1.678
5.076SerVal: 5.076 ± 0.303
2.538SerTrp: 2.538 ± 1.223
1.904SerTyr: 1.904 ± 1.832
0.0SerXaa: 0.0 ± 0.0
Thr
8.249ThrAla: 8.249 ± 0.15
0.635ThrCys: 0.635 ± 0.306
1.269ThrAsp: 1.269 ± 0.611
3.173ThrGlu: 3.173 ± 1.528
3.173ThrPhe: 3.173 ± 1.528
5.076ThrGly: 5.076 ± 3.052
0.635ThrHis: 0.635 ± 1.069
0.635ThrIle: 0.635 ± 0.306
4.442ThrLys: 4.442 ± 0.765
3.173ThrLeu: 3.173 ± 2.595
0.0ThrMet: 0.0 ± 0.0
3.173ThrAsn: 3.173 ± 2.595
1.904ThrPro: 1.904 ± 0.917
1.904ThrGln: 1.904 ± 1.832
1.904ThrArg: 1.904 ± 0.917
2.538ThrSer: 2.538 ± 0.152
1.269ThrThr: 1.269 ± 0.763
3.807ThrVal: 3.807 ± 3.664
1.269ThrTrp: 1.269 ± 2.137
2.538ThrTyr: 2.538 ± 0.152
0.0ThrXaa: 0.0 ± 0.0
Val
5.076ValAla: 5.076 ± 1.678
1.269ValCys: 1.269 ± 0.611
3.807ValAsp: 3.807 ± 0.915
3.807ValGlu: 3.807 ± 0.46
1.904ValPhe: 1.904 ± 0.917
6.345ValGly: 6.345 ± 2.441
0.635ValHis: 0.635 ± 0.306
2.538ValIle: 2.538 ± 2.901
3.807ValLys: 3.807 ± 1.834
3.173ValLeu: 3.173 ± 0.154
3.173ValMet: 3.173 ± 1.22
2.538ValAsn: 2.538 ± 0.152
1.904ValPro: 1.904 ± 0.457
3.807ValGln: 3.807 ± 2.289
3.173ValArg: 3.173 ± 1.22
8.883ValSer: 8.883 ± 0.156
3.173ValThr: 3.173 ± 0.154
4.442ValVal: 4.442 ± 0.765
2.538ValTrp: 2.538 ± 1.223
1.904ValTyr: 1.904 ± 0.917
0.0ValXaa: 0.0 ± 0.0
Trp
3.173TrpAla: 3.173 ± 0.154
0.0TrpCys: 0.0 ± 0.0
4.442TrpAsp: 4.442 ± 1.984
0.635TrpGlu: 0.635 ± 0.306
0.635TrpPhe: 0.635 ± 0.306
1.269TrpGly: 1.269 ± 0.611
0.635TrpHis: 0.635 ± 0.306
1.269TrpIle: 1.269 ± 0.611
1.904TrpLys: 1.904 ± 0.917
0.635TrpLeu: 0.635 ± 0.306
1.269TrpMet: 1.269 ± 0.611
0.635TrpAsn: 0.635 ± 0.306
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.269TrpArg: 1.269 ± 0.611
3.807TrpSer: 3.807 ± 0.915
1.269TrpThr: 1.269 ± 0.611
0.635TrpVal: 0.635 ± 0.306
0.0TrpTrp: 0.0 ± 0.0
0.635TrpTyr: 0.635 ± 0.306
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.807TyrAla: 3.807 ± 0.46
1.269TyrCys: 1.269 ± 0.611
0.635TyrAsp: 0.635 ± 0.306
2.538TyrGlu: 2.538 ± 0.152
0.635TyrPhe: 0.635 ± 0.306
0.635TyrGly: 0.635 ± 0.306
0.635TyrHis: 0.635 ± 0.306
0.0TyrIle: 0.0 ± 0.0
2.538TyrLys: 2.538 ± 1.223
2.538TyrLeu: 2.538 ± 1.223
0.635TyrMet: 0.635 ± 1.069
2.538TyrAsn: 2.538 ± 0.152
1.269TyrPro: 1.269 ± 0.763
0.635TyrGln: 0.635 ± 1.069
3.173TyrArg: 3.173 ± 0.154
1.269TyrSer: 1.269 ± 0.611
1.269TyrThr: 1.269 ± 0.763
0.635TyrVal: 0.635 ± 0.306
0.0TyrTrp: 0.0 ± 0.0
0.635TyrTyr: 0.635 ± 0.306
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1577 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski