Amino acid dipeptide frequency for Sanxia sobemo-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
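The counting described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names, the toy sequences, the choice to count overlapping residue pairs only within each protein, and the use of the 400-dipeptide expectation (2.5‰) as the threshold are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: pairs are overlapping windows of length 2 and never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(ALPHABET, repeat=2)}

# Expected frequency if every dipeptide were equally likely.
uniform_20 = 1000.0 / 20 ** 2  # 2.5 per mille (400 dipeptides)
uniform_21 = 1000.0 / 21 ** 2  # about 2.27 per mille (441 dipeptides, with Xaa)

def representation(value_permille, expected_permille=uniform_20):
    """Label a frequency as over- or underrepresented relative to the expectation."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"

# Toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_permille(["MKLLA", "AALLK"])
```

With these toy sequences there are 8 dipeptides in total, 2 of which are LL, so `freqs["LL"]` is 250‰. Applied to values from the table, `representation(5.44)` gives "over" and `representation(1.813)` gives "under".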

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue
Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.0 ± 0.0
AlaAsp: 1.813 ± 0.597
AlaGlu: 5.44 ± 1.742
AlaPhe: 2.72 ± 0.626
AlaGly: 3.626 ± 1.368
AlaHis: 1.813 ± 0.597
AlaIle: 2.72 ± 2.091
AlaLys: 5.44 ± 1.79
AlaLeu: 10.879 ± 3.99
AlaMet: 0.907 ± 0.539
AlaAsn: 5.44 ± 1.79
AlaPro: 1.813 ± 1.192
AlaGln: 5.44 ± 1.48
AlaArg: 0.907 ± 1.139
AlaSer: 5.44 ± 2.692
AlaThr: 2.72 ± 1.789
AlaVal: 5.44 ± 2.517
AlaTrp: 0.907 ± 0.596
AlaTyr: 0.907 ± 0.815
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.813 ± 1.192
CysCys: 0.907 ± 1.139
CysAsp: 0.0 ± 0.0
CysGlu: 0.907 ± 0.815
CysPhe: 0.0 ± 0.0
CysGly: 0.907 ± 0.815
CysHis: 0.0 ± 0.0
CysIle: 1.813 ± 1.135
CysLys: 0.907 ± 1.139
CysLeu: 1.813 ± 2.278
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.907 ± 0.596
CysArg: 0.907 ± 1.139
CysSer: 0.0 ± 0.0
CysThr: 0.907 ± 0.596
CysVal: 0.907 ± 0.815
CysTrp: 0.0 ± 0.0
CysTyr: 0.907 ± 0.815
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.72 ± 2.445
AspCys: 0.0 ± 0.0
AspAsp: 5.44 ± 2.497
AspGlu: 0.907 ± 0.815
AspPhe: 1.813 ± 1.032
AspGly: 5.44 ± 2.596
AspHis: 1.813 ± 1.192
AspIle: 2.72 ± 0.871
AspLys: 2.72 ± 1.298
AspLeu: 7.253 ± 2.386
AspMet: 2.72 ± 1.789
AspAsn: 3.626 ± 1.368
AspPro: 2.72 ± 1.614
AspGln: 4.533 ± 1.74
AspArg: 4.533 ± 1.74
AspSer: 2.72 ± 1.243
AspThr: 3.626 ± 2.065
AspVal: 0.907 ± 0.596
AspTrp: 1.813 ± 0.597
AspTyr: 1.813 ± 1.192
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.72 ± 0.871
GluCys: 0.907 ± 1.139
GluAsp: 3.626 ± 0.455
GluGlu: 5.44 ± 1.742
GluPhe: 1.813 ± 1.63
GluGly: 0.907 ± 0.815
GluHis: 0.907 ± 0.815
GluIle: 0.0 ± 0.0
GluLys: 4.533 ± 0.444
GluLeu: 3.626 ± 1.193
GluMet: 1.813 ± 0.597
GluAsn: 0.907 ± 0.815
GluPro: 2.72 ± 0.871
GluGln: 0.907 ± 0.596
GluArg: 3.626 ± 0.455
GluSer: 4.533 ± 0.856
GluThr: 0.907 ± 0.596
GluVal: 0.0 ± 0.0
GluTrp: 0.907 ± 0.596
GluTyr: 2.72 ± 1.298
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.72 ± 2.445
PheCys: 0.0 ± 0.0
PheAsp: 4.533 ± 4.074
PheGlu: 1.813 ± 0.597
PhePhe: 0.907 ± 0.596
PheGly: 0.907 ± 1.139
PheHis: 1.813 ± 0.597
PheIle: 2.72 ± 1.298
PheLys: 1.813 ± 1.63
PheLeu: 5.44 ± 2.486
PheMet: 0.907 ± 0.815
PheAsn: 0.0 ± 0.0
PhePro: 1.813 ± 0.597
PheGln: 3.626 ± 2.065
PheArg: 3.626 ± 1.023
PheSer: 3.626 ± 1.739
PheThr: 3.626 ± 1.654
PheVal: 2.72 ± 1.298
PheTrp: 0.0 ± 0.0
PheTyr: 1.813 ± 1.135
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.72 ± 1.243
GlyCys: 0.907 ± 0.815
GlyAsp: 0.907 ± 0.815
GlyGlu: 2.72 ± 0.871
GlyPhe: 2.72 ± 0.871
GlyGly: 0.907 ± 0.815
GlyHis: 1.813 ± 1.63
GlyIle: 2.72 ± 1.298
GlyLys: 0.907 ± 0.596
GlyLeu: 2.72 ± 0.871
GlyMet: 1.813 ± 1.192
GlyAsn: 2.72 ± 0.871
GlyPro: 1.813 ± 1.032
GlyGln: 1.813 ± 1.192
GlyArg: 1.813 ± 1.192
GlySer: 5.44 ± 2.497
GlyThr: 1.813 ± 0.597
GlyVal: 5.44 ± 1.742
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.813 ± 0.597
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.813 ± 1.192
HisCys: 1.813 ± 1.63
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.907 ± 0.815
HisGly: 0.907 ± 0.596
HisHis: 0.0 ± 0.0
HisIle: 0.907 ± 0.596
HisLys: 0.907 ± 0.815
HisLeu: 3.626 ± 2.083
HisMet: 0.0 ± 0.0
HisAsn: 0.907 ± 0.815
HisPro: 0.907 ± 0.596
HisGln: 1.813 ± 1.135
HisArg: 0.907 ± 0.815
HisSer: 1.813 ± 1.192
HisThr: 0.907 ± 0.815
HisVal: 1.813 ± 1.032
HisTrp: 0.0 ± 0.0
HisTyr: 1.813 ± 0.597
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.44 ± 2.076
IleCys: 1.813 ± 1.135
IleAsp: 4.533 ± 1.369
IleGlu: 3.626 ± 1.023
IlePhe: 3.626 ± 1.193
IleGly: 2.72 ± 0.871
IleHis: 0.907 ± 0.815
IleIle: 6.346 ± 1.487
IleLys: 3.626 ± 1.193
IleLeu: 8.16 ± 4.609
IleMet: 0.0 ± 0.0
IleAsn: 0.907 ± 0.815
IlePro: 1.813 ± 1.032
IleGln: 0.907 ± 1.139
IleArg: 2.72 ± 0.626
IleSer: 4.533 ± 2.872
IleThr: 2.72 ± 1.789
IleVal: 0.907 ± 0.596
IleTrp: 0.907 ± 1.139
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.346 ± 0.796
LysCys: 0.907 ± 0.596
LysAsp: 1.813 ± 1.192
LysGlu: 1.813 ± 1.135
LysPhe: 1.813 ± 1.63
LysGly: 2.72 ± 0.871
LysHis: 3.626 ± 2.083
LysIle: 2.72 ± 0.871
LysLys: 4.533 ± 1.923
LysLeu: 7.253 ± 0.907
LysMet: 0.907 ± 0.596
LysAsn: 0.907 ± 0.815
LysPro: 2.72 ± 2.445
LysGln: 3.626 ± 1.739
LysArg: 5.44 ± 0.245
LysSer: 2.72 ± 2.445
LysThr: 3.626 ± 1.193
LysVal: 2.72 ± 1.243
LysTrp: 0.907 ± 0.815
LysTyr: 1.813 ± 0.597
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.879 ± 2.414
LeuCys: 0.907 ± 1.139
LeuAsp: 8.16 ± 3.284
LeuGlu: 3.626 ± 0.455
LeuPhe: 5.44 ± 3.228
LeuGly: 2.72 ± 1.243
LeuHis: 0.0 ± 0.0
LeuIle: 7.253 ± 3.829
LeuLys: 4.533 ± 1.74
LeuLeu: 18.132 ± 7.895
LeuMet: 5.44 ± 4.181
LeuAsn: 6.346 ± 1.487
LeuPro: 5.44 ± 1.79
LeuGln: 3.626 ± 1.023
LeuArg: 3.626 ± 2.291
LeuSer: 13.599 ± 2.569
LeuThr: 8.16 ± 2.607
LeuVal: 6.346 ± 3.313
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.813 ± 0.597
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.907 ± 0.815
MetCys: 0.0 ± 0.0
MetAsp: 3.626 ± 1.368
MetGlu: 0.907 ± 0.815
MetPhe: 0.907 ± 0.596
MetGly: 0.0 ± 0.0
MetHis: 1.813 ± 0.597
MetIle: 1.813 ± 2.278
MetLys: 2.72 ± 0.871
MetLeu: 5.44 ± 0.245
MetMet: 0.907 ± 0.596
MetAsn: 0.907 ± 1.139
MetPro: 0.0 ± 0.0
MetGln: 0.907 ± 0.596
MetArg: 0.907 ± 0.596
MetSer: 2.72 ± 1.243
MetThr: 3.626 ± 3.213
MetVal: 0.907 ± 0.596
MetTrp: 0.0 ± 0.0
MetTyr: 1.813 ± 0.597
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.72 ± 0.871
AsnCys: 0.0 ± 0.0
AsnAsp: 2.72 ± 1.298
AsnGlu: 1.813 ± 1.192
AsnPhe: 2.72 ± 0.871
AsnGly: 1.813 ± 1.192
AsnHis: 0.0 ± 0.0
AsnIle: 0.907 ± 0.596
AsnLys: 1.813 ± 1.192
AsnLeu: 1.813 ± 1.135
AsnMet: 1.813 ± 1.135
AsnAsn: 0.907 ± 0.815
AsnPro: 1.813 ± 1.63
AsnGln: 0.0 ± 0.0
AsnArg: 3.626 ± 1.368
AsnSer: 2.72 ± 2.445
AsnThr: 3.626 ± 2.269
AsnVal: 4.533 ± 0.444
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.72 ± 1.789
ProCys: 0.907 ± 0.596
ProAsp: 3.626 ± 2.065
ProGlu: 0.907 ± 0.815
ProPhe: 1.813 ± 1.032
ProGly: 2.72 ± 0.871
ProHis: 1.813 ± 0.597
ProIle: 2.72 ± 1.614
ProLys: 5.44 ± 2.596
ProLeu: 2.72 ± 0.626
ProMet: 0.0 ± 0.0
ProAsn: 0.907 ± 0.596
ProPro: 4.533 ± 2.206
ProGln: 2.72 ± 0.871
ProArg: 2.72 ± 1.243
ProSer: 1.813 ± 1.192
ProThr: 5.44 ± 1.252
ProVal: 2.72 ± 2.123
ProTrp: 0.907 ± 0.815
ProTyr: 1.813 ± 1.192
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.72 ± 1.243
GlnCys: 0.0 ± 0.0
GlnAsp: 4.533 ± 1.733
GlnGlu: 2.72 ± 1.789
GlnPhe: 1.813 ± 1.192
GlnGly: 1.813 ± 0.597
GlnHis: 0.0 ± 0.0
GlnIle: 2.72 ± 2.091
GlnLys: 5.44 ± 1.742
GlnLeu: 1.813 ± 1.63
GlnMet: 0.907 ± 1.139
GlnAsn: 2.72 ± 1.243
GlnPro: 2.72 ± 0.871
GlnGln: 1.813 ± 1.032
GlnArg: 0.0 ± 0.0
GlnSer: 4.533 ± 2.206
GlnThr: 1.813 ± 1.135
GlnVal: 5.44 ± 1.231
GlnTrp: 0.907 ± 0.815
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.72 ± 0.626
ArgCys: 0.0 ± 0.0
ArgAsp: 0.907 ± 0.815
ArgGlu: 1.813 ± 1.135
ArgPhe: 3.626 ± 3.259
ArgGly: 2.72 ± 1.789
ArgHis: 1.813 ± 1.032
ArgIle: 2.72 ± 0.871
ArgLys: 3.626 ± 2.065
ArgLeu: 7.253 ± 3.829
ArgMet: 1.813 ± 1.63
ArgAsn: 0.907 ± 0.815
ArgPro: 2.72 ± 2.091
ArgGln: 3.626 ± 1.193
ArgArg: 2.72 ± 1.243
ArgSer: 6.346 ± 0.796
ArgThr: 4.533 ± 2.206
ArgVal: 3.626 ± 2.385
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.533 ± 1.923
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.533 ± 2.206
SerCys: 0.907 ± 0.596
SerAsp: 5.44 ± 1.79
SerGlu: 3.626 ± 0.455
SerPhe: 1.813 ± 1.032
SerGly: 6.346 ± 3.079
SerHis: 0.907 ± 1.139
SerIle: 4.533 ± 1.848
SerLys: 3.626 ± 0.455
SerLeu: 6.346 ± 1.487
SerMet: 0.907 ± 0.596
SerAsn: 1.813 ± 0.597
SerPro: 6.346 ± 2.864
SerGln: 1.813 ± 1.192
SerArg: 7.253 ± 4.944
SerSer: 11.786 ± 2.885
SerThr: 8.16 ± 4.255
SerVal: 8.16 ± 1.235
SerTrp: 0.0 ± 0.0
SerTyr: 1.813 ± 0.597
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.346 ± 1.98
ThrCys: 1.813 ± 1.135
ThrAsp: 4.533 ± 1.5
ThrGlu: 0.907 ± 0.815
ThrPhe: 6.346 ± 2.498
ThrGly: 4.533 ± 1.369
ThrHis: 0.0 ± 0.0
ThrIle: 2.72 ± 0.626
ThrLys: 0.907 ± 1.139
ThrLeu: 6.346 ± 0.796
ThrMet: 4.533 ± 1.197
ThrAsn: 1.813 ± 0.597
ThrPro: 3.626 ± 2.385
ThrGln: 3.626 ± 1.654
ThrArg: 5.44 ± 2.497
ThrSer: 6.346 ± 0.846
ThrThr: 5.44 ± 3.097
ThrVal: 3.626 ± 1.654
ThrTrp: 1.813 ± 1.135
ThrTyr: 1.813 ± 1.63
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.533 ± 1.369
ValCys: 1.813 ± 1.032
ValAsp: 2.72 ± 2.445
ValGlu: 0.907 ± 0.596
ValPhe: 2.72 ± 1.614
ValGly: 0.907 ± 0.596
ValHis: 0.0 ± 0.0
ValIle: 3.626 ± 0.455
ValLys: 3.626 ± 1.023
ValLeu: 9.066 ± 0.889
ValMet: 1.813 ± 0.597
ValAsn: 2.72 ± 1.243
ValPro: 4.533 ± 1.733
ValGln: 2.72 ± 1.298
ValArg: 4.533 ± 1.923
ValSer: 4.533 ± 0.856
ValThr: 2.72 ± 1.789
ValVal: 1.813 ± 0.597
ValTrp: 2.72 ± 1.243
ValTyr: 1.813 ± 0.597
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.907 ± 0.815
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.813 ± 2.278
TrpLys: 0.0 ± 0.0
TrpLeu: 0.907 ± 1.139
TrpMet: 1.813 ± 0.597
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.907 ± 0.596
TrpThr: 5.44 ± 1.79
TrpVal: 0.0 ± 0.0
TrpTrp: 0.907 ± 0.815
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.907 ± 0.596
TyrCys: 0.0 ± 0.0
TyrAsp: 0.907 ± 0.596
TyrGlu: 3.626 ± 1.193
TyrPhe: 0.0 ± 0.0
TyrGly: 0.907 ± 0.815
TyrHis: 2.72 ± 0.871
TyrIle: 2.72 ± 0.871
TyrLys: 1.813 ± 1.63
TyrLeu: 5.44 ± 3.693
TyrMet: 0.907 ± 0.596
TyrAsn: 0.907 ± 0.815
TyrPro: 0.907 ± 1.139
TyrGln: 0.0 ± 0.0
TyrArg: 2.72 ± 0.871
TyrSer: 0.0 ± 0.0
TyrThr: 2.72 ± 0.871
TyrVal: 1.813 ± 1.192
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1104 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
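A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, 100 replicates are drawn, and the error is taken to be the standard deviation of the replicate frequencies. The source does not say which spread statistic Proteome-pI reports, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation over replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # missing keys count as 0
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy proteome (hypothetical sequences, not from this virus).
toy = ["MKLLA", "AALLKLL", "MSTV"]
err_ll = bootstrap_error(toy, "LL")
err_ww = bootstrap_error(toy, "WW")  # dipeptide absent from every protein
```

A dipeptide that occurs in none of the proteins has a frequency of 0 in every replicate, so its estimated error is 0 as well, which is consistent with the `0.0 ± 0.0` entries in the table.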

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski