Amino acid dipeptide frequency for Sanxia picorna-like virus 11

In addition to single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). Since this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
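The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function name `dipeptide_per_mille`, the choice of overlapping windows that never span two proteins, and the toy sequences are all hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter codes in the table's column order (Ala, Cys, Asp, ..., Tyr).
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(sequences, alphabet=ALPHABET, unknown="X"):
    """Return {dipeptide: frequency in per mille} over overlapping windows.

    Assumptions (not taken from the source): windows never span two
    proteins, and any residue outside `alphabet` is mapped to `unknown`,
    which plays the role of Xaa.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = [aa if aa in alphabet else unknown for aa in seq.upper()]
        # Each adjacent pair (i, i+1) is one dipeptide observation.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    symbols = alphabet + unknown
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(symbols, repeat=2)
    }

# Uniform expectation if every dipeptide were equally likely.
expected_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %
expected_441 = 1000.0 / 441  # about 2.27 per mille when Xaa is included

# Toy sequences for illustration only, not the viral proteins.
freqs = dipeptide_per_mille(["MAAVLK", "AAXG"])
```

The result always has 441 keys, so dipeptides that never occur appear with a frequency of 0.0, as in the table.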

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 2.5‰ corresponds to 0.25%).
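As a small worked example, the per-mille values can be converted and compared with the uniform expectation as follows. The helper names are hypothetical, and the threshold of 2.5‰ (1/400) is an assumption: the page does not state exactly which expected value drives the red and blue marking.

```python
# Assumed uniform expectation: 1/400 of all dipeptides, i.e. 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

def representation(value, expected=EXPECTED_PER_MILLE):
    """Label a per-mille value relative to the assumed expectation."""
    if value > expected:
        return "over"    # would correspond to a red cell
    if value < expected:
        return "under"   # would correspond to a blue cell
    return "expected"

ala_ala = 9.623  # AlaAla value from the table, in per mille
ala_cys = 0.418  # AlaCys value from the table, in per mille

print(per_mille_to_percent(ala_ala), representation(ala_ala))
print(per_mille_to_percent(ala_cys), representation(ala_cys))
```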

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.623 ± 1.25
AlaCys: 0.418 ± 0.233
AlaAsp: 2.929 ± 2.043
AlaGlu: 3.347 ± 0.339
AlaPhe: 4.184 ± 1.343
AlaGly: 5.439 ± 0.092
AlaHis: 1.674 ± 0.933
AlaIle: 2.929 ± 0.163
AlaLys: 2.092 ± 0.304
AlaLeu: 6.276 ± 3.5
AlaMet: 4.184 ± 0.863
AlaAsn: 3.347 ± 0.339
AlaPro: 5.439 ± 1.378
AlaGln: 0.418 ± 0.233
AlaArg: 1.674 ± 0.933
AlaSer: 3.347 ± 1.809
AlaThr: 5.439 ± 0.643
AlaVal: 8.787 ± 1.224
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.929 ± 0.572
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.674 ± 0.198
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.837 ± 0.467
CysPhe: 1.255 ± 1.506
CysGly: 0.418 ± 0.233
CysHis: 0.0 ± 0.0
CysIle: 0.418 ± 0.233
CysLys: 0.837 ± 0.467
CysLeu: 0.418 ± 0.233
CysMet: 0.0 ± 0.0
CysAsn: 0.418 ± 0.233
CysPro: 0.418 ± 0.502
CysGln: 0.0 ± 0.0
CysArg: 0.418 ± 0.233
CysSer: 0.837 ± 0.269
CysThr: 1.674 ± 0.933
CysVal: 1.674 ± 1.272
CysTrp: 0.0 ± 0.0
CysTyr: 0.837 ± 0.269
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.347 ± 0.396
AspCys: 0.418 ± 0.233
AspAsp: 4.603 ± 1.831
AspGlu: 3.347 ± 0.396
AspPhe: 3.766 ± 0.106
AspGly: 4.184 ± 0.863
AspHis: 0.418 ± 0.233
AspIle: 2.092 ± 1.167
AspLys: 2.929 ± 1.633
AspLeu: 5.439 ± 0.092
AspMet: 0.837 ± 0.467
AspAsn: 2.929 ± 1.308
AspPro: 1.255 ± 0.77
AspGln: 1.674 ± 0.198
AspArg: 2.092 ± 0.431
AspSer: 3.347 ± 1.074
AspThr: 2.51 ± 1.4
AspVal: 5.439 ± 1.563
AspTrp: 0.837 ± 1.004
AspTyr: 0.837 ± 0.467
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.858 ± 1.796
GluCys: 1.255 ± 0.7
GluAsp: 2.929 ± 0.898
GluGlu: 2.092 ± 0.304
GluPhe: 2.929 ± 0.898
GluGly: 3.347 ± 1.867
GluHis: 1.674 ± 0.933
GluIle: 3.347 ± 0.396
GluLys: 2.092 ± 0.431
GluLeu: 4.603 ± 1.096
GluMet: 1.255 ± 0.7
GluAsn: 1.674 ± 0.198
GluPro: 1.255 ± 0.035
GluGln: 2.092 ± 0.431
GluArg: 0.837 ± 0.467
GluSer: 4.603 ± 1.831
GluThr: 2.092 ± 0.304
GluVal: 4.603 ± 0.361
GluTrp: 1.674 ± 0.933
GluTyr: 1.674 ± 0.537
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.347 ± 0.339
PheCys: 0.0 ± 0.0
PheAsp: 5.858 ± 0.326
PheGlu: 2.929 ± 0.163
PhePhe: 0.418 ± 0.233
PheGly: 3.766 ± 0.106
PheHis: 1.255 ± 1.506
PheIle: 2.092 ± 0.304
PheLys: 2.092 ± 0.304
PheLeu: 4.184 ± 1.598
PheMet: 2.929 ± 0.147
PheAsn: 2.929 ± 0.572
PhePro: 2.092 ± 0.304
PheGln: 2.092 ± 0.431
PheArg: 2.51 ± 1.541
PheSer: 2.51 ± 2.276
PheThr: 1.255 ± 0.035
PheVal: 5.858 ± 1.145
PheTrp: 0.837 ± 0.269
PheTyr: 0.837 ± 0.269
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.603 ± 0.374
GlyCys: 1.674 ± 0.537
GlyAsp: 3.347 ± 0.396
GlyGlu: 3.347 ± 0.339
GlyPhe: 2.929 ± 0.572
GlyGly: 5.021 ± 1.611
GlyHis: 1.255 ± 0.7
GlyIle: 4.603 ± 0.361
GlyLys: 4.603 ± 1.096
GlyLeu: 7.113 ± 0.291
GlyMet: 0.837 ± 0.467
GlyAsn: 3.766 ± 1.576
GlyPro: 1.255 ± 0.77
GlyGln: 2.092 ± 0.304
GlyArg: 2.51 ± 0.07
GlySer: 7.113 ± 1.18
GlyThr: 3.766 ± 0.106
GlyVal: 8.368 ± 0.255
GlyTrp: 0.837 ± 0.467
GlyTyr: 3.766 ± 0.63
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.418 ± 0.233
HisCys: 0.418 ± 0.233
HisAsp: 1.255 ± 0.77
HisGlu: 1.255 ± 0.7
HisPhe: 0.837 ± 0.467
HisGly: 1.674 ± 0.198
HisHis: 0.837 ± 0.269
HisIle: 1.255 ± 0.7
HisLys: 0.418 ± 0.233
HisLeu: 2.092 ± 0.431
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.674 ± 0.198
HisGln: 0.837 ± 0.269
HisArg: 0.418 ± 0.233
HisSer: 2.092 ± 1.167
HisThr: 0.837 ± 0.467
HisVal: 1.255 ± 0.035
HisTrp: 1.255 ± 0.7
HisTyr: 0.418 ± 0.233
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.531 ± 1.259
IleCys: 0.0 ± 0.0
IleAsp: 1.255 ± 0.7
IleGlu: 2.929 ± 1.633
IlePhe: 2.51 ± 0.07
IleGly: 5.439 ± 2.113
IleHis: 0.418 ± 0.502
IleIle: 1.255 ± 0.035
IleLys: 1.674 ± 0.198
IleLeu: 2.929 ± 1.633
IleMet: 1.674 ± 0.198
IleAsn: 2.51 ± 1.4
IlePro: 2.51 ± 1.541
IleGln: 0.837 ± 0.467
IleArg: 2.51 ± 1.4
IleSer: 5.439 ± 0.828
IleThr: 3.347 ± 1.809
IleVal: 2.929 ± 0.572
IleTrp: 0.0 ± 0.0
IleTyr: 0.418 ± 0.233
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.092 ± 0.431
LysCys: 0.418 ± 0.233
LysAsp: 2.929 ± 0.163
LysGlu: 1.674 ± 0.933
LysPhe: 2.929 ± 0.572
LysGly: 3.347 ± 0.396
LysHis: 1.255 ± 0.7
LysIle: 2.092 ± 0.304
LysLys: 1.255 ± 0.7
LysLeu: 4.603 ± 1.831
LysMet: 1.674 ± 0.198
LysAsn: 2.51 ± 1.4
LysPro: 3.766 ± 0.106
LysGln: 1.255 ± 0.035
LysArg: 2.929 ± 1.633
LysSer: 3.766 ± 1.365
LysThr: 4.603 ± 0.374
LysVal: 4.184 ± 2.333
LysTrp: 0.837 ± 0.269
LysTyr: 2.929 ± 0.572
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.439 ± 0.828
LeuCys: 2.092 ± 0.304
LeuAsp: 3.766 ± 1.365
LeuGlu: 6.695 ± 1.528
LeuPhe: 2.929 ± 0.572
LeuGly: 4.184 ± 0.128
LeuHis: 3.347 ± 0.396
LeuIle: 3.347 ± 1.131
LeuLys: 5.858 ± 0.326
LeuLeu: 5.858 ± 0.326
LeuMet: 1.674 ± 0.198
LeuAsn: 2.929 ± 0.572
LeuPro: 2.929 ± 0.163
LeuGln: 5.021 ± 2.065
LeuArg: 5.439 ± 0.092
LeuSer: 7.113 ± 1.18
LeuThr: 4.184 ± 0.863
LeuVal: 7.113 ± 1.026
LeuTrp: 0.837 ± 0.467
LeuTyr: 2.092 ± 0.304
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.674 ± 0.198
MetCys: 0.418 ± 0.502
MetAsp: 2.51 ± 0.665
MetGlu: 0.0 ± 0.0
MetPhe: 0.837 ± 0.467
MetGly: 0.837 ± 0.269
MetHis: 0.0 ± 0.0
MetIle: 2.092 ± 0.304
MetLys: 2.092 ± 1.167
MetLeu: 2.092 ± 0.431
MetMet: 1.255 ± 0.77
MetAsn: 0.0 ± 0.0
MetPro: 2.092 ± 0.304
MetGln: 2.929 ± 1.633
MetArg: 2.092 ± 1.039
MetSer: 2.092 ± 0.304
MetThr: 1.255 ± 0.035
MetVal: 2.929 ± 0.163
MetTrp: 1.255 ± 0.035
MetTyr: 0.837 ± 0.467
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.347 ± 1.809
AsnCys: 0.418 ± 0.502
AsnAsp: 2.092 ± 0.431
AsnGlu: 1.255 ± 0.7
AsnPhe: 1.674 ± 0.198
AsnGly: 4.184 ± 0.608
AsnHis: 1.255 ± 0.7
AsnIle: 2.929 ± 0.163
AsnLys: 1.255 ± 0.035
AsnLeu: 2.929 ± 1.308
AsnMet: 0.837 ± 0.467
AsnAsn: 3.766 ± 1.576
AsnPro: 0.837 ± 0.269
AsnGln: 2.092 ± 0.304
AsnArg: 4.184 ± 0.863
AsnSer: 3.347 ± 0.339
AsnThr: 4.184 ± 2.813
AsnVal: 4.184 ± 2.078
AsnTrp: 1.674 ± 0.537
AsnTyr: 0.837 ± 0.269
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.837 ± 0.269
ProCys: 0.418 ± 0.502
ProAsp: 2.092 ± 0.304
ProGlu: 3.347 ± 0.339
ProPhe: 2.092 ± 1.039
ProGly: 4.184 ± 0.128
ProHis: 0.837 ± 0.269
ProIle: 1.674 ± 0.198
ProLys: 1.674 ± 0.537
ProLeu: 4.603 ± 0.374
ProMet: 1.674 ± 0.198
ProAsn: 2.092 ± 1.774
ProPro: 2.929 ± 0.572
ProGln: 0.418 ± 0.233
ProArg: 1.255 ± 0.035
ProSer: 5.021 ± 3.082
ProThr: 2.929 ± 1.308
ProVal: 3.347 ± 2.545
ProTrp: 1.255 ± 0.035
ProTyr: 2.51 ± 1.541
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.51 ± 0.806
GlnCys: 0.418 ± 0.233
GlnAsp: 1.255 ± 0.7
GlnGlu: 2.092 ± 1.167
GlnPhe: 2.929 ± 1.308
GlnGly: 2.092 ± 0.431
GlnHis: 1.255 ± 0.7
GlnIle: 1.255 ± 0.7
GlnLys: 2.51 ± 1.4
GlnLeu: 1.255 ± 0.77
GlnMet: 0.837 ± 0.269
GlnAsn: 0.837 ± 0.269
GlnPro: 2.092 ± 0.431
GlnGln: 0.0 ± 0.0
GlnArg: 2.929 ± 1.633
GlnSer: 2.092 ± 0.431
GlnThr: 2.092 ± 0.304
GlnVal: 3.766 ± 0.63
GlnTrp: 0.837 ± 0.467
GlnTyr: 1.255 ± 0.77
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.929 ± 0.163
ArgCys: 0.837 ± 0.467
ArgAsp: 3.347 ± 1.131
ArgGlu: 4.184 ± 1.598
ArgPhe: 3.766 ± 0.63
ArgGly: 4.184 ± 0.608
ArgHis: 0.837 ± 0.467
ArgIle: 2.092 ± 1.039
ArgLys: 2.929 ± 1.633
ArgLeu: 4.603 ± 1.096
ArgMet: 1.255 ± 0.035
ArgAsn: 2.092 ± 0.431
ArgPro: 0.837 ± 0.467
ArgGln: 1.255 ± 0.7
ArgArg: 2.51 ± 0.665
ArgSer: 2.929 ± 0.572
ArgThr: 1.255 ± 0.035
ArgVal: 3.347 ± 0.339
ArgTrp: 0.837 ± 0.269
ArgTyr: 1.255 ± 0.7
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.347 ± 0.396
SerCys: 0.837 ± 0.269
SerAsp: 3.347 ± 1.131
SerGlu: 3.766 ± 1.365
SerPhe: 5.021 ± 1.611
SerGly: 5.021 ± 0.876
SerHis: 0.0 ± 0.0
SerIle: 2.092 ± 0.431
SerLys: 4.603 ± 1.109
SerLeu: 10.879 ± 2.756
SerMet: 1.674 ± 0.198
SerAsn: 4.184 ± 1.343
SerPro: 2.929 ± 3.513
SerGln: 3.347 ± 1.809
SerArg: 5.021 ± 0.141
SerSer: 4.603 ± 1.109
SerThr: 5.021 ± 1.611
SerVal: 7.95 ± 0.713
SerTrp: 1.674 ± 0.198
SerTyr: 4.184 ± 0.608
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.021 ± 1.611
ThrCys: 0.0 ± 0.0
ThrAsp: 2.51 ± 0.07
ThrGlu: 3.766 ± 2.1
ThrPhe: 2.092 ± 0.304
ThrGly: 5.439 ± 0.828
ThrHis: 0.0 ± 0.0
ThrIle: 2.092 ± 0.431
ThrLys: 3.766 ± 0.841
ThrLeu: 5.439 ± 0.643
ThrMet: 1.674 ± 1.272
ThrAsn: 3.766 ± 3.046
ThrPro: 3.347 ± 1.074
ThrGln: 1.674 ± 0.198
ThrArg: 2.092 ± 1.167
ThrSer: 6.276 ± 1.646
ThrThr: 2.929 ± 0.572
ThrVal: 6.276 ± 0.176
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.092 ± 1.039
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.368 ± 1.215
ValCys: 1.674 ± 0.537
ValAsp: 3.766 ± 0.106
ValGlu: 4.184 ± 0.863
ValPhe: 2.51 ± 0.07
ValGly: 6.695 ± 0.057
ValHis: 2.092 ± 1.167
ValIle: 4.184 ± 0.608
ValLys: 5.858 ± 2.531
ValLeu: 5.021 ± 0.141
ValMet: 1.674 ± 0.933
ValAsn: 5.439 ± 0.828
ValPro: 6.695 ± 3.619
ValGln: 3.347 ± 0.339
ValArg: 3.766 ± 0.63
ValSer: 8.368 ± 1.95
ValThr: 7.95 ± 0.713
ValVal: 5.858 ± 1.88
ValTrp: 2.092 ± 1.167
ValTyr: 1.674 ± 0.198
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.837 ± 0.467
TrpCys: 0.418 ± 0.233
TrpAsp: 1.674 ± 0.198
TrpGlu: 0.0 ± 0.0
TrpPhe: 2.092 ± 0.304
TrpGly: 0.418 ± 0.502
TrpHis: 0.0 ± 0.0
TrpIle: 1.674 ± 0.933
TrpLys: 0.837 ± 0.467
TrpLeu: 0.837 ± 0.467
TrpMet: 1.255 ± 0.035
TrpAsn: 1.255 ± 0.035
TrpPro: 0.0 ± 0.0
TrpGln: 1.255 ± 0.7
TrpArg: 0.837 ± 0.269
TrpSer: 1.255 ± 0.035
TrpThr: 1.255 ± 0.035
TrpVal: 0.837 ± 0.467
TrpTrp: 0.418 ± 0.502
TrpTyr: 1.255 ± 0.035
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.674 ± 0.537
TyrCys: 0.0 ± 0.0
TyrAsp: 0.837 ± 0.467
TyrGlu: 0.837 ± 0.467
TyrPhe: 2.092 ± 0.431
TyrGly: 2.929 ± 1.308
TyrHis: 0.837 ± 0.467
TyrIle: 4.603 ± 1.109
TyrLys: 1.674 ± 0.933
TyrLeu: 1.674 ± 0.933
TyrMet: 1.674 ± 0.799
TyrAsn: 0.837 ± 1.004
TyrPro: 0.837 ± 0.269
TyrGln: 1.674 ± 0.198
TyrArg: 1.255 ± 0.035
TyrSer: 3.347 ± 2.545
TyrThr: 1.674 ± 0.537
TyrVal: 2.51 ± 0.07
TyrTrp: 1.255 ± 0.7
TyrTyr: 0.418 ± 0.233
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2391 amino acids)
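For further analysis, the table entries above can be loaded into a dictionary keyed by the two residues. The sketch below is an assumption-laden illustration: the regular expression and the names `parse_cell` and `load_table` are hypothetical, and the pattern tolerates an optional leading copy of the value in front of the dipeptide name.

```python
import re

# Matches e.g. "SerLeu: 10.879 ± 2.756", optionally preceded by a repeated value.
CELL = re.compile(
    r"^\s*(?:[\d.]+)?([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)\s*$"
)

def parse_cell(line):
    """Return (first, second, value, error) or None for non-cell lines."""
    match = CELL.match(line)
    if match is None:
        return None  # row labels such as "Ala" and blank lines end up here
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)

def load_table(lines):
    """Build {(first, second): (per-mille value, bootstrap error)}."""
    table = {}
    for line in lines:
        parsed = parse_cell(line)
        if parsed is not None:
            first, second, value, error = parsed
            table[(first, second)] = (value, error)
    return table

# A few lines copied from the table, in both observed layouts.
sample = [
    "Ala",
    "9.623AlaAla: 9.623 ± 1.25",
    "AlaTrp: 0.0 ± 0.0",
    "SerLeu: 10.879 ± 2.756",
]
table = load_table(sample)

# Most frequent dipeptide among the sample lines only.
top = max(table, key=lambda pair: table[pair][0])
```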

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
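A protein-level bootstrap of this kind can be sketched as follows. This is not the database's actual procedure: the function names, the use of the standard deviation of the resampled estimates as the error, and the toy sequences are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread of a dipeptide's per-mille frequency over protein-level resamples.

    Whole proteins are drawn with replacement (as many as in the original
    set), the frequency is recomputed for each resample, and the standard
    deviation of those estimates is returned (an assumed error definition).
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy proteins for illustration only, not the two viral proteins.
toy = ["AAAA", "GGGG"]
error_aa = bootstrap_error(toy, "AA")
```

With only two proteins, as in this proteome, each resample can contain just three distinct combinations, which is one reason the estimated errors can be large relative to the values.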

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski