Amino acid dipeptide frequency for Melon necrotic spot virus (MNSV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would occur at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
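As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides within each protein and compares them with the uniform expectation of 2.5‰. The function names (`dipeptide_permille`, `classify`) are hypothetical, and both the counting scheme (pairs are not formed across protein boundaries) and the mapping of every non-standard letter to `X` are assumptions; the database may compute its values differently.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(sequences, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    allowed = set(alphabet)
    counts = Counter()
    for seq in sequences:
        # Assumption: any letter outside the alphabet is treated as Xaa ("X").
        cleaned = [aa if aa in allowed else "X" for aa in seq.upper()]
        # Overlapping pairs inside one protein: (1,2), (2,3), ...
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, n_letters=20):
    """Label a frequency relative to the uniform expectation (2.5 for 20 letters)."""
    expected = 1000.0 / n_letters ** 2
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["AAAC", "ACX"]  # toy sequences, not MNSV data
    for pair, freq in sorted(dipeptide_permille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With this classification, a table value such as SerSer (16.451‰) counts as overrepresented and AlaGln (0.588‰) as underrepresented.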

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
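The unit conversion can be written out as a small Python sketch. The helper names are hypothetical; the example value is the SerSer entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    ser_ser = 16.451  # SerSer frequency in per mille, taken from the table
    print(permille_to_fraction(ser_ser))  # fraction of all dipeptides
    print(permille_to_percent(ser_ser))   # the same value in percent
```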

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.113AlaAla: 4.113 ± 0.748
1.763AlaCys: 1.763 ± 1.046
2.938AlaAsp: 2.938 ± 1.422
3.525AlaGlu: 3.525 ± 0.893
2.938AlaPhe: 2.938 ± 1.322
2.35AlaGly: 2.35 ± 1.499
1.763AlaHis: 1.763 ± 0.941
9.988AlaIle: 9.988 ± 1.162
5.288AlaLys: 5.288 ± 1.428
5.288AlaLeu: 5.288 ± 1.937
2.35AlaMet: 2.35 ± 0.827
2.35AlaAsn: 2.35 ± 1.121
2.35AlaPro: 2.35 ± 0.86
0.588AlaGln: 0.588 ± 0.575
2.938AlaArg: 2.938 ± 1.907
3.525AlaSer: 3.525 ± 1.209
3.525AlaThr: 3.525 ± 0.952
6.463AlaVal: 6.463 ± 1.256
2.938AlaTrp: 2.938 ± 1.205
1.763AlaTyr: 1.763 ± 0.941
0.0AlaXaa: 0.0 ± 0.0
Cys
1.175CysAla: 1.175 ± 0.763
1.175CysCys: 1.175 ± 1.154
2.35CysAsp: 2.35 ± 1.062
1.175CysGlu: 1.175 ± 0.763
0.0CysPhe: 0.0 ± 0.0
1.763CysGly: 1.763 ± 0.703
1.763CysHis: 1.763 ± 0.703
0.0CysIle: 0.0 ± 0.0
1.175CysLys: 1.175 ± 0.552
2.938CysLeu: 2.938 ± 1.322
1.175CysMet: 1.175 ± 0.867
0.0CysAsn: 0.0 ± 0.0
1.175CysPro: 1.175 ± 0.552
0.588CysGln: 0.588 ± 0.381
2.938CysArg: 2.938 ± 1.133
1.763CysSer: 1.763 ± 0.476
0.588CysThr: 0.588 ± 0.381
0.588CysVal: 0.588 ± 0.381
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.463AspAla: 6.463 ± 2.446
1.763AspCys: 1.763 ± 0.703
0.588AspAsp: 0.588 ± 0.381
1.763AspGlu: 1.763 ± 0.703
1.763AspPhe: 1.763 ± 0.575
4.113AspGly: 4.113 ± 1.33
0.0AspHis: 0.0 ± 0.0
3.525AspIle: 3.525 ± 1.655
1.763AspLys: 1.763 ± 1.072
1.763AspLeu: 1.763 ± 0.941
4.113AspMet: 4.113 ± 1.253
1.763AspAsn: 1.763 ± 0.476
1.763AspPro: 1.763 ± 0.941
0.588AspGln: 0.588 ± 0.381
5.288AspArg: 5.288 ± 2.07
10.576AspSer: 10.576 ± 3.259
1.763AspThr: 1.763 ± 0.476
2.35AspVal: 2.35 ± 1.499
1.175AspTrp: 1.175 ± 0.552
1.763AspTyr: 1.763 ± 1.046
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
1.175GluCys: 1.175 ± 0.763
1.763GluAsp: 1.763 ± 0.703
5.288GluGlu: 5.288 ± 2.28
2.35GluPhe: 2.35 ± 0.86
2.35GluGly: 2.35 ± 0.86
1.175GluHis: 1.175 ± 0.763
3.525GluIle: 3.525 ± 1.407
1.763GluLys: 1.763 ± 1.144
4.7GluLeu: 4.7 ± 2.039
0.0GluMet: 0.0 ± 0.0
2.35GluAsn: 2.35 ± 0.355
0.588GluPro: 0.588 ± 0.381
3.525GluGln: 3.525 ± 1.407
1.763GluArg: 1.763 ± 1.072
2.938GluSer: 2.938 ± 1.11
0.588GluThr: 0.588 ± 0.575
3.525GluVal: 3.525 ± 0.822
0.588GluTrp: 0.588 ± 0.381
0.588GluTyr: 0.588 ± 0.575
0.0GluXaa: 0.0 ± 0.0
Phe
1.175PheAla: 1.175 ± 0.43
1.763PheCys: 1.763 ± 1.144
3.525PheAsp: 3.525 ± 0.969
1.175PheGlu: 1.175 ± 0.763
2.35PhePhe: 2.35 ± 1.07
2.938PheGly: 2.938 ± 0.562
0.0PheHis: 0.0 ± 0.0
2.938PheIle: 2.938 ± 1.181
1.763PheLys: 1.763 ± 0.476
2.35PheLeu: 2.35 ± 0.355
0.588PheMet: 0.588 ± 0.819
1.763PheAsn: 1.763 ± 1.072
0.588PhePro: 0.588 ± 0.381
0.588PheGln: 0.588 ± 0.381
1.763PheArg: 1.763 ± 1.144
1.175PheSer: 1.175 ± 0.43
2.35PheThr: 2.35 ± 2.3
5.288PheVal: 5.288 ± 1.095
0.0PheTrp: 0.0 ± 0.0
2.35PheTyr: 2.35 ± 1.07
0.588PheXaa: 0.588 ± 0.654
Gly
8.226GlyAla: 8.226 ± 2.257
2.35GlyCys: 2.35 ± 0.988
5.875GlyAsp: 5.875 ± 2.785
2.938GlyGlu: 2.938 ± 1.205
2.938GlyPhe: 2.938 ± 1.907
6.463GlyGly: 6.463 ± 1.006
0.0GlyHis: 0.0 ± 0.0
9.401GlyIle: 9.401 ± 3.094
2.35GlyLys: 2.35 ± 1.121
5.875GlyLeu: 5.875 ± 2.368
2.35GlyMet: 2.35 ± 1.296
5.288GlyAsn: 5.288 ± 1.805
1.763GlyPro: 1.763 ± 0.575
1.763GlyGln: 1.763 ± 0.941
4.7GlyArg: 4.7 ± 2.242
3.525GlySer: 3.525 ± 1.99
4.113GlyThr: 4.113 ± 0.712
5.288GlyVal: 5.288 ± 1.225
1.175GlyTrp: 1.175 ± 0.552
2.35GlyTyr: 2.35 ± 0.988
0.0GlyXaa: 0.0 ± 0.0
His
0.588HisAla: 0.588 ± 0.381
0.0HisCys: 0.0 ± 0.0
2.35HisAsp: 2.35 ± 0.988
0.0HisGlu: 0.0 ± 0.0
2.35HisPhe: 2.35 ± 1.216
1.763HisGly: 1.763 ± 0.476
1.175HisHis: 1.175 ± 1.154
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.175HisLeu: 1.175 ± 0.552
0.0HisMet: 0.0 ± 0.0
1.763HisAsn: 1.763 ± 0.703
1.763HisPro: 1.763 ± 0.703
1.175HisGln: 1.175 ± 0.552
1.763HisArg: 1.763 ± 0.476
2.35HisSer: 2.35 ± 0.875
1.763HisThr: 1.763 ± 0.703
0.588HisVal: 0.588 ± 0.381
0.588HisTrp: 0.588 ± 0.381
0.588HisTyr: 0.588 ± 0.381
0.0HisXaa: 0.0 ± 0.0
Ile
7.638IleAla: 7.638 ± 2.027
0.588IleCys: 0.588 ± 0.381
4.113IleAsp: 4.113 ± 0.748
1.175IleGlu: 1.175 ± 0.763
0.588IlePhe: 0.588 ± 0.381
8.813IleGly: 8.813 ± 1.935
2.35IleHis: 2.35 ± 1.104
1.763IleIle: 1.763 ± 0.703
4.7IleLys: 4.7 ± 0.877
4.113IleLeu: 4.113 ± 1.443
1.175IleMet: 1.175 ± 0.552
3.525IleAsn: 3.525 ± 1.209
1.763IlePro: 1.763 ± 0.941
1.175IleGln: 1.175 ± 0.763
2.35IleArg: 2.35 ± 0.988
9.988IleSer: 9.988 ± 2.469
6.463IleThr: 6.463 ± 0.992
5.288IleVal: 5.288 ± 1.144
0.588IleTrp: 0.588 ± 0.381
0.588IleTyr: 0.588 ± 0.381
0.0IleXaa: 0.0 ± 0.0
Lys
5.875LysAla: 5.875 ± 1.139
0.0LysCys: 0.0 ± 0.0
1.175LysAsp: 1.175 ± 0.43
2.35LysGlu: 2.35 ± 1.081
2.938LysPhe: 2.938 ± 1.205
1.763LysGly: 1.763 ± 0.575
0.0LysHis: 0.0 ± 0.0
6.463LysIle: 6.463 ± 1.999
2.35LysLys: 2.35 ± 0.988
5.288LysLeu: 5.288 ± 2.07
0.0LysMet: 0.0 ± 0.0
2.35LysAsn: 2.35 ± 1.208
2.35LysPro: 2.35 ± 0.86
4.113LysGln: 4.113 ± 2.347
2.938LysArg: 2.938 ± 0.646
2.35LysSer: 2.35 ± 0.988
3.525LysThr: 3.525 ± 1.175
5.288LysVal: 5.288 ± 1.796
1.763LysTrp: 1.763 ± 1.144
1.763LysTyr: 1.763 ± 0.575
0.0LysXaa: 0.0 ± 0.0
Leu
5.875LeuAla: 5.875 ± 1.244
1.763LeuCys: 1.763 ± 0.703
4.7LeuAsp: 4.7 ± 1.895
2.35LeuGlu: 2.35 ± 1.525
2.35LeuPhe: 2.35 ± 1.626
2.938LeuGly: 2.938 ± 0.562
0.588LeuHis: 0.588 ± 0.381
2.938LeuIle: 2.938 ± 1.133
4.113LeuLys: 4.113 ± 1.672
4.7LeuLeu: 4.7 ± 1.464
1.175LeuMet: 1.175 ± 0.722
3.525LeuAsn: 3.525 ± 1.171
5.875LeuPro: 5.875 ± 1.768
1.175LeuGln: 1.175 ± 0.43
2.35LeuArg: 2.35 ± 0.875
8.226LeuSer: 8.226 ± 1.206
8.813LeuThr: 8.813 ± 1.714
10.576LeuVal: 10.576 ± 2.316
0.588LeuTrp: 0.588 ± 0.575
2.35LeuTyr: 2.35 ± 0.86
0.0LeuXaa: 0.0 ± 0.0
Met
2.938MetAla: 2.938 ± 1.181
0.0MetCys: 0.0 ± 0.0
2.35MetAsp: 2.35 ± 1.208
0.588MetGlu: 0.588 ± 0.381
0.588MetPhe: 0.588 ± 0.575
2.938MetGly: 2.938 ± 2.235
0.0MetHis: 0.0 ± 0.0
1.175MetIle: 1.175 ± 0.763
1.175MetLys: 1.175 ± 0.43
0.0MetLeu: 0.0 ± 0.0
0.588MetMet: 0.588 ± 0.381
1.763MetAsn: 1.763 ± 0.575
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.588MetArg: 0.588 ± 0.381
1.763MetSer: 1.763 ± 0.703
1.175MetThr: 1.175 ± 0.43
2.35MetVal: 2.35 ± 0.86
0.0MetTrp: 0.0 ± 0.0
1.763MetTyr: 1.763 ± 0.476
0.0MetXaa: 0.0 ± 0.0
Asn
2.35AsnAla: 2.35 ± 1.499
1.763AsnCys: 1.763 ± 0.703
2.35AsnAsp: 2.35 ± 1.121
0.588AsnGlu: 0.588 ± 0.381
2.35AsnPhe: 2.35 ± 2.384
3.525AsnGly: 3.525 ± 0.969
1.763AsnHis: 1.763 ± 0.703
2.938AsnIle: 2.938 ± 2.874
1.175AsnLys: 1.175 ± 0.43
4.113AsnLeu: 4.113 ± 0.748
1.175AsnMet: 1.175 ± 0.564
2.938AsnAsn: 2.938 ± 1.345
4.7AsnPro: 4.7 ± 0.879
0.0AsnGln: 0.0 ± 0.0
1.763AsnArg: 1.763 ± 0.703
7.051AsnSer: 7.051 ± 1.412
3.525AsnThr: 3.525 ± 0.969
4.7AsnVal: 4.7 ± 1.199
0.0AsnTrp: 0.0 ± 0.0
0.588AsnTyr: 0.588 ± 0.575
0.588AsnXaa: 0.588 ± 0.381
Pro
1.175ProAla: 1.175 ± 0.763
0.588ProCys: 0.588 ± 0.575
5.288ProAsp: 5.288 ± 2.11
1.763ProGlu: 1.763 ± 0.476
1.175ProPhe: 1.175 ± 0.552
4.113ProGly: 4.113 ± 1.093
1.175ProHis: 1.175 ± 0.552
2.35ProIle: 2.35 ± 0.86
1.175ProLys: 1.175 ± 0.43
5.875ProLeu: 5.875 ± 1.292
0.0ProMet: 0.0 ± 0.0
0.588ProAsn: 0.588 ± 0.575
0.0ProPro: 0.0 ± 0.0
2.938ProGln: 2.938 ± 0.942
3.525ProArg: 3.525 ± 1.002
2.35ProSer: 2.35 ± 0.9
1.763ProThr: 1.763 ± 0.941
2.35ProVal: 2.35 ± 0.875
1.175ProTrp: 1.175 ± 0.43
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.175GlnAla: 1.175 ± 1.15
1.763GlnCys: 1.763 ± 0.703
1.175GlnAsp: 1.175 ± 0.43
0.0GlnGlu: 0.0 ± 0.0
1.763GlnPhe: 1.763 ± 0.575
4.113GlnGly: 4.113 ± 2.055
1.763GlnHis: 1.763 ± 0.703
1.175GlnIle: 1.175 ± 0.763
1.175GlnLys: 1.175 ± 1.192
1.175GlnLeu: 1.175 ± 0.43
0.588GlnMet: 0.588 ± 0.381
0.0GlnAsn: 0.0 ± 0.0
1.763GlnPro: 1.763 ± 0.575
0.0GlnGln: 0.0 ± 0.0
2.35GlnArg: 2.35 ± 1.081
1.175GlnSer: 1.175 ± 0.763
1.175GlnThr: 1.175 ± 0.552
1.175GlnVal: 1.175 ± 0.552
0.0GlnTrp: 0.0 ± 0.0
1.175GlnTyr: 1.175 ± 1.154
0.0GlnXaa: 0.0 ± 0.0
Arg
5.288ArgAla: 5.288 ± 1.454
1.763ArgCys: 1.763 ± 1.287
2.938ArgAsp: 2.938 ± 0.94
1.175ArgGlu: 1.175 ± 0.43
2.938ArgPhe: 2.938 ± 0.94
4.7ArgGly: 4.7 ± 2.163
1.763ArgHis: 1.763 ± 0.703
2.938ArgIle: 2.938 ± 0.646
5.875ArgLys: 5.875 ± 0.781
5.875ArgLeu: 5.875 ± 0.837
1.175ArgMet: 1.175 ± 0.43
1.763ArgAsn: 1.763 ± 1.144
2.35ArgPro: 2.35 ± 0.988
0.588ArgGln: 0.588 ± 0.381
8.226ArgArg: 8.226 ± 2.807
3.525ArgSer: 3.525 ± 0.965
2.938ArgThr: 2.938 ± 1.11
3.525ArgVal: 3.525 ± 1.882
0.0ArgTrp: 0.0 ± 0.0
1.763ArgTyr: 1.763 ± 0.575
0.0ArgXaa: 0.0 ± 0.0
Ser
7.051SerAla: 7.051 ± 1.904
0.0SerCys: 0.0 ± 0.0
4.113SerAsp: 4.113 ± 0.712
4.113SerGlu: 4.113 ± 1.135
3.525SerPhe: 3.525 ± 0.969
7.638SerGly: 7.638 ± 1.211
1.175SerHis: 1.175 ± 0.552
5.288SerIle: 5.288 ± 1.454
8.813SerLys: 8.813 ± 2.378
7.638SerLeu: 7.638 ± 0.627
1.175SerMet: 1.175 ± 1.192
1.763SerAsn: 1.763 ± 0.476
5.288SerPro: 5.288 ± 1.944
1.763SerGln: 1.763 ± 1.348
2.35SerArg: 2.35 ± 0.355
16.451SerSer: 16.451 ± 3.446
5.288SerThr: 5.288 ± 3.547
7.638SerVal: 7.638 ± 1.081
1.763SerTrp: 1.763 ± 0.703
2.938SerTyr: 2.938 ± 0.94
0.0SerXaa: 0.0 ± 0.0
Thr
1.763ThrAla: 1.763 ± 0.941
0.0ThrCys: 0.0 ± 0.0
0.0ThrAsp: 0.0 ± 0.0
2.35ThrGlu: 2.35 ± 0.355
1.763ThrPhe: 1.763 ± 0.941
5.288ThrGly: 5.288 ± 2.07
2.938ThrHis: 2.938 ± 1.322
5.875ThrIle: 5.875 ± 0.991
1.763ThrLys: 1.763 ± 0.703
3.525ThrLeu: 3.525 ± 0.822
0.0ThrMet: 0.0 ± 0.0
5.288ThrAsn: 5.288 ± 1.989
1.763ThrPro: 1.763 ± 0.703
1.763ThrGln: 1.763 ± 1.046
3.525ThrArg: 3.525 ± 1.171
6.463ThrSer: 6.463 ± 0.992
1.763ThrThr: 1.763 ± 0.941
3.525ThrVal: 3.525 ± 1.858
1.763ThrTrp: 1.763 ± 0.941
5.288ThrTyr: 5.288 ± 2.414
0.0ThrXaa: 0.0 ± 0.0
Val
3.525ValAla: 3.525 ± 1.171
1.763ValCys: 1.763 ± 0.476
2.938ValAsp: 2.938 ± 0.646
5.875ValGlu: 5.875 ± 0.781
1.763ValPhe: 1.763 ± 1.046
8.813ValGly: 8.813 ± 0.983
2.35ValHis: 2.35 ± 1.062
2.938ValIle: 2.938 ± 0.562
6.463ValLys: 6.463 ± 0.828
2.938ValLeu: 2.938 ± 1.433
2.938ValMet: 2.938 ± 1.11
8.813ValAsn: 8.813 ± 1.939
3.525ValPro: 3.525 ± 1.171
1.763ValGln: 1.763 ± 0.575
5.875ValArg: 5.875 ± 1.124
8.226ValSer: 8.226 ± 1.909
1.763ValThr: 1.763 ± 1.348
6.463ValVal: 6.463 ± 1.15
0.0ValTrp: 0.0 ± 0.0
2.35ValTyr: 2.35 ± 0.988
0.0ValXaa: 0.0 ± 0.0
Trp
0.588TrpAla: 0.588 ± 0.575
1.175TrpCys: 1.175 ± 0.552
1.175TrpAsp: 1.175 ± 0.43
0.588TrpGlu: 0.588 ± 0.381
0.588TrpPhe: 0.588 ± 0.381
0.588TrpGly: 0.588 ± 0.381
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.175TrpLys: 1.175 ± 0.552
1.763TrpLeu: 1.763 ± 0.476
0.0TrpMet: 0.0 ± 0.0
1.175TrpAsn: 1.175 ± 0.763
0.0TrpPro: 0.0 ± 0.0
0.588TrpGln: 0.588 ± 0.381
2.35TrpArg: 2.35 ± 0.355
0.0TrpSer: 0.0 ± 0.0
1.763TrpThr: 1.763 ± 0.703
0.588TrpVal: 0.588 ± 0.381
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.763TyrAla: 1.763 ± 0.941
0.588TyrCys: 0.588 ± 0.381
2.938TyrAsp: 2.938 ± 0.562
1.763TyrGlu: 1.763 ± 0.703
0.0TyrPhe: 0.0 ± 0.0
1.763TyrGly: 1.763 ± 0.476
0.0TyrHis: 0.0 ± 0.0
3.525TyrIle: 3.525 ± 2.337
0.588TyrLys: 0.588 ± 0.381
5.875TyrLeu: 5.875 ± 1.68
0.588TyrMet: 0.588 ± 0.575
1.175TyrAsn: 1.175 ± 0.763
0.0TyrPro: 0.0 ± 0.0
0.0TyrGln: 0.0 ± 0.0
1.763TyrArg: 1.763 ± 0.941
2.35TyrSer: 2.35 ± 1.062
1.763TyrThr: 1.763 ± 0.575
3.525TyrVal: 3.525 ± 2.176
0.0TyrTrp: 0.0 ± 0.0
0.588TyrTyr: 0.588 ± 0.381
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.588XaaCys: 0.588 ± 0.654
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.588XaaGly: 0.588 ± 0.381
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1703 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
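A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an assumption about the procedure, not the database's actual code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The names `pair_counts` and `bootstrap_error` and the `seed` parameter are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins, not residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    if not per_protein:
        return 0.0
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for missing keys
        if total:
            estimates.append(1000.0 * hits / total)
    # Spread of the replicate estimates; 0.0 if no replicate had any dipeptide.
    return statistics.pstdev(estimates) if estimates else 0.0


if __name__ == "__main__":
    toy = ["SSAS", "ASSK", "KLVS"]  # toy sequences, not MNSV data
    print(bootstrap_error(toy, "SS"))
```

With only 6 proteins in this proteome, each replicate is drawn from a small pool, which may explain why some errors in the table are close to the values themselves.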

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski