Amino acid dipeptide frequency for Nectarine stem pitting associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
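
The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the procedure used to build the table: it assumes one-letter amino acid codes (the table uses three-letter codes), counts overlapping adjacent pairs within each protein, maps every non-standard letter to X (Xaa), and compares each frequency with the uniform expectation of 2.5‰. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Anything outside the 20 standard residues is treated as X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs within one protein (an assumption of this sketch).
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_permille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


# Hypothetical toy sequences, not taken from the proteome on this page.
toy = ["MKKLAV", "MAKKB"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), representation(freqs[pair]))
```

With real data the sequences would come from the proteome's FASTA file; the toy input only shows the shape of the result.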

All values are presented in per mille (‰); to convert them to fractions, multiply by 10^-3.
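
As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 7.345  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.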

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.345 ± 1.269
AlaCys: 0.525 ± 0.46
AlaAsp: 0.0 ± 0.0
AlaGlu: 5.247 ± 2.188
AlaPhe: 7.345 ± 1.132
AlaGly: 2.623 ± 1.775
AlaHis: 0.0 ± 0.0
AlaIle: 4.197 ± 1.582
AlaLys: 10.493 ± 3.22
AlaLeu: 4.722 ± 1.361
AlaMet: 4.722 ± 0.95
AlaAsn: 2.623 ± 0.797
AlaPro: 4.722 ± 2.0
AlaGln: 2.099 ± 0.828
AlaArg: 4.197 ± 1.226
AlaSer: 4.722 ± 0.699
AlaThr: 1.574 ± 0.526
AlaVal: 3.148 ± 0.897
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.623 ± 0.575
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.525 ± 0.354
CysCys: 0.0 ± 0.0
CysAsp: 2.099 ± 0.681
CysGlu: 1.049 ± 0.709
CysPhe: 1.049 ± 0.486
CysGly: 0.525 ± 0.354
CysHis: 0.525 ± 0.46
CysIle: 1.574 ± 0.652
CysLys: 1.049 ± 0.345
CysLeu: 0.525 ± 0.354
CysMet: 0.0 ± 0.0
CysAsn: 1.049 ± 0.709
CysPro: 1.574 ± 0.371
CysGln: 1.049 ± 0.345
CysArg: 2.623 ± 1.094
CysSer: 2.099 ± 0.689
CysThr: 1.574 ± 1.068
CysVal: 1.049 ± 0.345
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.296 ± 0.951
AspCys: 1.574 ± 0.732
AspAsp: 6.821 ± 0.656
AspGlu: 4.197 ± 0.753
AspPhe: 3.673 ± 1.185
AspGly: 2.099 ± 0.33
AspHis: 1.049 ± 0.345
AspIle: 1.574 ± 0.521
AspLys: 2.623 ± 0.464
AspLeu: 2.099 ± 0.589
AspMet: 2.099 ± 0.971
AspAsn: 2.099 ± 0.828
AspPro: 0.525 ± 0.354
AspGln: 3.148 ± 0.458
AspArg: 0.525 ± 0.354
AspSer: 2.099 ± 0.689
AspThr: 2.623 ± 0.375
AspVal: 4.722 ± 0.868
AspTrp: 1.574 ± 0.371
AspTyr: 2.099 ± 0.503
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.345 ± 2.448
GluCys: 3.148 ± 0.742
GluAsp: 5.247 ± 1.567
GluGlu: 5.771 ± 2.382
GluPhe: 0.525 ± 0.354
GluGly: 0.525 ± 0.46
GluHis: 0.525 ± 0.354
GluIle: 4.722 ± 0.413
GluLys: 3.673 ± 0.925
GluLeu: 5.771 ± 1.558
GluMet: 0.0 ± 0.0
GluAsn: 2.099 ± 1.173
GluPro: 1.574 ± 0.526
GluGln: 3.148 ± 0.79
GluArg: 1.574 ± 1.063
GluSer: 2.623 ± 1.094
GluThr: 3.148 ± 0.742
GluVal: 8.395 ± 3.155
GluTrp: 0.525 ± 0.46
GluTyr: 3.673 ± 0.606
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 2.099 ± 0.33
PheAsp: 1.574 ± 0.652
PheGlu: 4.197 ± 0.491
PhePhe: 0.0 ± 0.0
PheGly: 3.148 ± 0.984
PheHis: 2.623 ± 1.206
PheIle: 6.296 ± 1.678
PheLys: 2.099 ± 0.589
PheLeu: 4.722 ± 1.931
PheMet: 1.574 ± 0.393
PheAsn: 1.049 ± 0.709
PhePro: 2.623 ± 0.797
PheGln: 2.623 ± 1.094
PheArg: 1.574 ± 0.526
PheSer: 4.722 ± 0.363
PheThr: 3.148 ± 1.505
PheVal: 3.673 ± 0.839
PheTrp: 2.099 ± 1.47
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.197 ± 2.228
GlyCys: 1.049 ± 0.921
GlyAsp: 4.197 ± 1.015
GlyGlu: 2.099 ± 0.971
GlyPhe: 2.623 ± 1.048
GlyGly: 1.049 ± 0.345
GlyHis: 1.574 ± 0.732
GlyIle: 1.574 ± 0.521
GlyLys: 4.722 ± 1.477
GlyLeu: 2.099 ± 1.417
GlyMet: 0.525 ± 0.407
GlyAsn: 2.623 ± 0.643
GlyPro: 3.148 ± 2.207
GlyGln: 1.049 ± 0.736
GlyArg: 1.574 ± 1.063
GlySer: 2.623 ± 1.048
GlyThr: 1.049 ± 0.486
GlyVal: 5.247 ± 1.298
GlyTrp: 1.574 ± 1.068
GlyTyr: 2.099 ± 0.828
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.574 ± 0.652
HisAsp: 2.623 ± 1.206
HisGlu: 0.0 ± 0.0
HisPhe: 2.099 ± 0.681
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 0.525 ± 0.46
HisMet: 0.0 ± 0.0
HisAsn: 2.099 ± 0.931
HisPro: 1.574 ± 0.371
HisGln: 1.049 ± 0.486
HisArg: 2.623 ± 1.248
HisSer: 3.673 ± 1.922
HisThr: 1.574 ± 1.068
HisVal: 1.574 ± 0.652
HisTrp: 1.574 ± 1.068
HisTyr: 0.525 ± 0.46
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.722 ± 0.975
IleCys: 0.525 ± 0.354
IleAsp: 5.247 ± 0.75
IleGlu: 2.623 ± 1.094
IlePhe: 2.623 ± 1.094
IleGly: 0.525 ± 0.354
IleHis: 0.0 ± 0.0
IleIle: 3.673 ± 0.192
IleLys: 3.148 ± 1.457
IleLeu: 1.049 ± 0.709
IleMet: 1.049 ± 0.736
IleAsn: 1.049 ± 0.709
IlePro: 2.099 ± 0.33
IleGln: 1.574 ± 0.371
IleArg: 5.247 ± 1.593
IleSer: 6.821 ± 0.656
IleThr: 1.574 ± 1.068
IleVal: 4.197 ± 1.099
IleTrp: 1.049 ± 0.486
IleTyr: 2.099 ± 0.931
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.296 ± 1.033
LysCys: 1.049 ± 0.486
LysAsp: 2.099 ± 0.33
LysGlu: 3.673 ± 0.995
LysPhe: 4.722 ± 1.161
LysGly: 2.623 ± 0.643
LysHis: 2.623 ± 1.094
LysIle: 5.771 ± 1.823
LysLys: 9.444 ± 1.137
LysLeu: 4.722 ± 0.975
LysMet: 1.574 ± 1.063
LysAsn: 2.099 ± 0.33
LysPro: 2.623 ± 1.048
LysGln: 3.673 ± 1.488
LysArg: 4.197 ± 1.861
LysSer: 5.247 ± 2.957
LysThr: 5.771 ± 0.836
LysVal: 5.247 ± 0.862
LysTrp: 3.148 ± 0.755
LysTyr: 0.525 ± 0.354
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.673 ± 1.568
LeuCys: 0.525 ± 0.354
LeuAsp: 5.247 ± 1.012
LeuGlu: 4.722 ± 1.477
LeuPhe: 0.0 ± 0.0
LeuGly: 3.673 ± 1.342
LeuHis: 1.574 ± 0.521
LeuIle: 2.623 ± 0.731
LeuLys: 7.87 ± 1.36
LeuLeu: 4.722 ± 1.352
LeuMet: 2.623 ± 1.094
LeuAsn: 4.197 ± 1.118
LeuPro: 2.623 ± 0.643
LeuGln: 2.099 ± 0.681
LeuArg: 3.673 ± 1.562
LeuSer: 5.771 ± 1.362
LeuThr: 3.148 ± 0.897
LeuVal: 1.574 ± 1.063
LeuTrp: 2.099 ± 0.589
LeuTyr: 2.623 ± 0.375
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.099 ± 0.589
MetCys: 0.525 ± 0.354
MetAsp: 2.099 ± 0.971
MetGlu: 2.099 ± 0.931
MetPhe: 1.574 ± 0.652
MetGly: 1.049 ± 0.709
MetHis: 0.525 ± 0.354
MetIle: 1.574 ± 0.652
MetLys: 2.099 ± 0.807
MetLeu: 0.0 ± 0.0
MetMet: 1.574 ± 0.652
MetAsn: 0.525 ± 0.354
MetPro: 0.0 ± 0.0
MetGln: 0.525 ± 0.354
MetArg: 3.148 ± 0.79
MetSer: 5.247 ± 0.75
MetThr: 0.0 ± 0.0
MetVal: 3.673 ± 0.925
MetTrp: 0.0 ± 0.0
MetTyr: 0.525 ± 0.46
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.722 ± 1.161
AsnCys: 0.0 ± 0.0
AsnAsp: 3.148 ± 0.864
AsnGlu: 0.525 ± 0.354
AsnPhe: 0.525 ± 0.46
AsnGly: 3.673 ± 1.278
AsnHis: 0.0 ± 0.0
AsnIle: 3.148 ± 1.305
AsnLys: 1.049 ± 0.709
AsnLeu: 2.623 ± 0.375
AsnMet: 0.525 ± 0.354
AsnAsn: 0.525 ± 0.354
AsnPro: 2.623 ± 0.375
AsnGln: 2.623 ± 0.575
AsnArg: 3.673 ± 1.278
AsnSer: 2.623 ± 0.816
AsnThr: 2.099 ± 0.807
AsnVal: 1.574 ± 0.526
AsnTrp: 1.574 ± 0.526
AsnTyr: 3.148 ± 2.081
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.722 ± 1.323
ProCys: 1.049 ± 0.921
ProAsp: 1.049 ± 0.345
ProGlu: 2.623 ± 0.464
ProPhe: 1.574 ± 0.521
ProGly: 1.574 ± 1.068
ProHis: 0.0 ± 0.0
ProIle: 1.049 ± 0.345
ProLys: 3.673 ± 2.499
ProLeu: 2.623 ± 1.206
ProMet: 1.574 ± 0.521
ProAsn: 1.049 ± 0.486
ProPro: 8.395 ± 5.184
ProGln: 2.099 ± 0.807
ProArg: 6.296 ± 0.791
ProSer: 4.197 ± 0.558
ProThr: 5.247 ± 2.604
ProVal: 5.247 ± 1.012
ProTrp: 0.525 ± 0.46
ProTyr: 1.574 ± 0.371
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.623 ± 0.375
GlnCys: 0.525 ± 0.354
GlnAsp: 0.0 ± 0.0
GlnGlu: 3.673 ± 0.839
GlnPhe: 3.673 ± 0.969
GlnGly: 4.197 ± 1.273
GlnHis: 2.099 ± 0.931
GlnIle: 0.0 ± 0.0
GlnLys: 2.623 ± 0.731
GlnLeu: 4.722 ± 1.957
GlnMet: 1.049 ± 0.486
GlnAsn: 3.148 ± 1.042
GlnPro: 5.247 ± 3.55
GlnGln: 1.049 ± 0.736
GlnArg: 3.148 ± 1.241
GlnSer: 1.574 ± 0.732
GlnThr: 1.049 ± 0.736
GlnVal: 3.673 ± 1.562
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.049 ± 0.736
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.197 ± 1.735
ArgCys: 2.099 ± 1.417
ArgAsp: 1.574 ± 0.526
ArgGlu: 3.673 ± 0.925
ArgPhe: 2.099 ± 0.33
ArgGly: 4.197 ± 0.338
ArgHis: 2.623 ± 1.094
ArgIle: 3.673 ± 0.969
ArgLys: 2.099 ± 0.689
ArgLeu: 4.197 ± 1.861
ArgMet: 2.623 ± 1.248
ArgAsn: 4.197 ± 0.491
ArgPro: 1.049 ± 0.486
ArgGln: 3.148 ± 1.241
ArgArg: 3.673 ± 0.695
ArgSer: 4.722 ± 2.0
ArgThr: 1.574 ± 0.526
ArgVal: 1.574 ± 0.652
ArgTrp: 1.049 ± 0.736
ArgTyr: 2.623 ± 0.375
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.623 ± 0.575
SerCys: 1.574 ± 0.521
SerAsp: 4.197 ± 1.273
SerGlu: 3.673 ± 1.562
SerPhe: 6.821 ± 1.763
SerGly: 8.395 ± 1.301
SerHis: 1.574 ± 0.371
SerIle: 1.574 ± 0.732
SerLys: 8.395 ± 1.622
SerLeu: 2.623 ± 1.248
SerMet: 2.099 ± 0.487
SerAsn: 2.623 ± 1.206
SerPro: 6.821 ± 1.749
SerGln: 5.247 ± 1.798
SerArg: 2.623 ± 0.464
SerSer: 4.722 ± 2.684
SerThr: 3.148 ± 0.458
SerVal: 5.771 ± 3.206
SerTrp: 0.0 ± 0.0
SerTyr: 1.574 ± 0.526
SerXaa: 0.525 ± 0.46
Thr
ThrAla: 3.673 ± 1.137
ThrCys: 0.0 ± 0.0
ThrAsp: 0.525 ± 0.46
ThrGlu: 2.099 ± 0.589
ThrPhe: 3.673 ± 0.775
ThrGly: 2.099 ± 0.589
ThrHis: 1.574 ± 1.068
ThrIle: 3.148 ± 1.241
ThrLys: 5.247 ± 1.593
ThrLeu: 4.197 ± 0.558
ThrMet: 2.099 ± 0.971
ThrAsn: 0.525 ± 0.46
ThrPro: 4.722 ± 1.005
ThrGln: 3.673 ± 0.192
ThrArg: 1.574 ± 0.526
ThrSer: 3.148 ± 2.135
ThrThr: 3.148 ± 0.458
ThrVal: 1.574 ± 0.526
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.049 ± 0.709
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.296 ± 0.964
ValCys: 0.525 ± 0.354
ValAsp: 4.197 ± 0.659
ValGlu: 11.018 ± 2.776
ValPhe: 4.197 ± 1.861
ValGly: 1.574 ± 1.063
ValHis: 3.673 ± 0.925
ValIle: 2.623 ± 0.643
ValLys: 3.148 ± 1.052
ValLeu: 6.821 ± 1.039
ValMet: 2.099 ± 0.589
ValAsn: 3.148 ± 0.999
ValPro: 3.148 ± 0.755
ValGln: 2.099 ± 0.589
ValArg: 1.574 ± 0.652
ValSer: 5.771 ± 1.436
ValThr: 3.148 ± 0.431
ValVal: 6.296 ± 0.956
ValTrp: 0.525 ± 0.46
ValTyr: 3.673 ± 0.839
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.525 ± 0.46
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.525 ± 0.46
TrpPhe: 0.525 ± 0.354
TrpGly: 2.099 ± 0.681
TrpHis: 0.0 ± 0.0
TrpIle: 1.049 ± 0.486
TrpLys: 0.525 ± 0.354
TrpLeu: 4.722 ± 2.208
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.525 ± 0.46
TrpGln: 1.574 ± 0.521
TrpArg: 1.049 ± 0.345
TrpSer: 1.049 ± 0.736
TrpThr: 2.099 ± 0.681
TrpVal: 1.574 ± 0.371
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.049 ± 0.486
TyrCys: 1.574 ± 0.652
TyrAsp: 1.574 ± 0.652
TyrGlu: 0.525 ± 0.46
TyrPhe: 1.049 ± 0.709
TyrGly: 1.049 ± 0.709
TyrHis: 0.525 ± 0.354
TyrIle: 1.049 ± 0.486
TyrLys: 3.148 ± 0.999
TyrLeu: 1.574 ± 0.652
TyrMet: 0.525 ± 0.46
TyrAsn: 3.673 ± 1.137
TyrPro: 0.525 ± 0.46
TyrGln: 1.574 ± 0.521
TyrArg: 1.574 ± 0.521
TyrSer: 3.148 ± 0.864
TyrThr: 1.049 ± 0.345
TyrVal: 5.771 ± 1.937
TyrTrp: 0.525 ± 0.354
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.525 ± 0.46
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1907 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
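
A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's actual code: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The choice of standard deviation, the fixed seed, the function names, and the toy sequences are all assumptions of this sketch.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs in all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample at the protein level: draw as many proteins as the proteome has.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    # Population standard deviation of the resampled estimates (an assumption).
    return statistics.pstdev(estimates)


# Hypothetical toy proteome with one-letter codes, not the four proteins of this page.
toy = ["MKKLAVKK", "MAKLSE", "MKKKPV", "MSSEVLA"]
print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

With only a handful of proteins, as here, each resample can differ strongly from the original set, which is consistent with the fairly large errors reported for some dipeptides in the table.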

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski