Amino acid dipeptide frequency for Tomato yellow leaf curl Kanchanaburi virus-[Thailand Kan1]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
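The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function names (`three_letter`, `dipeptide_permille`, `label`) are hypothetical, pairs are assumed to be counted as overlapping neighbours within each protein, and the red/blue marking is assumed to compare each value against the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the same order as the table header; Xaa is the catch-all.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def three_letter(residue):
    """Map a one-letter code to three letters; anything unknown becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each protein and never across
    protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[three_letter(first) + three_letter(second)] += 1
    total = sum(counts.values())
    codes = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(codes, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 1000 / 400 = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def label(value_permille, expected=EXPECTED_PERMILLE):
    """Classify a value relative to the expectation (assumed colour rule)."""
    if value_permille > expected:
        return "over"   # would be marked red
    if value_permille < expected:
        return "under"  # would be marked blue
    return "expected"


freqs = dipeptide_permille(["MSKRPGD", "MSSA"])  # toy sequences, not from the proteome
print(freqs["MetSer"], label(freqs["MetSer"]))
```

The result always has 441 keys, so dipeptides that never occur show up as 0.0, as they do in the table.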

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.
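As a small illustration of the unit conversion, the sketch below reads one table entry and returns the frequency and its error as plain fractions. The helper name `parse_entry` and the regular expression are assumptions made for this example; the pattern only relies on the "pair: value ± error" shape visible in the table.

```python
import re

# Matches e.g. "AlaAla: 3.032 ± 1.413"; any text before the pair is ignored.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")


def parse_entry(line):
    """Return (first, second, frequency, error) with values as plain fractions."""
    match = ENTRY.search(line)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {line!r}")
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3


first, second, freq, err = parse_entry("SerSer: 12.129 ± 3.257")
print(f"{first}-{second}: {freq:.6f} ({freq * 100:.4f}%)")
```

As a rough reading aid, 0.606‰ of the roughly 1650 residues reported for this proteome corresponds to about one occurrence of a dipeptide.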

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.032 ± 1.413
AlaCys: 1.819 ± 0.756
AlaAsp: 1.819 ± 0.846
AlaGlu: 6.064 ± 1.586
AlaPhe: 1.213 ± 0.89
AlaGly: 0.0 ± 0.0
AlaHis: 0.606 ± 0.514
AlaIle: 1.819 ± 0.876
AlaLys: 4.851 ± 1.244
AlaLeu: 5.458 ± 1.015
AlaMet: 0.0 ± 0.0
AlaAsn: 1.819 ± 1.19
AlaPro: 3.032 ± 1.355
AlaGln: 2.426 ± 1.359
AlaArg: 1.819 ± 1.541
AlaSer: 4.851 ± 2.357
AlaThr: 3.032 ± 1.404
AlaVal: 2.426 ± 0.587
AlaTrp: 1.213 ± 0.655
AlaTyr: 1.819 ± 1.043
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.213 ± 0.645
CysGlu: 0.606 ± 0.64
CysPhe: 0.606 ± 0.605
CysGly: 1.819 ± 1.001
CysHis: 1.213 ± 0.822
CysIle: 0.606 ± 0.758
CysLys: 1.213 ± 0.655
CysLeu: 1.213 ± 0.839
CysMet: 1.819 ± 1.029
CysAsn: 2.426 ± 1.035
CysPro: 1.213 ± 1.516
CysGln: 0.606 ± 0.553
CysArg: 1.213 ± 0.651
CysSer: 3.639 ± 1.18
CysThr: 1.213 ± 0.655
CysVal: 1.213 ± 1.28
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.426 ± 1.533
AspCys: 0.606 ± 0.516
AspAsp: 1.819 ± 1.043
AspGlu: 1.213 ± 0.655
AspPhe: 1.213 ± 0.839
AspGly: 2.426 ± 1.489
AspHis: 1.213 ± 0.688
AspIle: 0.606 ± 0.64
AspLys: 0.606 ± 0.553
AspLeu: 5.458 ± 1.62
AspMet: 0.606 ± 0.553
AspAsn: 2.426 ± 1.483
AspPro: 3.639 ± 1.06
AspGln: 1.819 ± 0.933
AspArg: 3.032 ± 1.008
AspSer: 3.639 ± 1.571
AspThr: 4.245 ± 0.768
AspVal: 4.245 ± 1.175
AspTrp: 0.606 ± 0.514
AspTyr: 2.426 ± 1.195
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.851 ± 1.46
GluCys: 0.0 ± 0.0
GluAsp: 1.213 ± 0.647
GluGlu: 1.819 ± 0.701
GluPhe: 4.851 ± 1.942
GluGly: 3.639 ± 1.021
GluHis: 0.606 ± 0.553
GluIle: 3.032 ± 1.129
GluLys: 3.032 ± 1.063
GluLeu: 6.064 ± 2.532
GluMet: 0.606 ± 0.502
GluAsn: 3.639 ± 2.112
GluPro: 1.819 ± 0.677
GluGln: 1.819 ± 1.404
GluArg: 0.0 ± 0.0
GluSer: 3.032 ± 1.429
GluThr: 1.213 ± 0.771
GluVal: 3.639 ± 1.249
GluTrp: 1.213 ± 0.609
GluTyr: 0.606 ± 0.502
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.606 ± 0.514
PheCys: 1.213 ± 0.687
PheAsp: 0.606 ± 0.514
PheGlu: 1.213 ± 0.655
PhePhe: 1.213 ± 0.655
PheGly: 3.639 ± 1.929
PheHis: 0.606 ± 0.514
PheIle: 0.606 ± 0.758
PheLys: 3.032 ± 1.178
PheLeu: 4.245 ± 1.948
PheMet: 1.213 ± 0.645
PheAsn: 6.064 ± 1.879
PhePro: 3.032 ± 1.105
PheGln: 3.032 ± 0.966
PheArg: 3.639 ± 0.948
PheSer: 3.639 ± 2.13
PheThr: 2.426 ± 1.199
PheVal: 1.819 ± 1.026
PheTrp: 1.819 ± 0.756
PheTyr: 1.213 ± 1.28
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.426 ± 1.091
GlyCys: 1.819 ± 0.795
GlyAsp: 2.426 ± 1.051
GlyGlu: 3.032 ± 1.136
GlyPhe: 3.032 ± 1.289
GlyGly: 1.819 ± 0.987
GlyHis: 1.819 ± 1.043
GlyIle: 3.639 ± 1.198
GlyLys: 4.851 ± 1.782
GlyLeu: 0.606 ± 0.553
GlyMet: 2.426 ± 1.016
GlyAsn: 3.032 ± 0.993
GlyPro: 3.032 ± 0.803
GlyGln: 1.213 ± 0.655
GlyArg: 1.213 ± 0.645
GlySer: 6.064 ± 1.717
GlyThr: 1.213 ± 1.106
GlyVal: 1.819 ± 0.914
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.606 ± 0.553
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.606 ± 0.64
HisCys: 0.606 ± 0.758
HisAsp: 1.819 ± 0.756
HisGlu: 1.819 ± 0.876
HisPhe: 1.819 ± 0.834
HisGly: 1.819 ± 1.229
HisHis: 0.0 ± 0.0
HisIle: 1.819 ± 0.933
HisLys: 1.819 ± 1.232
HisLeu: 2.426 ± 1.533
HisMet: 0.606 ± 0.553
HisAsn: 2.426 ± 1.541
HisPro: 1.213 ± 0.855
HisGln: 1.213 ± 0.687
HisArg: 3.032 ± 1.567
HisSer: 0.606 ± 0.516
HisThr: 3.032 ± 1.932
HisVal: 6.064 ± 2.692
HisTrp: 0.0 ± 0.0
HisTyr: 1.819 ± 0.705
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.213 ± 0.697
IleCys: 1.213 ± 0.855
IleAsp: 2.426 ± 0.934
IleGlu: 3.032 ± 1.235
IlePhe: 2.426 ± 1.076
IleGly: 3.032 ± 0.949
IleHis: 3.032 ± 1.578
IleIle: 1.819 ± 0.769
IleLys: 5.458 ± 1.301
IleLeu: 1.819 ± 1.156
IleMet: 1.213 ± 0.657
IleAsn: 3.032 ± 1.147
IlePro: 3.032 ± 1.046
IleGln: 5.458 ± 2.309
IleArg: 1.819 ± 0.788
IleSer: 1.819 ± 1.072
IleThr: 3.639 ± 1.922
IleVal: 2.426 ± 0.922
IleTrp: 2.426 ± 1.456
IleTyr: 1.213 ± 0.792
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.819 ± 0.834
LysCys: 1.213 ± 0.748
LysAsp: 3.032 ± 1.063
LysGlu: 4.851 ± 2.871
LysPhe: 4.245 ± 1.419
LysGly: 1.213 ± 0.647
LysHis: 1.819 ± 0.746
LysIle: 4.851 ± 1.494
LysLys: 4.245 ± 1.08
LysLeu: 2.426 ± 1.116
LysMet: 0.606 ± 0.502
LysAsn: 7.277 ± 2.597
LysPro: 3.032 ± 0.889
LysGln: 1.819 ± 1.071
LysArg: 3.032 ± 2.044
LysSer: 4.851 ± 1.28
LysThr: 2.426 ± 1.035
LysVal: 5.458 ± 1.002
LysTrp: 0.0 ± 0.0
LysTyr: 4.245 ± 1.631
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.032 ± 1.538
LeuCys: 4.245 ± 0.936
LeuAsp: 3.032 ± 1.45
LeuGlu: 3.032 ± 1.621
LeuPhe: 3.032 ± 0.571
LeuGly: 3.639 ± 1.298
LeuHis: 5.458 ± 1.349
LeuIle: 1.819 ± 0.922
LeuLys: 8.49 ± 2.015
LeuLeu: 4.851 ± 1.596
LeuMet: 0.606 ± 0.539
LeuAsn: 4.245 ± 1.188
LeuPro: 1.213 ± 1.28
LeuGln: 6.064 ± 1.997
LeuArg: 4.851 ± 2.43
LeuSer: 6.064 ± 2.376
LeuThr: 2.426 ± 0.711
LeuVal: 1.819 ± 0.677
LeuTrp: 0.606 ± 0.605
LeuTyr: 3.032 ± 0.527
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.606 ± 0.64
MetCys: 0.606 ± 0.539
MetAsp: 1.819 ± 0.929
MetGlu: 2.426 ± 1.231
MetPhe: 1.213 ± 0.867
MetGly: 2.426 ± 0.915
MetHis: 0.606 ± 0.502
MetIle: 0.0 ± 0.0
MetLys: 1.213 ± 0.694
MetLeu: 1.213 ± 0.839
MetMet: 0.0 ± 0.0
MetAsn: 0.606 ± 0.502
MetPro: 1.819 ± 0.677
MetGln: 1.213 ± 0.697
MetArg: 1.819 ± 1.143
MetSer: 3.032 ± 1.441
MetThr: 0.606 ± 0.539
MetVal: 0.606 ± 0.514
MetTrp: 1.213 ± 0.855
MetTyr: 1.213 ± 1.28
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.851 ± 1.358
AsnCys: 2.426 ± 1.018
AsnAsp: 3.639 ± 1.326
AsnGlu: 2.426 ± 0.934
AsnPhe: 1.213 ± 0.77
AsnGly: 3.032 ± 0.862
AsnHis: 3.639 ± 1.813
AsnIle: 3.032 ± 1.143
AsnLys: 1.213 ± 0.651
AsnLeu: 2.426 ± 1.541
AsnMet: 3.639 ± 1.322
AsnAsn: 6.064 ± 1.838
AsnPro: 3.639 ± 1.014
AsnGln: 2.426 ± 0.933
AsnArg: 3.032 ± 1.076
AsnSer: 6.064 ± 2.395
AsnThr: 2.426 ± 1.001
AsnVal: 7.884 ± 1.696
AsnTrp: 0.606 ± 0.514
AsnTyr: 3.639 ± 1.135
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.606 ± 0.64
ProCys: 1.819 ± 0.95
ProAsp: 1.819 ± 1.143
ProGlu: 1.819 ± 1.523
ProPhe: 3.032 ± 1.656
ProGly: 1.819 ± 0.631
ProHis: 2.426 ± 1.621
ProIle: 5.458 ± 1.631
ProLys: 4.245 ± 1.882
ProLeu: 3.032 ± 1.36
ProMet: 1.819 ± 1.425
ProAsn: 1.819 ± 0.834
ProPro: 1.213 ± 0.661
ProGln: 4.245 ± 1.56
ProArg: 2.426 ± 1.429
ProSer: 4.851 ± 0.729
ProThr: 3.639 ± 1.243
ProVal: 3.639 ± 1.59
ProTrp: 1.213 ± 0.651
ProTyr: 2.426 ± 0.587
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.032 ± 1.191
GlnCys: 2.426 ± 1.541
GlnAsp: 0.606 ± 0.502
GlnGlu: 1.819 ± 0.68
GlnPhe: 1.819 ± 0.746
GlnGly: 3.032 ± 1.046
GlnHis: 1.213 ± 0.853
GlnIle: 1.213 ± 0.771
GlnLys: 1.213 ± 0.688
GlnLeu: 3.639 ± 1.149
GlnMet: 1.819 ± 0.953
GlnAsn: 1.819 ± 1.232
GlnPro: 3.032 ± 1.422
GlnGln: 3.639 ± 1.562
GlnArg: 3.639 ± 1.187
GlnSer: 6.671 ± 2.344
GlnThr: 3.032 ± 1.006
GlnVal: 3.639 ± 0.683
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.032 ± 1.008
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.426 ± 1.107
ArgCys: 0.606 ± 0.758
ArgAsp: 3.032 ± 1.356
ArgGlu: 3.032 ± 1.117
ArgPhe: 3.639 ± 1.844
ArgGly: 2.426 ± 1.172
ArgHis: 1.819 ± 0.895
ArgIle: 4.245 ± 0.906
ArgLys: 1.819 ± 0.769
ArgLeu: 3.032 ± 1.44
ArgMet: 0.606 ± 0.502
ArgAsn: 2.426 ± 0.94
ArgPro: 4.245 ± 1.676
ArgGln: 1.819 ± 0.926
ArgArg: 9.096 ± 4.119
ArgSer: 4.245 ± 1.423
ArgThr: 2.426 ± 0.933
ArgVal: 5.458 ± 1.585
ArgTrp: 1.213 ± 0.77
ArgTyr: 1.819 ± 0.884
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.884 ± 1.44
SerCys: 0.0 ± 0.0
SerAsp: 4.851 ± 1.052
SerGlu: 2.426 ± 1.144
SerPhe: 3.639 ± 0.887
SerGly: 2.426 ± 2.211
SerHis: 0.606 ± 0.516
SerIle: 4.245 ± 1.358
SerLys: 7.884 ± 0.903
SerLeu: 6.064 ± 2.675
SerMet: 1.213 ± 0.679
SerAsn: 7.884 ± 2.556
SerPro: 6.064 ± 1.984
SerGln: 4.245 ± 0.856
SerArg: 6.064 ± 1.933
SerSer: 12.129 ± 3.257
SerThr: 3.639 ± 0.68
SerVal: 3.032 ± 1.694
SerTrp: 0.606 ± 0.514
SerTyr: 6.064 ± 1.683
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.032 ± 1.383
ThrCys: 0.0 ± 0.0
ThrAsp: 1.819 ± 0.926
ThrGlu: 3.032 ± 1.036
ThrPhe: 0.606 ± 0.502
ThrGly: 3.639 ± 0.895
ThrHis: 4.245 ± 2.118
ThrIle: 4.851 ± 0.897
ThrLys: 1.819 ± 0.88
ThrLeu: 4.245 ± 0.889
ThrMet: 1.213 ± 1.009
ThrAsn: 2.426 ± 1.374
ThrPro: 2.426 ± 1.001
ThrGln: 0.606 ± 0.539
ThrArg: 2.426 ± 1.092
ThrSer: 4.851 ± 0.713
ThrThr: 4.245 ± 1.067
ThrVal: 3.639 ± 1.312
ThrTrp: 1.819 ± 0.933
ThrTyr: 0.606 ± 0.514
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.819 ± 1.518
ValCys: 0.606 ± 0.514
ValAsp: 4.245 ± 1.191
ValGlu: 1.213 ± 0.688
ValPhe: 2.426 ± 1.264
ValGly: 1.213 ± 0.697
ValHis: 1.819 ± 0.884
ValIle: 5.458 ± 1.666
ValLys: 3.639 ± 0.597
ValLeu: 7.277 ± 1.512
ValMet: 1.213 ± 1.28
ValAsn: 4.245 ± 1.243
ValPro: 4.245 ± 0.775
ValGln: 5.458 ± 1.422
ValArg: 1.819 ± 1.178
ValSer: 6.671 ± 2.182
ValThr: 4.245 ± 2.196
ValVal: 6.064 ± 2.381
ValTrp: 0.606 ± 0.553
ValTyr: 6.064 ± 1.808
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.426 ± 1.533
TrpCys: 0.0 ± 0.0
TrpAsp: 1.213 ± 0.994
TrpGlu: 0.606 ± 0.605
TrpPhe: 0.606 ± 0.539
TrpGly: 0.606 ± 0.514
TrpHis: 1.213 ± 0.748
TrpIle: 0.0 ± 0.0
TrpLys: 0.606 ± 0.64
TrpLeu: 1.213 ± 0.77
TrpMet: 0.606 ± 0.64
TrpAsn: 0.606 ± 0.553
TrpPro: 0.0 ± 0.0
TrpGln: 0.606 ± 0.514
TrpArg: 3.032 ± 0.833
TrpSer: 0.0 ± 0.0
TrpThr: 0.606 ± 0.605
TrpVal: 0.606 ± 0.553
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.606 ± 0.514
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.426 ± 1.279
TyrCys: 0.606 ± 0.516
TyrAsp: 1.819 ± 1.304
TyrGlu: 1.213 ± 0.655
TyrPhe: 3.032 ± 0.527
TyrGly: 2.426 ± 0.688
TyrHis: 0.606 ± 0.514
TyrIle: 2.426 ± 1.063
TyrLys: 1.213 ± 0.77
TyrLeu: 4.851 ± 1.796
TyrMet: 1.819 ± 0.898
TyrAsn: 3.639 ± 0.785
TyrPro: 2.426 ± 1.082
TyrGln: 0.606 ± 0.553
TyrArg: 2.426 ± 1.452
TyrSer: 4.245 ± 1.386
TyrThr: 1.819 ± 1.019
TyrVal: 4.851 ± 1.375
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.819 ± 0.83
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1650 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
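A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's procedure: the function names (`pair_permille`, `bootstrap_error`) are hypothetical, whole proteins are resampled with replacement 100 times, and the reported error is assumed to be the standard deviation of the resampled frequencies.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. "SS")."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome holds, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


toy = ["MSSKRPGD", "MSSA", "MKRSSSL"]  # toy sequences, not from the proteome
print(bootstrap_error(toy, "SS"))
```

Under this scheme a dipeptide that never occurs gets an error of exactly 0.0, which matches the "0.0 ± 0.0" entries in the table. With only 8 proteins the resamples differ considerably from one another, so relatively large errors are to be expected.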

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski