Amino acid dipeptide frequency for Tomato leaf curl Cameroon virus - [Cameroon:Buea:Okra:2008]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins (with all 20 amino acids equally likely), each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
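The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the names `ONE_TO_THREE`, `dipeptide_per_mille` and `classify` are hypothetical, and the sketch assumes that pairs are counted within each protein (never across protein boundaries) and pooled over the whole proteome, which the page does not state.

```python
from collections import Counter

# One-letter to three-letter codes; anything else is mapped to Xaa
# (assumption for this sketch: Xaa covers every non-standard or unknown residue).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping adjacent pairs inside a single protein.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "as expected"
```

With the values from the table, `classify(7.07)` (AlaAla) gives "over" and `classify(0.786)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.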

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 7.07‰ = 0.00707).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.07 ± 3.139
AlaCys: 0.786 ± 0.827
AlaAsp: 1.571 ± 0.919
AlaGlu: 3.142 ± 1.176
AlaPhe: 1.571 ± 1.704
AlaGly: 2.357 ± 1.665
AlaHis: 2.357 ± 1.078
AlaIle: 3.142 ± 1.288
AlaLys: 3.928 ± 1.197
AlaLeu: 3.928 ± 1.71
AlaMet: 1.571 ± 1.135
AlaAsn: 2.357 ± 1.099
AlaPro: 3.142 ± 1.651
AlaGln: 1.571 ± 0.914
AlaArg: 3.928 ± 1.872
AlaSer: 4.713 ± 2.262
AlaThr: 3.928 ± 2.475
AlaVal: 3.142 ± 1.376
AlaTrp: 2.357 ± 1.281
AlaTyr: 0.786 ± 0.852
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.786 ± 0.789
CysCys: 1.571 ± 1.597
CysAsp: 0.0 ± 0.0
CysGlu: 0.786 ± 0.826
CysPhe: 1.571 ± 1.059
CysGly: 2.357 ± 1.07
CysHis: 0.0 ± 0.0
CysIle: 1.571 ± 1.653
CysLys: 1.571 ± 1.052
CysLeu: 0.0 ± 0.0
CysMet: 0.786 ± 0.799
CysAsn: 1.571 ± 0.793
CysPro: 1.571 ± 1.597
CysGln: 1.571 ± 1.214
CysArg: 0.786 ± 0.607
CysSer: 2.357 ± 0.888
CysThr: 3.928 ± 2.224
CysVal: 0.786 ± 0.826
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.571 ± 0.862
AspCys: 0.786 ± 0.751
AspAsp: 2.357 ± 0.96
AspGlu: 3.142 ± 1.134
AspPhe: 0.0 ± 0.0
AspGly: 2.357 ± 1.304
AspHis: 0.786 ± 0.776
AspIle: 3.142 ± 1.797
AspLys: 0.786 ± 0.607
AspLeu: 6.284 ± 1.814
AspMet: 0.786 ± 0.751
AspAsn: 3.142 ± 1.483
AspPro: 1.571 ± 0.914
AspGln: 0.786 ± 0.607
AspArg: 2.357 ± 0.913
AspSer: 5.499 ± 1.169
AspThr: 2.357 ± 1.17
AspVal: 5.499 ± 2.494
AspTrp: 1.571 ± 0.792
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.499 ± 2.12
GluCys: 0.786 ± 0.789
GluAsp: 0.0 ± 0.0
GluGlu: 7.855 ± 3.31
GluPhe: 3.928 ± 1.81
GluGly: 3.928 ± 1.121
GluHis: 0.786 ± 0.751
GluIle: 2.357 ± 1.715
GluLys: 2.357 ± 1.821
GluLeu: 5.499 ± 1.422
GluMet: 0.0 ± 0.0
GluAsn: 3.928 ± 1.109
GluPro: 3.928 ± 1.794
GluGln: 2.357 ± 1.256
GluArg: 0.786 ± 0.776
GluSer: 1.571 ± 0.868
GluThr: 0.786 ± 0.789
GluVal: 1.571 ± 1.234
GluTrp: 2.357 ± 1.17
GluTyr: 1.571 ± 0.914
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.571 ± 1.122
PheAsp: 2.357 ± 0.913
PheGlu: 1.571 ± 0.848
PhePhe: 3.142 ± 1.139
PheGly: 0.0 ± 0.0
PheHis: 1.571 ± 0.914
PheIle: 2.357 ± 0.934
PheLys: 3.928 ± 1.69
PheLeu: 6.284 ± 1.7
PheMet: 0.786 ± 0.607
PheAsn: 3.142 ± 2.265
PhePro: 0.786 ± 0.799
PheGln: 3.142 ± 1.687
PheArg: 3.142 ± 1.116
PheSer: 2.357 ± 1.328
PheThr: 3.142 ± 1.115
PheVal: 1.571 ± 1.117
PheTrp: 0.0 ± 0.0
PheTyr: 1.571 ± 1.129
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.142 ± 1.371
GlyCys: 3.928 ± 1.235
GlyAsp: 4.713 ± 1.683
GlyGlu: 3.142 ± 1.145
GlyPhe: 1.571 ± 0.973
GlyGly: 2.357 ± 1.24
GlyHis: 2.357 ± 1.093
GlyIle: 7.07 ± 1.89
GlyLys: 3.928 ± 2.014
GlyLeu: 1.571 ± 1.098
GlyMet: 0.0 ± 0.0
GlyAsn: 0.786 ± 0.852
GlyPro: 4.713 ± 1.704
GlyGln: 0.786 ± 0.828
GlyArg: 1.571 ± 0.868
GlySer: 2.357 ± 0.997
GlyThr: 1.571 ± 1.22
GlyVal: 1.571 ± 1.551
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.786 ± 0.827
HisCys: 2.357 ± 1.582
HisAsp: 2.357 ± 1.711
HisGlu: 1.571 ± 0.868
HisPhe: 1.571 ± 1.214
HisGly: 2.357 ± 1.264
HisHis: 0.786 ± 0.789
HisIle: 3.142 ± 1.715
HisLys: 1.571 ± 1.087
HisLeu: 3.928 ± 1.351
HisMet: 0.786 ± 0.782
HisAsn: 3.928 ± 1.185
HisPro: 1.571 ± 0.919
HisGln: 2.357 ± 1.343
HisArg: 3.142 ± 1.637
HisSer: 0.786 ± 0.852
HisThr: 2.357 ± 1.246
HisVal: 5.499 ± 1.228
HisTrp: 0.0 ± 0.0
HisTyr: 0.786 ± 0.607
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.571 ± 1.179
IleCys: 1.571 ± 0.848
IleAsp: 3.142 ± 1.756
IleGlu: 1.571 ± 1.214
IlePhe: 3.142 ± 1.756
IleGly: 2.357 ± 1.447
IleHis: 1.571 ± 1.041
IleIle: 6.284 ± 2.23
IleLys: 7.855 ± 1.684
IleLeu: 2.357 ± 1.658
IleMet: 1.571 ± 0.979
IleAsn: 3.142 ± 1.07
IlePro: 0.786 ± 0.607
IleGln: 4.713 ± 1.15
IleArg: 5.499 ± 2.439
IleSer: 6.284 ± 1.936
IleThr: 4.713 ± 2.721
IleVal: 2.357 ± 0.96
IleTrp: 1.571 ± 1.551
IleTyr: 1.571 ± 1.091
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.142 ± 1.116
LysCys: 1.571 ± 0.868
LysAsp: 0.786 ± 0.607
LysGlu: 3.928 ± 2.311
LysPhe: 3.142 ± 1.175
LysGly: 1.571 ± 0.792
LysHis: 2.357 ± 0.933
LysIle: 4.713 ± 1.668
LysLys: 3.928 ± 1.614
LysLeu: 1.571 ± 0.862
LysMet: 0.0 ± 0.0
LysAsn: 4.713 ± 1.765
LysPro: 2.357 ± 0.987
LysGln: 1.571 ± 1.052
LysArg: 4.713 ± 2.673
LysSer: 6.284 ± 1.675
LysThr: 0.786 ± 0.607
LysVal: 3.928 ± 2.143
LysTrp: 0.0 ± 0.0
LysTyr: 4.713 ± 1.737
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.357 ± 1.69
LeuCys: 1.571 ± 1.214
LeuAsp: 7.07 ± 2.084
LeuGlu: 7.855 ± 1.903
LeuPhe: 0.786 ± 0.751
LeuGly: 5.499 ± 1.903
LeuHis: 2.357 ± 1.227
LeuIle: 3.928 ± 2.104
LeuLys: 5.499 ± 1.447
LeuLeu: 6.284 ± 2.692
LeuMet: 0.786 ± 0.751
LeuAsn: 6.284 ± 1.503
LeuPro: 1.571 ± 0.792
LeuGln: 3.142 ± 1.163
LeuArg: 7.07 ± 2.798
LeuSer: 3.928 ± 1.731
LeuThr: 3.928 ± 1.353
LeuVal: 3.142 ± 1.049
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.928 ± 1.702
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.142 ± 1.256
MetCys: 0.786 ± 0.828
MetAsp: 1.571 ± 1.118
MetGlu: 0.0 ± 0.0
MetPhe: 1.571 ± 1.052
MetGly: 0.786 ± 0.607
MetHis: 0.0 ± 0.0
MetIle: 1.571 ± 1.503
MetLys: 0.0 ± 0.0
MetLeu: 1.571 ± 1.145
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 0.786 ± 0.789
MetSer: 3.142 ± 1.064
MetThr: 0.0 ± 0.0
MetVal: 0.786 ± 0.751
MetTrp: 0.786 ± 0.799
MetTyr: 2.357 ± 2.479
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.499 ± 2.787
AsnCys: 1.571 ± 0.792
AsnAsp: 2.357 ± 1.099
AsnGlu: 2.357 ± 1.11
AsnPhe: 2.357 ± 1.271
AsnGly: 2.357 ± 0.921
AsnHis: 5.499 ± 2.856
AsnIle: 2.357 ± 1.034
AsnLys: 0.786 ± 0.607
AsnLeu: 4.713 ± 2.039
AsnMet: 0.0 ± 0.792
AsnAsn: 4.713 ± 1.145
AsnPro: 3.142 ± 1.154
AsnGln: 3.142 ± 1.585
AsnArg: 0.0 ± 0.0
AsnSer: 5.499 ± 2.503
AsnThr: 3.142 ± 1.246
AsnVal: 3.928 ± 1.286
AsnTrp: 0.0 ± 0.0
AsnTyr: 4.713 ± 1.741
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.928 ± 1.163
ProCys: 1.571 ± 1.052
ProAsp: 2.357 ± 1.676
ProGlu: 0.786 ± 0.607
ProPhe: 1.571 ± 0.868
ProGly: 1.571 ± 0.848
ProHis: 3.928 ± 2.419
ProIle: 3.142 ± 1.124
ProLys: 3.142 ± 1.856
ProLeu: 3.928 ± 1.843
ProMet: 1.571 ± 1.042
ProAsn: 1.571 ± 0.914
ProPro: 1.571 ± 1.214
ProGln: 3.142 ± 1.963
ProArg: 5.499 ± 2.319
ProSer: 3.928 ± 2.854
ProThr: 7.07 ± 1.944
ProVal: 3.928 ± 2.117
ProTrp: 0.0 ± 0.0
ProTyr: 1.571 ± 0.848
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.571 ± 1.087
GlnCys: 0.0 ± 0.0
GlnAsp: 1.571 ± 1.579
GlnGlu: 1.571 ± 0.862
GlnPhe: 0.786 ± 0.607
GlnGly: 1.571 ± 0.793
GlnHis: 3.928 ± 1.973
GlnIle: 2.357 ± 1.281
GlnLys: 1.571 ± 1.145
GlnLeu: 2.357 ± 0.884
GlnMet: 0.0 ± 0.0
GlnAsn: 4.713 ± 1.666
GlnPro: 4.713 ± 2.733
GlnGln: 2.357 ± 1.836
GlnArg: 2.357 ± 1.222
GlnSer: 3.928 ± 1.58
GlnThr: 3.928 ± 2.15
GlnVal: 4.713 ± 2.242
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.786 ± 0.607
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.142 ± 1.327
ArgCys: 1.571 ± 1.052
ArgAsp: 5.499 ± 1.954
ArgGlu: 3.142 ± 1.176
ArgPhe: 4.713 ± 1.608
ArgGly: 3.928 ± 2.464
ArgHis: 2.357 ± 1.264
ArgIle: 3.928 ± 1.272
ArgLys: 3.928 ± 1.127
ArgLeu: 3.142 ± 1.408
ArgMet: 3.142 ± 2.472
ArgAsn: 0.0 ± 0.0
ArgPro: 7.855 ± 2.349
ArgGln: 0.786 ± 0.799
ArgArg: 6.284 ± 3.65
ArgSer: 3.142 ± 1.733
ArgThr: 3.928 ± 1.267
ArgVal: 3.142 ± 1.86
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.571 ± 1.159
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.713 ± 2.889
SerCys: 1.571 ± 1.503
SerAsp: 2.357 ± 1.037
SerGlu: 3.928 ± 1.146
SerPhe: 3.142 ± 1.008
SerGly: 3.142 ± 1.927
SerHis: 1.571 ± 1.025
SerIle: 3.928 ± 1.207
SerLys: 4.713 ± 2.6
SerLeu: 7.855 ± 2.493
SerMet: 2.357 ± 1.453
SerAsn: 3.928 ± 1.603
SerPro: 7.07 ± 2.252
SerGln: 2.357 ± 1.07
SerArg: 3.142 ± 1.121
SerSer: 8.641 ± 3.599
SerThr: 10.212 ± 4.197
SerVal: 2.357 ± 1.745
SerTrp: 0.786 ± 0.826
SerTyr: 2.357 ± 1.281
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.928 ± 2.179
ThrCys: 0.0 ± 0.0
ThrAsp: 0.0 ± 0.0
ThrGlu: 1.571 ± 1.146
ThrPhe: 1.571 ± 0.793
ThrGly: 4.713 ± 1.962
ThrHis: 5.499 ± 2.803
ThrIle: 1.571 ± 0.862
ThrLys: 1.571 ± 0.792
ThrLeu: 4.713 ± 1.878
ThrMet: 0.0 ± 0.0
ThrAsn: 6.284 ± 1.71
ThrPro: 2.357 ± 1.037
ThrGln: 2.357 ± 1.459
ThrArg: 6.284 ± 1.753
ThrSer: 8.641 ± 2.347
ThrThr: 3.928 ± 2.636
ThrVal: 4.713 ± 1.177
ThrTrp: 2.357 ± 1.495
ThrTyr: 2.357 ± 1.037
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 3.142 ± 0.984
ValGlu: 2.357 ± 1.347
ValPhe: 3.142 ± 2.083
ValGly: 2.357 ± 1.762
ValHis: 3.142 ± 2.612
ValIle: 3.928 ± 1.368
ValLys: 3.928 ± 1.58
ValLeu: 6.284 ± 2.918
ValMet: 1.571 ± 0.848
ValAsn: 0.0 ± 0.0
ValPro: 5.499 ± 2.429
ValGln: 5.499 ± 2.638
ValArg: 4.713 ± 3.449
ValSer: 4.713 ± 1.906
ValThr: 1.571 ± 1.052
ValVal: 3.142 ± 1.479
ValTrp: 1.571 ± 0.848
ValTyr: 2.357 ± 1.576
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.142 ± 1.292
TrpCys: 0.0 ± 0.0
TrpAsp: 0.786 ± 0.799
TrpGlu: 0.786 ± 0.776
TrpPhe: 0.0 ± 0.0
TrpGly: 0.786 ± 0.607
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.786 ± 0.826
TrpAsn: 0.786 ± 0.776
TrpPro: 0.0 ± 0.0
TrpGln: 0.786 ± 0.607
TrpArg: 0.786 ± 0.789
TrpSer: 0.786 ± 0.789
TrpThr: 1.571 ± 1.118
TrpVal: 0.786 ± 0.607
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.571 ± 0.793
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.357 ± 1.889
TyrCys: 0.0 ± 0.0
TyrAsp: 0.786 ± 0.827
TyrGlu: 0.786 ± 0.799
TyrPhe: 3.142 ± 1.057
TyrGly: 0.786 ± 0.607
TyrHis: 0.786 ± 0.607
TyrIle: 3.142 ± 1.814
TyrLys: 0.786 ± 0.826
TyrLeu: 4.713 ± 1.553
TyrMet: 1.571 ± 1.049
TyrAsn: 3.928 ± 1.368
TyrPro: 1.571 ± 0.793
TyrGln: 2.357 ± 1.338
TyrArg: 2.357 ± 2.479
TyrSer: 1.571 ± 1.214
TyrThr: 1.571 ± 1.098
TyrVal: 2.357 ± 1.078
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.786 ± 0.852
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1274 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
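A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's own code: the function names and the seed are hypothetical, one-letter residue codes are used for brevity, and it assumes that the reported error is the standard deviation of the frequency across 100 resampled proteomes, which the note above does not specify.

```python
import random
import statistics
from collections import Counter


def dipeptide_freqs(proteins):
    """Pooled dipeptide frequencies in per mille (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of one dipeptide frequency under protein-level resampling.

    Assumption: the error is the standard deviation over the bootstrap rounds.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freqs(sample).get(dipeptide, 0.0))
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide that is absent from every protein always yields 0.0 ± 0.0 under this scheme.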

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski