Amino acid dipeptide frequency for Tomato leaf curl Bangalore virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25%. This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
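To make the counting concrete, here is a minimal Python sketch of how a dipeptide frequency and the uniform expectation could be computed. It is an illustration only, not the procedure used to build this page: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, one-letter residue codes are used instead of the three-letter codes in the table, and pairs are counted within each sequence (the page does not state how protein boundaries are handled).

```python
from collections import Counter

# 20 standard residues in one-letter code (assumption: Xaa is ignored here)
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs: positions (0,1), (1,2), ... within one sequence
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1 of 400 possible dipeptides = 2.5 per mille = 0.25%
expected_uniform = 1000.0 / (len(AMINO_ACIDS) ** 2)

# Toy input, not taken from the proteome described on this page
freqs = dipeptide_per_mille(["MKKLA", "AKKA"])
```

In this toy input there are seven overlapping pairs, two of which are `KK`, so `freqs["KK"]` is 2/7 expressed in per mille.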

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.
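As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the value 7.306 is the AlaAla entry from the table, and 2.5‰ is simply the 0.25% uniform expectation mentioned above expressed in the same unit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value / 10.0


ala_ala = 7.306  # AlaAla frequency in per mille, taken from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)

# The uniform expectation of 0.25% corresponds to 2.5 per mille
uniform_percent = per_mille_to_percent(2.5)
```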

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.306 ± 3.433
AlaCys: 3.653 ± 2.095
AlaAsp: 1.826 ± 1.248
AlaGlu: 2.74 ± 2.032
AlaPhe: 0.0 ± 0.0
AlaGly: 0.913 ± 0.843
AlaHis: 1.826 ± 1.602
AlaIle: 3.653 ± 1.409
AlaLys: 3.653 ± 1.409
AlaLeu: 5.479 ± 1.541
AlaMet: 0.0 ± 0.0
AlaAsn: 2.74 ± 1.18
AlaPro: 1.826 ± 1.059
AlaGln: 2.74 ± 1.53
AlaArg: 2.74 ± 1.298
AlaSer: 4.566 ± 1.533
AlaThr: 4.566 ± 2.183
AlaVal: 2.74 ± 1.563
AlaTrp: 0.913 ± 0.677
AlaTyr: 2.74 ± 1.426
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.826 ± 1.602
CysAsp: 0.0 ± 0.0
CysGlu: 0.913 ± 0.843
CysPhe: 0.913 ± 1.031
CysGly: 1.826 ± 1.059
CysHis: 0.0 ± 0.0
CysIle: 0.913 ± 0.843
CysLys: 0.913 ± 0.843
CysLeu: 0.0 ± 0.0
CysMet: 0.913 ± 0.801
CysAsn: 1.826 ± 1.059
CysPro: 1.826 ± 1.602
CysGln: 0.0 ± 0.0
CysArg: 0.913 ± 0.677
CysSer: 4.566 ± 2.147
CysThr: 1.826 ± 0.769
CysVal: 0.913 ± 0.843
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.74 ± 2.032
AspCys: 0.0 ± 0.0
AspAsp: 1.826 ± 1.059
AspGlu: 1.826 ± 0.769
AspPhe: 4.566 ± 1.614
AspGly: 2.74 ± 2.032
AspHis: 2.74 ± 1.513
AspIle: 2.74 ± 0.879
AspLys: 1.826 ± 0.769
AspLeu: 5.479 ± 2.284
AspMet: 0.913 ± 0.843
AspAsn: 1.826 ± 1.144
AspPro: 1.826 ± 0.839
AspGln: 2.74 ± 2.278
AspArg: 2.74 ± 1.465
AspSer: 4.566 ± 1.32
AspThr: 1.826 ± 1.059
AspVal: 5.479 ± 2.179
AspTrp: 1.826 ± 1.059
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.74 ± 0.939
GluCys: 0.0 ± 0.0
GluAsp: 0.913 ± 0.677
GluGlu: 2.74 ± 1.369
GluPhe: 3.653 ± 1.197
GluGly: 4.566 ± 2.397
GluHis: 0.913 ± 1.031
GluIle: 0.913 ± 1.031
GluLys: 1.826 ± 1.355
GluLeu: 2.74 ± 1.298
GluMet: 0.0 ± 0.0
GluAsn: 6.393 ± 2.095
GluPro: 2.74 ± 0.879
GluGln: 1.826 ± 1.15
GluArg: 2.74 ± 1.245
GluSer: 2.74 ± 1.389
GluThr: 1.826 ± 1.45
GluVal: 2.74 ± 1.327
GluTrp: 1.826 ± 1.144
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.913 ± 0.843
PheAsp: 2.74 ± 1.18
PheGlu: 1.826 ± 0.769
PhePhe: 1.826 ± 0.769
PheGly: 1.826 ± 0.769
PheHis: 1.826 ± 1.355
PheIle: 4.566 ± 1.938
PheLys: 2.74 ± 1.96
PheLeu: 9.132 ± 3.551
PheMet: 0.913 ± 1.043
PheAsn: 3.653 ± 1.754
PhePro: 1.826 ± 1.45
PheGln: 2.74 ± 1.483
PheArg: 3.653 ± 2.09
PheSer: 1.826 ± 1.455
PheThr: 1.826 ± 1.059
PheVal: 1.826 ± 0.769
PheTrp: 0.0 ± 0.0
PheTyr: 1.826 ± 1.686
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.913 ± 0.677
GlyCys: 1.826 ± 1.144
GlyAsp: 4.566 ± 1.676
GlyGlu: 2.74 ± 1.245
GlyPhe: 1.826 ± 1.241
GlyGly: 2.74 ± 1.18
GlyHis: 2.74 ± 1.245
GlyIle: 1.826 ± 1.043
GlyLys: 8.219 ± 3.593
GlyLeu: 1.826 ± 1.144
GlyMet: 0.913 ± 0.801
GlyAsn: 1.826 ± 1.455
GlyPro: 6.393 ± 1.973
GlyGln: 2.74 ± 0.939
GlyArg: 2.74 ± 1.298
GlySer: 1.826 ± 1.355
GlyThr: 3.653 ± 2.287
GlyVal: 2.74 ± 1.96
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.913 ± 0.801
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.826 ± 1.686
HisCys: 1.826 ± 1.241
HisAsp: 0.913 ± 0.843
HisGlu: 0.913 ± 0.801
HisPhe: 3.653 ± 1.33
HisGly: 1.826 ± 1.241
HisHis: 5.479 ± 3.225
HisIle: 2.74 ± 1.563
HisLys: 1.826 ± 1.261
HisLeu: 2.74 ± 2.032
HisMet: 0.913 ± 1.176
HisAsn: 3.653 ± 2.086
HisPro: 0.913 ± 0.677
HisGln: 0.0 ± 0.0
HisArg: 3.653 ± 2.287
HisSer: 2.74 ± 1.957
HisThr: 0.913 ± 0.843
HisVal: 4.566 ± 1.994
HisTrp: 0.913 ± 0.677
HisTyr: 0.913 ± 0.677
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 0.0 ± 0.0
IleAsp: 2.74 ± 2.032
IleGlu: 0.913 ± 0.677
IlePhe: 3.653 ± 1.697
IleGly: 0.913 ± 0.843
IleHis: 1.826 ± 1.15
IleIle: 2.74 ± 1.734
IleLys: 4.566 ± 0.991
IleLeu: 1.826 ± 1.2
IleMet: 0.0 ± 0.0
IleAsn: 2.74 ± 1.426
IlePro: 1.826 ± 1.059
IleGln: 7.306 ± 2.928
IleArg: 3.653 ± 1.134
IleSer: 8.219 ± 2.392
IleThr: 4.566 ± 3.242
IleVal: 1.826 ± 0.769
IleTrp: 2.74 ± 2.015
IleTyr: 1.826 ± 1.15
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.566 ± 2.583
LysCys: 1.826 ± 1.043
LysAsp: 1.826 ± 1.355
LysGlu: 6.393 ± 1.973
LysPhe: 1.826 ± 1.043
LysGly: 6.393 ± 2.958
LysHis: 0.913 ± 0.677
LysIle: 2.74 ± 1.734
LysLys: 1.826 ± 1.455
LysLeu: 2.74 ± 2.278
LysMet: 0.0 ± 0.0
LysAsn: 4.566 ± 1.873
LysPro: 2.74 ± 1.465
LysGln: 0.913 ± 0.677
LysArg: 2.74 ± 2.528
LysSer: 5.479 ± 1.243
LysThr: 3.653 ± 1.197
LysVal: 3.653 ± 2.502
LysTrp: 0.913 ± 0.843
LysTyr: 3.653 ± 1.575
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.566 ± 3.076
LeuCys: 2.74 ± 2.032
LeuAsp: 4.566 ± 2.666
LeuGlu: 4.566 ± 2.393
LeuPhe: 1.826 ± 1.059
LeuGly: 5.479 ± 1.724
LeuHis: 3.653 ± 1.973
LeuIle: 4.566 ± 2.696
LeuLys: 4.566 ± 1.676
LeuLeu: 0.913 ± 0.843
LeuMet: 1.826 ± 1.063
LeuAsn: 2.74 ± 1.105
LeuPro: 1.826 ± 1.455
LeuGln: 3.653 ± 1.678
LeuArg: 5.479 ± 2.796
LeuSer: 4.566 ± 1.557
LeuThr: 4.566 ± 0.991
LeuVal: 3.653 ± 1.575
LeuTrp: 0.913 ± 1.031
LeuTyr: 2.74 ± 0.879
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.826 ± 1.248
MetCys: 0.913 ± 0.843
MetAsp: 1.826 ± 1.15
MetGlu: 0.0 ± 0.0
MetPhe: 4.566 ± 3.296
MetGly: 2.74 ± 1.327
MetHis: 0.913 ± 0.843
MetIle: 0.913 ± 0.843
MetLys: 0.0 ± 0.0
MetLeu: 1.826 ± 1.182
MetMet: 0.0 ± 0.0
MetAsn: 0.913 ± 0.843
MetPro: 1.826 ± 1.2
MetGln: 0.913 ± 0.981
MetArg: 0.0 ± 0.0
MetSer: 1.826 ± 1.248
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 2.74 ± 1.327
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.479 ± 2.231
AsnCys: 0.0 ± 0.0
AsnAsp: 3.653 ± 1.409
AsnGlu: 0.913 ± 0.843
AsnPhe: 0.0 ± 0.0
AsnGly: 0.913 ± 1.031
AsnHis: 2.74 ± 1.734
AsnIle: 3.653 ± 1.067
AsnLys: 0.0 ± 0.0
AsnLeu: 6.393 ± 2.977
AsnMet: 2.74 ± 2.398
AsnAsn: 4.566 ± 1.735
AsnPro: 2.74 ± 0.879
AsnGln: 2.74 ± 0.939
AsnArg: 6.393 ± 2.172
AsnSer: 4.566 ± 1.096
AsnThr: 2.74 ± 1.142
AsnVal: 3.653 ± 2.086
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.74 ± 1.18
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.826 ± 1.144
ProCys: 0.913 ± 0.843
ProAsp: 2.74 ± 1.523
ProGlu: 1.826 ± 0.839
ProPhe: 1.826 ± 1.043
ProGly: 1.826 ± 1.355
ProHis: 3.653 ± 1.893
ProIle: 3.653 ± 1.272
ProLys: 2.74 ± 1.553
ProLeu: 4.566 ± 2.014
ProMet: 2.74 ± 1.776
ProAsn: 3.653 ± 2.119
ProPro: 0.0 ± 0.0
ProGln: 2.74 ± 1.388
ProArg: 4.566 ± 1.184
ProSer: 5.479 ± 1.702
ProThr: 4.566 ± 1.957
ProVal: 4.566 ± 2.165
ProTrp: 0.0 ± 0.0
ProTyr: 0.913 ± 0.843
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.566 ± 2.304
GlnCys: 0.913 ± 0.677
GlnAsp: 2.74 ± 1.389
GlnGlu: 1.826 ± 0.769
GlnPhe: 2.74 ± 1.426
GlnGly: 0.913 ± 1.176
GlnHis: 1.826 ± 1.455
GlnIle: 3.653 ± 1.697
GlnLys: 0.913 ± 0.677
GlnLeu: 3.653 ± 1.46
GlnMet: 1.826 ± 1.455
GlnAsn: 0.913 ± 1.031
GlnPro: 5.479 ± 2.194
GlnGln: 3.653 ± 1.567
GlnArg: 1.826 ± 0.839
GlnSer: 6.393 ± 1.265
GlnThr: 2.74 ± 1.53
GlnVal: 2.74 ± 1.311
GlnTrp: 0.913 ± 0.677
GlnTyr: 0.913 ± 0.843
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.479 ± 2.251
ArgCys: 0.913 ± 0.801
ArgAsp: 4.566 ± 1.449
ArgGlu: 2.74 ± 1.483
ArgPhe: 2.74 ± 1.18
ArgGly: 5.479 ± 2.952
ArgHis: 3.653 ± 1.057
ArgIle: 3.653 ± 1.37
ArgLys: 3.653 ± 1.894
ArgLeu: 1.826 ± 1.182
ArgMet: 0.913 ± 0.843
ArgAsn: 0.0 ± 0.0
ArgPro: 8.219 ± 1.863
ArgGln: 0.913 ± 1.176
ArgArg: 4.566 ± 3.052
ArgSer: 3.653 ± 1.893
ArgThr: 1.826 ± 1.043
ArgVal: 5.479 ± 1.63
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.826 ± 1.182
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.653 ± 2.71
SerCys: 0.0 ± 0.0
SerAsp: 2.74 ± 0.943
SerGlu: 3.653 ± 1.937
SerPhe: 2.74 ± 0.943
SerGly: 3.653 ± 1.134
SerHis: 0.0 ± 0.0
SerIle: 6.393 ± 3.787
SerLys: 10.046 ± 2.368
SerLeu: 6.393 ± 2.886
SerMet: 2.74 ± 1.289
SerAsn: 6.393 ± 2.096
SerPro: 7.306 ± 2.111
SerGln: 4.566 ± 2.077
SerArg: 3.653 ± 1.22
SerSer: 10.959 ± 4.529
SerThr: 5.479 ± 1.944
SerVal: 4.566 ± 2.64
SerTrp: 0.0 ± 0.0
SerTyr: 3.653 ± 1.272
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.74 ± 1.311
ThrCys: 0.913 ± 1.176
ThrAsp: 0.913 ± 1.176
ThrGlu: 2.74 ± 2.485
ThrPhe: 1.826 ± 1.2
ThrGly: 4.566 ± 1.932
ThrHis: 5.479 ± 1.84
ThrIle: 1.826 ± 1.059
ThrLys: 2.74 ± 1.465
ThrLeu: 1.826 ± 0.769
ThrMet: 0.913 ± 0.677
ThrAsn: 3.653 ± 1.575
ThrPro: 1.826 ± 1.355
ThrGln: 2.74 ± 1.927
ThrArg: 2.74 ± 0.943
ThrSer: 4.566 ± 1.641
ThrThr: 1.826 ± 1.648
ThrVal: 5.479 ± 1.222
ThrTrp: 0.913 ± 1.176
ThrTyr: 1.826 ± 0.839
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.913 ± 0.677
ValAsp: 5.479 ± 1.386
ValGlu: 2.74 ± 1.105
ValPhe: 3.653 ± 1.754
ValGly: 2.74 ± 1.957
ValHis: 1.826 ± 1.261
ValIle: 2.74 ± 1.426
ValLys: 5.479 ± 2.93
ValLeu: 6.393 ± 1.674
ValMet: 1.826 ± 1.686
ValAsn: 1.826 ± 1.15
ValPro: 2.74 ± 0.943
ValGln: 6.393 ± 1.737
ValArg: 3.653 ± 2.677
ValSer: 4.566 ± 1.184
ValThr: 2.74 ± 2.528
ValVal: 3.653 ± 0.975
ValTrp: 0.0 ± 0.0
ValTyr: 6.393 ± 1.637
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.653 ± 1.763
TrpCys: 0.0 ± 0.0
TrpAsp: 0.913 ± 0.801
TrpGlu: 0.913 ± 1.031
TrpPhe: 0.913 ± 1.176
TrpGly: 0.0 ± 0.0
TrpHis: 0.913 ± 0.843
TrpIle: 0.0 ± 0.0
TrpLys: 0.913 ± 1.176
TrpLeu: 0.0 ± 0.0
TrpMet: 0.913 ± 0.843
TrpAsn: 0.913 ± 1.031
TrpPro: 0.0 ± 0.0
TrpGln: 0.913 ± 0.677
TrpArg: 0.913 ± 0.981
TrpSer: 0.913 ± 0.981
TrpThr: 0.913 ± 1.031
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.913 ± 0.677
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.74 ± 1.465
TyrCys: 0.0 ± 0.0
TyrAsp: 2.74 ± 1.834
TyrGlu: 0.913 ± 0.843
TyrPhe: 3.653 ± 0.932
TyrGly: 0.913 ± 0.677
TyrHis: 0.0 ± 0.0
TyrIle: 0.0 ± 0.0
TyrLys: 0.913 ± 0.677
TyrLeu: 3.653 ± 1.409
TyrMet: 1.826 ± 1.11
TyrAsn: 1.826 ± 0.769
TyrPro: 0.913 ± 0.677
TyrGln: 0.913 ± 0.843
TyrArg: 2.74 ± 1.734
TyrSer: 4.566 ± 0.959
TyrThr: 0.0 ± 0.0
TyrVal: 5.479 ± 1.222
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1096 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
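A protein-level bootstrap of this kind can be sketched as follows. This is only an assumption about the general technique, not the exact Proteome-pI procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the toy sequences, the one-letter codes and the use of the population standard deviation are all choices made for this illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Frequency (per mille) of each overlapping dipeptide within the sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)


# Toy proteome, not the six proteins behind the table above
toy_proteins = ["MKKLA", "AKKA", "GGGG"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

A dipeptide that never occurs in any protein has a frequency of 0 in every resample and therefore an error of 0, which matches the `0.0 ± 0.0` entries in the table.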

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski