Amino acid dipeptide frequency for Tomato leaf curl Laos virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
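The counting behind such a table can be sketched as follows. This is a minimal illustration, not the database's actual code: the function name `dipeptide_frequencies`, the toy sequences, and the choice to count overlapping pairs only within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the standard alphabet to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs inside one protein only (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not sequences from this proteome.
freqs = dipeptide_frequencies(["MSSAAK", "SSPRX"])

# Uniform expectation: 1/400 for 20 amino acids, 1/441 when Xaa is included.
expected_20 = 1000.0 / 20 ** 2
expected_21 = 1000.0 / 21 ** 2
```

The returned dictionary always has 441 entries, so dipeptides that never occur appear with a frequency of 0.0, as in the table.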

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
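As a small worked example of the unit conversion and of the comparison against the uniform expectation of 0.25%, the sketch below uses two values from the table (SerSer and AlaCys). The helper names and the "none" label for a value exactly equal to the expectation are assumptions for illustration only.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Return the colour used for over- or underrepresented dipeptides."""
    if value > expected:
        return "red"    # more frequent than expected
    if value < expected:
        return "blue"   # underrepresented
    return "none"       # exactly at the expectation (assumed label)


ser_ser = 10.939  # SerSer, taken from the table
ala_cys = 0.912   # AlaCys, taken from the table

ser_ser_fraction = per_mille_to_fraction(ser_ser)
ser_ser_colour = classify(ser_ser)
ala_cys_colour = classify(ala_cys)
```

Here SerSer, at 10.939‰, is roughly four times the uniform expectation, while AlaCys, at 0.912‰, is well below it.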

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.646AlaAla: 3.646 ± 1.48
0.912AlaCys: 0.912 ± 0.757
0.912AlaAsp: 0.912 ± 0.757
2.735AlaGlu: 2.735 ± 1.27
0.912AlaPhe: 0.912 ± 0.825
0.912AlaGly: 0.912 ± 0.624
2.735AlaHis: 2.735 ± 1.237
0.912AlaIle: 0.912 ± 1.114
5.469AlaLys: 5.469 ± 1.475
6.381AlaLeu: 6.381 ± 1.972
0.912AlaMet: 0.912 ± 0.757
1.823AlaAsn: 1.823 ± 1.248
1.823AlaPro: 1.823 ± 1.248
3.646AlaGln: 3.646 ± 1.367
3.646AlaArg: 3.646 ± 1.812
4.558AlaSer: 4.558 ± 2.424
3.646AlaThr: 3.646 ± 2.227
1.823AlaVal: 1.823 ± 0.74
1.823AlaTrp: 1.823 ± 0.74
1.823AlaTyr: 1.823 ± 1.11
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.823CysCys: 1.823 ± 1.948
0.912CysAsp: 0.912 ± 0.974
0.912CysGlu: 0.912 ± 0.757
0.912CysPhe: 0.912 ± 1.114
1.823CysGly: 1.823 ± 0.847
0.912CysHis: 0.912 ± 0.825
0.0CysIle: 0.0 ± 0.0
0.912CysLys: 0.912 ± 0.757
0.912CysLeu: 0.912 ± 1.15
1.823CysMet: 1.823 ± 1.451
1.823CysAsn: 1.823 ± 0.847
3.646CysPro: 3.646 ± 2.022
0.912CysGln: 0.912 ± 1.114
0.912CysArg: 0.912 ± 0.624
1.823CysSer: 1.823 ± 0.847
2.735CysThr: 2.735 ± 0.81
1.823CysVal: 1.823 ± 1.514
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.646AspAla: 3.646 ± 1.884
0.0AspCys: 0.0 ± 0.0
0.912AspAsp: 0.912 ± 0.624
1.823AspGlu: 1.823 ± 0.74
0.912AspPhe: 0.912 ± 0.757
3.646AspGly: 3.646 ± 1.769
0.0AspHis: 0.0 ± 0.0
1.823AspIle: 1.823 ± 1.155
0.0AspLys: 0.0 ± 0.0
5.469AspLeu: 5.469 ± 2.125
0.912AspMet: 0.912 ± 0.757
2.735AspAsn: 2.735 ± 1.567
1.823AspPro: 1.823 ± 1.065
0.912AspGln: 0.912 ± 0.624
3.646AspArg: 3.646 ± 1.317
4.558AspSer: 4.558 ± 1.701
3.646AspThr: 3.646 ± 1.0
4.558AspVal: 4.558 ± 2.056
3.646AspTrp: 3.646 ± 1.769
0.912AspTyr: 0.912 ± 0.624
0.0AspXaa: 0.0 ± 0.0
Glu
4.558GluAla: 4.558 ± 1.644
0.0GluCys: 0.0 ± 0.0
2.735GluAsp: 2.735 ± 2.157
5.469GluGlu: 5.469 ± 2.942
3.646GluPhe: 3.646 ± 1.96
3.646GluGly: 3.646 ± 1.124
0.912GluHis: 0.912 ± 1.114
0.0GluIle: 0.0 ± 0.0
1.823GluLys: 1.823 ± 1.248
4.558GluLeu: 4.558 ± 1.837
0.0GluMet: 0.0 ± 0.0
3.646GluAsn: 3.646 ± 2.075
2.735GluPro: 2.735 ± 0.954
2.735GluGln: 2.735 ± 1.517
0.912GluArg: 0.912 ± 1.114
2.735GluSer: 2.735 ± 1.205
3.646GluThr: 3.646 ± 2.346
2.735GluVal: 2.735 ± 1.302
1.823GluTrp: 1.823 ± 0.847
0.912GluTyr: 0.912 ± 0.624
0.0GluXaa: 0.0 ± 0.0
Phe
0.912PheAla: 0.912 ± 0.624
1.823PheCys: 1.823 ± 1.026
2.735PheAsp: 2.735 ± 1.141
1.823PheGlu: 1.823 ± 1.11
2.735PhePhe: 2.735 ± 0.954
1.823PheGly: 1.823 ± 1.514
1.823PheHis: 1.823 ± 1.248
1.823PheIle: 1.823 ± 0.847
2.735PheLys: 2.735 ± 2.135
5.469PheLeu: 5.469 ± 1.302
1.823PheMet: 1.823 ± 0.74
2.735PheAsn: 2.735 ± 2.139
1.823PhePro: 1.823 ± 1.451
1.823PheGln: 1.823 ± 0.847
2.735PheArg: 2.735 ± 2.383
2.735PheSer: 2.735 ± 1.205
2.735PheThr: 2.735 ± 1.689
0.912PheVal: 0.912 ± 0.624
0.0PheTrp: 0.0 ± 0.0
1.823PheTyr: 1.823 ± 1.026
0.0PheXaa: 0.0 ± 0.0
Gly
3.646GlyAla: 3.646 ± 1.367
1.823GlyCys: 1.823 ± 1.017
1.823GlyAsp: 1.823 ± 0.847
3.646GlyGlu: 3.646 ± 1.367
0.912GlyPhe: 0.912 ± 0.825
2.735GlyGly: 2.735 ± 1.141
1.823GlyHis: 1.823 ± 1.11
1.823GlyIle: 1.823 ± 1.11
5.469GlyLys: 5.469 ± 2.221
2.735GlyLeu: 2.735 ± 1.47
0.0GlyMet: 0.0 ± 0.0
1.823GlyAsn: 1.823 ± 1.389
3.646GlyPro: 3.646 ± 1.029
3.646GlyGln: 3.646 ± 1.652
1.823GlyArg: 1.823 ± 1.248
1.823GlySer: 1.823 ± 1.248
5.469GlyThr: 5.469 ± 1.612
0.912GlyVal: 0.912 ± 1.114
0.0GlyTrp: 0.0 ± 0.0
0.912GlyTyr: 0.912 ± 0.974
0.0GlyXaa: 0.0 ± 0.0
His
0.912HisAla: 0.912 ± 0.757
2.735HisCys: 2.735 ± 1.951
1.823HisAsp: 1.823 ± 1.155
0.912HisGlu: 0.912 ± 0.624
3.646HisPhe: 3.646 ± 1.568
1.823HisGly: 1.823 ± 1.303
0.912HisHis: 0.912 ± 0.825
1.823HisIle: 1.823 ± 1.462
0.912HisLys: 0.912 ± 0.974
3.646HisLeu: 3.646 ± 1.465
0.0HisMet: 0.0 ± 0.0
4.558HisAsn: 4.558 ± 1.76
2.735HisPro: 2.735 ± 1.376
2.735HisGln: 2.735 ± 0.954
3.646HisArg: 3.646 ± 2.456
0.0HisSer: 0.0 ± 0.0
1.823HisThr: 1.823 ± 1.514
3.646HisVal: 3.646 ± 1.049
0.0HisTrp: 0.0 ± 0.0
0.912HisTyr: 0.912 ± 0.624
0.0HisXaa: 0.0 ± 0.0
Ile
0.912IleAla: 0.912 ± 0.825
1.823IleCys: 1.823 ± 1.11
2.735IleAsp: 2.735 ± 1.415
1.823IleGlu: 1.823 ± 1.11
1.823IlePhe: 1.823 ± 1.248
0.0IleGly: 0.0 ± 0.0
1.823IleHis: 1.823 ± 1.451
1.823IleIle: 1.823 ± 2.228
6.381IleLys: 6.381 ± 1.804
0.912IleLeu: 0.912 ± 0.825
0.912IleMet: 0.912 ± 0.832
3.646IleAsn: 3.646 ± 2.196
0.912IlePro: 0.912 ± 0.624
3.646IleGln: 3.646 ± 1.124
3.646IleArg: 3.646 ± 1.637
4.558IleSer: 4.558 ± 2.514
4.558IleThr: 4.558 ± 3.146
1.823IleVal: 1.823 ± 0.74
3.646IleTrp: 3.646 ± 2.254
1.823IleTyr: 1.823 ± 1.155
0.0IleXaa: 0.0 ± 0.0
Lys
2.735LysAla: 2.735 ± 1.415
0.0LysCys: 0.0 ± 0.0
2.735LysAsp: 2.735 ± 1.872
4.558LysGlu: 4.558 ± 2.268
2.735LysPhe: 2.735 ± 1.604
2.735LysGly: 2.735 ± 1.179
3.646LysHis: 3.646 ± 1.378
4.558LysIle: 4.558 ± 1.925
2.735LysLys: 2.735 ± 1.237
0.0LysLeu: 0.0 ± 0.0
0.0LysMet: 0.0 ± 0.0
5.469LysAsn: 5.469 ± 1.645
2.735LysPro: 2.735 ± 0.907
0.912LysGln: 0.912 ± 0.974
3.646LysArg: 3.646 ± 2.276
6.381LysSer: 6.381 ± 2.147
3.646LysThr: 3.646 ± 1.731
4.558LysVal: 4.558 ± 1.994
0.0LysTrp: 0.0 ± 0.0
6.381LysTyr: 6.381 ± 2.035
0.0LysXaa: 0.0 ± 0.0
Leu
1.823LeuAla: 1.823 ± 1.065
2.735LeuCys: 2.735 ± 1.141
6.381LeuAsp: 6.381 ± 2.359
2.735LeuGlu: 2.735 ± 1.872
1.823LeuPhe: 1.823 ± 1.303
3.646LeuGly: 3.646 ± 1.937
2.735LeuHis: 2.735 ± 1.415
2.735LeuIle: 2.735 ± 1.302
3.646LeuLys: 3.646 ± 1.029
5.469LeuLeu: 5.469 ± 2.94
0.912LeuMet: 0.912 ± 1.15
6.381LeuAsn: 6.381 ± 1.945
2.735LeuPro: 2.735 ± 2.474
4.558LeuGln: 4.558 ± 1.837
4.558LeuArg: 4.558 ± 2.663
4.558LeuSer: 4.558 ± 2.389
5.469LeuThr: 5.469 ± 2.259
3.646LeuVal: 3.646 ± 2.075
0.0LeuTrp: 0.0 ± 0.0
4.558LeuTyr: 4.558 ± 1.965
0.0LeuXaa: 0.0 ± 0.0
Met
0.912MetAla: 0.912 ± 0.757
0.0MetCys: 0.0 ± 0.0
3.646MetAsp: 3.646 ± 1.58
0.912MetGlu: 0.912 ± 1.15
1.823MetPhe: 1.823 ± 1.017
1.823MetGly: 1.823 ± 1.094
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.823MetLys: 1.823 ± 1.155
0.912MetLeu: 0.912 ± 0.974
0.912MetMet: 0.912 ± 0.967
0.0MetAsn: 0.0 ± 0.0
0.912MetPro: 0.912 ± 0.624
1.823MetGln: 1.823 ± 0.847
0.912MetArg: 0.912 ± 0.757
0.912MetSer: 0.912 ± 0.757
0.912MetThr: 0.912 ± 1.15
0.0MetVal: 0.0 ± 0.0
1.823MetTrp: 1.823 ± 1.065
3.646MetTyr: 3.646 ± 3.029
0.0MetXaa: 0.0 ± 0.0
Asn
1.823AsnAla: 1.823 ± 0.74
1.823AsnCys: 1.823 ± 0.847
2.735AsnAsp: 2.735 ± 1.141
1.823AsnGlu: 1.823 ± 1.026
0.912AsnPhe: 0.912 ± 0.757
0.912AsnGly: 0.912 ± 0.757
5.469AsnHis: 5.469 ± 2.654
2.735AsnIle: 2.735 ± 0.954
1.823AsnLys: 1.823 ± 0.74
4.558AsnLeu: 4.558 ± 1.76
1.823AsnMet: 1.823 ± 1.471
3.646AsnAsn: 3.646 ± 0.954
2.735AsnPro: 2.735 ± 0.954
1.823AsnGln: 1.823 ± 1.438
1.823AsnArg: 1.823 ± 1.094
7.293AsnSer: 7.293 ± 3.253
4.558AsnThr: 4.558 ± 1.288
6.381AsnVal: 6.381 ± 2.883
0.0AsnTrp: 0.0 ± 0.0
3.646AsnTyr: 3.646 ± 1.172
0.0AsnXaa: 0.0 ± 0.0
Pro
1.823ProAla: 1.823 ± 1.514
1.823ProCys: 1.823 ± 1.017
1.823ProAsp: 1.823 ± 1.026
2.735ProGlu: 2.735 ± 1.237
2.735ProPhe: 2.735 ± 0.954
2.735ProGly: 2.735 ± 1.261
3.646ProHis: 3.646 ± 1.96
4.558ProIle: 4.558 ± 2.802
5.469ProLys: 5.469 ± 1.455
5.469ProLeu: 5.469 ± 2.195
3.646ProMet: 3.646 ± 1.231
3.646ProAsn: 3.646 ± 1.741
1.823ProPro: 1.823 ± 1.065
2.735ProGln: 2.735 ± 1.261
3.646ProArg: 3.646 ± 1.049
3.646ProSer: 3.646 ± 1.659
4.558ProThr: 4.558 ± 2.268
0.912ProVal: 0.912 ± 0.757
0.912ProTrp: 0.912 ± 0.624
0.912ProTyr: 0.912 ± 0.757
0.0ProXaa: 0.0 ± 0.0
Gln
5.469GlnAla: 5.469 ± 1.184
2.735GlnCys: 2.735 ± 2.135
0.912GlnAsp: 0.912 ± 0.825
2.735GlnGlu: 2.735 ± 0.907
3.646GlnPhe: 3.646 ± 2.22
2.735GlnGly: 2.735 ± 1.302
0.912GlnHis: 0.912 ± 1.15
3.646GlnIle: 3.646 ± 2.22
2.735GlnLys: 2.735 ± 2.922
0.912GlnLeu: 0.912 ± 0.624
0.0GlnMet: 0.0 ± 0.0
4.558GlnAsn: 4.558 ± 1.8
3.646GlnPro: 3.646 ± 2.875
0.912GlnGln: 0.912 ± 0.624
3.646GlnArg: 3.646 ± 1.652
3.646GlnSer: 3.646 ± 1.172
4.558GlnThr: 4.558 ± 2.344
3.646GlnVal: 3.646 ± 1.32
0.0GlnTrp: 0.0 ± 0.0
0.912GlnTyr: 0.912 ± 0.757
0.0GlnXaa: 0.0 ± 0.0
Arg
3.646ArgAla: 3.646 ± 1.317
0.912ArgCys: 0.912 ± 0.974
3.646ArgAsp: 3.646 ± 1.48
4.558ArgGlu: 4.558 ± 1.882
2.735ArgPhe: 2.735 ± 1.141
2.735ArgGly: 2.735 ± 0.81
3.646ArgHis: 3.646 ± 2.006
3.646ArgIle: 3.646 ± 1.25
3.646ArgLys: 3.646 ± 2.227
4.558ArgLeu: 4.558 ± 2.268
1.823ArgMet: 1.823 ± 1.514
0.0ArgAsn: 0.0 ± 0.0
6.381ArgPro: 6.381 ± 1.561
0.912ArgGln: 0.912 ± 1.15
7.293ArgArg: 7.293 ± 3.782
5.469ArgSer: 5.469 ± 1.455
4.558ArgThr: 4.558 ± 2.667
3.646ArgVal: 3.646 ± 1.843
0.0ArgTrp: 0.0 ± 0.0
1.823ArgTyr: 1.823 ± 1.026
0.0ArgXaa: 0.0 ± 0.0
Ser
2.735SerAla: 2.735 ± 1.872
0.912SerCys: 0.912 ± 0.974
2.735SerAsp: 2.735 ± 0.81
4.558SerGlu: 4.558 ± 2.979
2.735SerPhe: 2.735 ± 1.261
2.735SerGly: 2.735 ± 1.415
2.735SerHis: 2.735 ± 1.257
5.469SerIle: 5.469 ± 1.759
4.558SerLys: 4.558 ± 2.284
1.823SerLeu: 1.823 ± 1.248
1.823SerMet: 1.823 ± 1.423
4.558SerAsn: 4.558 ± 1.496
8.204SerPro: 8.204 ± 2.227
4.558SerGln: 4.558 ± 2.344
6.381SerArg: 6.381 ± 2.196
10.939SerSer: 10.939 ± 3.839
4.558SerThr: 4.558 ± 3.795
2.735SerVal: 2.735 ± 1.592
0.0SerTrp: 0.0 ± 0.0
4.558SerTyr: 4.558 ± 0.955
0.0SerXaa: 0.0 ± 0.0
Thr
6.381ThrAla: 6.381 ± 2.705
0.912ThrCys: 0.912 ± 1.15
0.912ThrAsp: 0.912 ± 1.15
2.735ThrGlu: 2.735 ± 1.179
0.912ThrPhe: 0.912 ± 1.15
4.558ThrGly: 4.558 ± 1.935
4.558ThrHis: 4.558 ± 1.729
2.735ThrIle: 2.735 ± 1.205
4.558ThrLys: 4.558 ± 1.118
5.469ThrLeu: 5.469 ± 2.259
1.823ThrMet: 1.823 ± 1.178
3.646ThrAsn: 3.646 ± 1.48
6.381ThrPro: 6.381 ± 2.376
2.735ThrGln: 2.735 ± 1.657
3.646ThrArg: 3.646 ± 0.954
3.646ThrSer: 3.646 ± 1.0
0.912ThrThr: 0.912 ± 1.15
8.204ThrVal: 8.204 ± 3.116
0.912ThrTrp: 0.912 ± 1.15
1.823ThrTyr: 1.823 ± 1.065
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.912ValCys: 0.912 ± 0.624
1.823ValAsp: 1.823 ± 0.847
0.0ValGlu: 0.0 ± 0.0
3.646ValPhe: 3.646 ± 1.937
3.646ValGly: 3.646 ± 1.616
0.912ValHis: 0.912 ± 0.974
5.469ValIle: 5.469 ± 2.108
4.558ValLys: 4.558 ± 1.315
6.381ValLeu: 6.381 ± 2.435
0.912ValMet: 0.912 ± 0.757
0.0ValAsn: 0.0 ± 0.0
4.558ValPro: 4.558 ± 1.23
7.293ValGln: 7.293 ± 3.75
3.646ValArg: 3.646 ± 2.075
5.469ValSer: 5.469 ± 1.381
2.735ValThr: 2.735 ± 2.272
0.912ValVal: 0.912 ± 0.757
0.0ValTrp: 0.0 ± 0.0
4.558ValTyr: 4.558 ± 1.994
0.0ValXaa: 0.0 ± 0.0
Trp
2.735TrpAla: 2.735 ± 1.239
0.912TrpCys: 0.912 ± 1.15
0.912TrpAsp: 0.912 ± 0.974
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.912TrpGly: 0.912 ± 0.624
0.0TrpHis: 0.0 ± 0.0
0.912TrpIle: 0.912 ± 0.757
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.912TrpMet: 0.912 ± 0.757
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.912TrpGln: 0.912 ± 0.624
2.735TrpArg: 2.735 ± 1.261
0.0TrpSer: 0.0 ± 0.0
1.823TrpThr: 1.823 ± 2.228
0.912TrpVal: 0.912 ± 0.624
0.0TrpTrp: 0.0 ± 0.0
0.912TrpTyr: 0.912 ± 0.624
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.735TyrAla: 2.735 ± 1.361
0.0TyrCys: 0.0 ± 0.0
2.735TyrAsp: 2.735 ± 1.517
2.735TyrGlu: 2.735 ± 1.517
3.646TyrPhe: 3.646 ± 0.954
0.912TyrGly: 0.912 ± 0.624
0.0TyrHis: 0.0 ± 0.0
2.735TyrIle: 2.735 ± 2.135
0.912TyrLys: 0.912 ± 0.624
5.469TyrLeu: 5.469 ± 2.245
1.823TyrMet: 1.823 ± 0.991
3.646TyrAsn: 3.646 ± 1.124
0.912TyrPro: 0.912 ± 0.624
2.735TyrGln: 2.735 ± 1.314
2.735TyrArg: 2.735 ± 2.272
4.558TyrSer: 4.558 ± 2.979
0.912TyrThr: 0.912 ± 0.757
3.646TyrVal: 3.646 ± 1.288
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1098 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
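The page does not describe the bootstrap procedure in detail. One plausible reading is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the standard deviation over the resamples is reported as the error. Reading "x100" as 100 resamples, using the sample standard deviation, the fixed seed, and the function names are all assumptions.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the six proteins of this virus.
toy = ["MSSAAK", "SSPRX", "MKKLSS", "AAKK"]
ss_error = bootstrap_error(toy, "SS")
ww_error = bootstrap_error(toy, "WW")
```

A dipeptide that is absent from every protein has the same estimate of 0.0 in every resample, which matches the "0.0 ± 0.0" entries in the table.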

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski