Amino acid dipepetide frequency for Diodia vein chlorosis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.311AlaAla: 2.311 ± 1.749
0.578AlaCys: 0.578 ± 0.205
3.033AlaAsp: 3.033 ± 0.893
2.311AlaGlu: 2.311 ± 0.493
2.022AlaPhe: 2.022 ± 0.398
2.455AlaGly: 2.455 ± 0.631
0.0AlaHis: 0.0 ± 0.0
2.889AlaIle: 2.889 ± 0.822
4.333AlaLys: 4.333 ± 0.607
3.899AlaLeu: 3.899 ± 0.492
1.155AlaMet: 1.155 ± 0.246
3.466AlaAsn: 3.466 ± 1.116
0.144AlaPro: 0.144 ± 0.106
1.733AlaGln: 1.733 ± 0.295
1.3AlaArg: 1.3 ± 0.301
1.733AlaSer: 1.733 ± 0.373
1.3AlaThr: 1.3 ± 0.414
1.878AlaVal: 1.878 ± 0.43
0.289AlaTrp: 0.289 ± 0.118
1.3AlaTyr: 1.3 ± 0.509
0.0AlaXaa: 0.0 ± 0.0
Cys
0.144CysAla: 0.144 ± 0.242
0.144CysCys: 0.144 ± 0.242
1.878CysAsp: 1.878 ± 0.481
0.578CysGlu: 0.578 ± 0.286
0.289CysPhe: 0.289 ± 0.118
1.589CysGly: 1.589 ± 0.332
0.289CysHis: 0.289 ± 0.118
1.011CysIle: 1.011 ± 0.194
1.878CysLys: 1.878 ± 0.213
3.177CysLeu: 3.177 ± 0.941
0.144CysMet: 0.144 ± 0.262
1.589CysAsn: 1.589 ± 0.296
0.289CysPro: 0.289 ± 0.118
1.155CysGln: 1.155 ± 0.471
0.578CysArg: 0.578 ± 0.207
0.867CysSer: 0.867 ± 0.297
1.444CysThr: 1.444 ± 0.348
2.022CysVal: 2.022 ± 0.527
0.289CysTrp: 0.289 ± 0.118
1.589CysTyr: 1.589 ± 0.377
0.0CysXaa: 0.0 ± 0.0
Asp
2.455AspAla: 2.455 ± 0.617
1.733AspCys: 1.733 ± 0.244
5.199AspAsp: 5.199 ± 1.096
3.899AspGlu: 3.899 ± 0.532
5.199AspPhe: 5.199 ± 1.292
5.633AspGly: 5.633 ± 0.918
0.0AspHis: 0.0 ± 0.0
6.21AspIle: 6.21 ± 0.741
4.188AspLys: 4.188 ± 0.655
5.633AspLeu: 5.633 ± 0.701
1.733AspMet: 1.733 ± 0.443
4.477AspAsn: 4.477 ± 0.782
1.155AspPro: 1.155 ± 0.427
1.155AspGln: 1.155 ± 0.708
3.177AspArg: 3.177 ± 0.59
3.466AspSer: 3.466 ± 0.851
2.455AspThr: 2.455 ± 0.818
8.521AspVal: 8.521 ± 0.821
0.144AspTrp: 0.144 ± 0.217
1.589AspTyr: 1.589 ± 0.403
0.0AspXaa: 0.0 ± 0.0
Glu
1.3GluAla: 1.3 ± 0.38
1.733GluCys: 1.733 ± 0.706
3.322GluAsp: 3.322 ± 0.631
4.333GluGlu: 4.333 ± 0.932
3.322GluPhe: 3.322 ± 0.685
3.033GluGly: 3.033 ± 0.741
1.011GluHis: 1.011 ± 0.398
4.333GluIle: 4.333 ± 0.759
8.088GluLys: 8.088 ± 1.563
3.322GluLeu: 3.322 ± 0.55
1.589GluMet: 1.589 ± 0.443
3.322GluAsn: 3.322 ± 0.725
2.744GluPro: 2.744 ± 0.815
0.722GluGln: 0.722 ± 0.214
2.744GluArg: 2.744 ± 0.893
5.488GluSer: 5.488 ± 1.163
1.589GluThr: 1.589 ± 0.525
3.755GluVal: 3.755 ± 1.144
0.0GluTrp: 0.0 ± 0.0
2.311GluTyr: 2.311 ± 0.361
0.0GluXaa: 0.0 ± 0.0
Phe
2.022PheAla: 2.022 ± 0.811
2.166PheCys: 2.166 ± 0.451
4.91PheAsp: 4.91 ± 0.818
2.166PheGlu: 2.166 ± 0.313
2.311PhePhe: 2.311 ± 0.908
2.455PheGly: 2.455 ± 0.338
0.578PheHis: 0.578 ± 0.235
4.622PheIle: 4.622 ± 0.883
4.044PheLys: 4.044 ± 1.277
4.477PheLeu: 4.477 ± 1.253
1.444PheMet: 1.444 ± 0.373
3.611PheAsn: 3.611 ± 0.927
1.3PhePro: 1.3 ± 0.341
1.155PheGln: 1.155 ± 0.348
3.322PheArg: 3.322 ± 0.683
7.221PheSer: 7.221 ± 0.801
2.6PheThr: 2.6 ± 0.371
3.755PheVal: 3.755 ± 0.583
0.578PheTrp: 0.578 ± 0.337
2.889PheTyr: 2.889 ± 0.344
0.0PheXaa: 0.0 ± 0.0
Gly
1.878GlyAla: 1.878 ± 0.466
1.3GlyCys: 1.3 ± 0.262
3.755GlyAsp: 3.755 ± 0.483
2.889GlyGlu: 2.889 ± 0.67
2.455GlyPhe: 2.455 ± 1.016
3.611GlyGly: 3.611 ± 0.815
0.867GlyHis: 0.867 ± 0.291
2.889GlyIle: 2.889 ± 0.424
6.932GlyLys: 6.932 ± 1.276
3.322GlyLeu: 3.322 ± 0.757
1.3GlyMet: 1.3 ± 0.541
2.022GlyAsn: 2.022 ± 0.239
0.289GlyPro: 0.289 ± 0.118
1.011GlyGln: 1.011 ± 0.368
2.455GlyArg: 2.455 ± 0.563
3.177GlySer: 3.177 ± 0.399
1.155GlyThr: 1.155 ± 0.844
3.322GlyVal: 3.322 ± 0.613
0.578GlyTrp: 0.578 ± 0.323
2.455GlyTyr: 2.455 ± 0.617
0.0GlyXaa: 0.0 ± 0.0
His
0.722HisAla: 0.722 ± 0.23
0.144HisCys: 0.144 ± 0.106
0.722HisAsp: 0.722 ± 0.222
1.011HisGlu: 1.011 ± 0.398
1.589HisPhe: 1.589 ± 0.519
0.433HisGly: 0.433 ± 0.203
0.578HisHis: 0.578 ± 0.235
0.0HisIle: 0.0 ± 0.0
1.589HisLys: 1.589 ± 0.377
1.011HisLeu: 1.011 ± 0.498
0.144HisMet: 0.144 ± 0.106
0.289HisAsn: 0.289 ± 0.409
0.722HisPro: 0.722 ± 0.23
0.144HisGln: 0.144 ± 0.106
0.144HisArg: 0.144 ± 0.106
1.444HisSer: 1.444 ± 0.235
0.867HisThr: 0.867 ± 0.202
1.155HisVal: 1.155 ± 0.421
0.144HisTrp: 0.144 ± 0.178
0.289HisTyr: 0.289 ± 0.118
0.0HisXaa: 0.0 ± 0.0
Ile
1.878IleAla: 1.878 ± 0.433
0.578IleCys: 0.578 ± 0.235
6.499IleAsp: 6.499 ± 0.759
2.744IleGlu: 2.744 ± 0.637
3.755IlePhe: 3.755 ± 0.753
2.022IleGly: 2.022 ± 0.326
1.589IleHis: 1.589 ± 0.332
6.932IleIle: 6.932 ± 1.025
6.355IleLys: 6.355 ± 0.828
6.644IleLeu: 6.644 ± 0.806
1.011IleMet: 1.011 ± 0.328
7.077IleAsn: 7.077 ± 0.789
2.166IlePro: 2.166 ± 0.265
2.6IleGln: 2.6 ± 0.485
3.033IleArg: 3.033 ± 0.922
5.777IleSer: 5.777 ± 0.536
3.177IleThr: 3.177 ± 0.586
3.755IleVal: 3.755 ± 0.709
0.289IleTrp: 0.289 ± 0.186
2.455IleTyr: 2.455 ± 0.304
0.0IleXaa: 0.0 ± 0.0
Lys
3.033LysAla: 3.033 ± 0.684
2.311LysCys: 2.311 ± 0.331
4.622LysAsp: 4.622 ± 0.909
5.777LysGlu: 5.777 ± 0.813
7.221LysPhe: 7.221 ± 0.811
3.322LysGly: 3.322 ± 0.514
1.155LysHis: 1.155 ± 0.583
5.777LysIle: 5.777 ± 0.979
7.366LysLys: 7.366 ± 1.678
8.521LysLeu: 8.521 ± 0.861
2.6LysMet: 2.6 ± 0.565
6.499LysAsn: 6.499 ± 0.96
2.455LysPro: 2.455 ± 0.44
2.455LysGln: 2.455 ± 0.523
3.611LysArg: 3.611 ± 1.098
6.066LysSer: 6.066 ± 0.91
6.21LysThr: 6.21 ± 0.754
6.644LysVal: 6.644 ± 0.612
1.011LysTrp: 1.011 ± 0.343
2.744LysTyr: 2.744 ± 0.751
0.0LysXaa: 0.0 ± 0.0
Leu
4.333LeuAla: 4.333 ± 1.12
1.733LeuCys: 1.733 ± 0.456
6.21LeuAsp: 6.21 ± 0.699
5.633LeuGlu: 5.633 ± 1.191
3.611LeuPhe: 3.611 ± 0.527
4.333LeuGly: 4.333 ± 0.74
1.011LeuHis: 1.011 ± 0.302
4.91LeuIle: 4.91 ± 0.483
8.81LeuLys: 8.81 ± 1.142
7.366LeuLeu: 7.366 ± 1.283
3.611LeuMet: 3.611 ± 0.831
5.488LeuAsn: 5.488 ± 0.783
4.044LeuPro: 4.044 ± 0.78
1.155LeuGln: 1.155 ± 0.435
5.777LeuArg: 5.777 ± 0.898
9.676LeuSer: 9.676 ± 1.317
5.199LeuThr: 5.199 ± 0.867
3.611LeuVal: 3.611 ± 0.789
0.289LeuTrp: 0.289 ± 0.267
3.033LeuTyr: 3.033 ± 0.908
0.0LeuXaa: 0.0 ± 0.0
Met
2.022MetAla: 2.022 ± 0.612
0.578MetCys: 0.578 ± 0.235
1.444MetAsp: 1.444 ± 0.252
0.433MetGlu: 0.433 ± 0.255
1.3MetPhe: 1.3 ± 0.367
0.867MetGly: 0.867 ± 0.291
0.144MetHis: 0.144 ± 0.217
0.722MetIle: 0.722 ± 0.224
2.744MetLys: 2.744 ± 0.748
2.311MetLeu: 2.311 ± 0.623
0.867MetMet: 0.867 ± 0.255
2.022MetAsn: 2.022 ± 0.655
0.289MetPro: 0.289 ± 0.215
1.011MetGln: 1.011 ± 0.269
1.444MetArg: 1.444 ± 0.444
2.455MetSer: 2.455 ± 0.432
2.6MetThr: 2.6 ± 0.799
1.589MetVal: 1.589 ± 0.479
0.0MetTrp: 0.0 ± 0.0
1.155MetTyr: 1.155 ± 0.446
0.0MetXaa: 0.0 ± 0.0
Asn
1.589AsnAla: 1.589 ± 0.628
0.578AsnCys: 0.578 ± 0.426
4.477AsnAsp: 4.477 ± 1.014
3.899AsnGlu: 3.899 ± 0.807
5.199AsnPhe: 5.199 ± 1.387
3.033AsnGly: 3.033 ± 0.522
0.289AsnHis: 0.289 ± 0.215
4.622AsnIle: 4.622 ± 1.274
6.21AsnLys: 6.21 ± 0.751
5.055AsnLeu: 5.055 ± 0.539
1.733AsnMet: 1.733 ± 0.313
2.6AsnAsn: 2.6 ± 0.779
2.166AsnPro: 2.166 ± 0.552
2.744AsnGln: 2.744 ± 0.771
2.455AsnArg: 2.455 ± 0.493
5.055AsnSer: 5.055 ± 1.634
4.477AsnThr: 4.477 ± 0.763
5.055AsnVal: 5.055 ± 0.423
0.433AsnTrp: 0.433 ± 0.171
3.033AsnTyr: 3.033 ± 0.403
0.0AsnXaa: 0.0 ± 0.0
Pro
1.444ProAla: 1.444 ± 0.342
0.433ProCys: 0.433 ± 0.165
1.444ProAsp: 1.444 ± 0.486
2.311ProGlu: 2.311 ± 0.67
1.3ProPhe: 1.3 ± 0.405
1.011ProGly: 1.011 ± 0.357
0.578ProHis: 0.578 ± 0.235
1.733ProIle: 1.733 ± 0.223
2.166ProLys: 2.166 ± 0.415
3.177ProLeu: 3.177 ± 0.622
0.722ProMet: 0.722 ± 0.268
2.022ProAsn: 2.022 ± 0.424
1.011ProPro: 1.011 ± 0.398
0.289ProGln: 0.289 ± 0.37
1.3ProArg: 1.3 ± 0.194
1.878ProSer: 1.878 ± 0.272
1.444ProThr: 1.444 ± 0.397
4.044ProVal: 4.044 ± 1.148
0.0ProTrp: 0.0 ± 0.0
1.878ProTyr: 1.878 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
1.444GlnAla: 1.444 ± 0.239
1.155GlnCys: 1.155 ± 0.345
0.722GlnAsp: 0.722 ± 0.341
2.166GlnGlu: 2.166 ± 0.541
1.733GlnPhe: 1.733 ± 0.297
0.867GlnGly: 0.867 ± 0.349
0.144GlnHis: 0.144 ± 0.217
3.033GlnIle: 3.033 ± 0.527
0.433GlnLys: 0.433 ± 0.257
1.3GlnLeu: 1.3 ± 1.1
0.433GlnMet: 0.433 ± 0.165
1.878GlnAsn: 1.878 ± 0.451
1.155GlnPro: 1.155 ± 0.471
0.867GlnGln: 0.867 ± 0.205
1.444GlnArg: 1.444 ± 0.376
1.155GlnSer: 1.155 ± 0.725
0.722GlnThr: 0.722 ± 0.495
3.322GlnVal: 3.322 ± 0.837
0.578GlnTrp: 0.578 ± 0.235
2.166GlnTyr: 2.166 ± 0.404
0.0GlnXaa: 0.0 ± 0.0
Arg
1.444ArgAla: 1.444 ± 0.35
1.3ArgCys: 1.3 ± 0.423
3.466ArgAsp: 3.466 ± 0.441
2.022ArgGlu: 2.022 ± 0.533
3.033ArgPhe: 3.033 ± 0.912
1.878ArgGly: 1.878 ± 0.335
0.289ArgHis: 0.289 ± 0.118
4.188ArgIle: 4.188 ± 0.69
3.177ArgLys: 3.177 ± 0.754
5.199ArgLeu: 5.199 ± 0.82
1.444ArgMet: 1.444 ± 0.427
1.733ArgAsn: 1.733 ± 0.303
0.722ArgPro: 0.722 ± 0.412
2.6ArgGln: 2.6 ± 0.619
2.455ArgArg: 2.455 ± 0.894
4.622ArgSer: 4.622 ± 0.478
2.166ArgThr: 2.166 ± 0.377
2.022ArgVal: 2.022 ± 0.45
0.433ArgTrp: 0.433 ± 0.369
2.166ArgTyr: 2.166 ± 0.964
0.0ArgXaa: 0.0 ± 0.0
Ser
2.455SerAla: 2.455 ± 0.79
0.867SerCys: 0.867 ± 0.477
4.333SerAsp: 4.333 ± 0.974
3.899SerGlu: 3.899 ± 0.323
5.921SerPhe: 5.921 ± 0.707
4.333SerGly: 4.333 ± 0.832
2.455SerHis: 2.455 ± 0.445
5.199SerIle: 5.199 ± 1.329
7.221SerLys: 7.221 ± 0.835
6.355SerLeu: 6.355 ± 1.219
2.166SerMet: 2.166 ± 0.259
6.066SerAsn: 6.066 ± 1.01
1.444SerPro: 1.444 ± 0.628
2.6SerGln: 2.6 ± 0.584
2.889SerArg: 2.889 ± 0.235
5.055SerSer: 5.055 ± 1.106
2.744SerThr: 2.744 ± 0.818
6.932SerVal: 6.932 ± 0.582
0.0SerTrp: 0.0 ± 0.0
5.488SerTyr: 5.488 ± 1.003
0.0SerXaa: 0.0 ± 0.0
Thr
2.744ThrAla: 2.744 ± 0.607
0.722ThrCys: 0.722 ± 0.23
3.466ThrAsp: 3.466 ± 0.912
3.322ThrGlu: 3.322 ± 0.374
3.177ThrPhe: 3.177 ± 1.183
2.022ThrGly: 2.022 ± 0.375
0.144ThrHis: 0.144 ± 0.253
3.611ThrIle: 3.611 ± 0.707
2.455ThrLys: 2.455 ± 0.963
5.777ThrLeu: 5.777 ± 0.98
0.867ThrMet: 0.867 ± 0.309
3.466ThrAsn: 3.466 ± 0.385
1.878ThrPro: 1.878 ± 0.265
0.722ThrGln: 0.722 ± 0.23
1.589ThrArg: 1.589 ± 0.565
4.622ThrSer: 4.622 ± 1.226
3.322ThrThr: 3.322 ± 0.854
2.455ThrVal: 2.455 ± 0.322
0.433ThrTrp: 0.433 ± 0.269
3.177ThrTyr: 3.177 ± 0.527
0.0ThrXaa: 0.0 ± 0.0
Val
3.611ValAla: 3.611 ± 0.299
1.878ValCys: 1.878 ± 0.451
5.633ValAsp: 5.633 ± 0.6
4.91ValGlu: 4.91 ± 1.042
1.733ValPhe: 1.733 ± 0.389
2.166ValGly: 2.166 ± 0.819
1.155ValHis: 1.155 ± 0.471
4.188ValIle: 4.188 ± 0.774
7.366ValLys: 7.366 ± 1.26
6.788ValLeu: 6.788 ± 0.736
1.733ValMet: 1.733 ± 0.549
3.755ValAsn: 3.755 ± 1.27
3.466ValPro: 3.466 ± 0.417
1.878ValGln: 1.878 ± 0.451
4.044ValArg: 4.044 ± 0.455
6.21ValSer: 6.21 ± 1.077
3.755ValThr: 3.755 ± 0.591
4.622ValVal: 4.622 ± 0.613
0.289ValTrp: 0.289 ± 0.118
2.889ValTyr: 2.889 ± 0.653
0.0ValXaa: 0.0 ± 0.0
Trp
0.433TrpAla: 0.433 ± 0.171
0.0TrpCys: 0.0 ± 0.0
0.289TrpAsp: 0.289 ± 0.118
0.289TrpGlu: 0.289 ± 0.267
0.289TrpPhe: 0.289 ± 0.118
0.289TrpGly: 0.289 ± 0.118
0.0TrpHis: 0.0 ± 0.0
0.289TrpIle: 0.289 ± 0.208
0.722TrpLys: 0.722 ± 0.325
1.011TrpLeu: 1.011 ± 0.269
0.433TrpMet: 0.433 ± 0.198
0.289TrpAsn: 0.289 ± 0.309
0.144TrpPro: 0.144 ± 0.178
0.144TrpGln: 0.144 ± 0.178
0.578TrpArg: 0.578 ± 0.232
0.144TrpSer: 0.144 ± 0.106
0.289TrpThr: 0.289 ± 0.118
0.433TrpVal: 0.433 ± 0.19
0.0TrpTrp: 0.0 ± 0.0
0.578TrpTyr: 0.578 ± 0.337
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.867TyrAla: 0.867 ± 0.458
1.011TyrCys: 1.011 ± 0.398
2.6TyrAsp: 2.6 ± 0.551
3.466TyrGlu: 3.466 ± 1.12
2.166TyrPhe: 2.166 ± 0.622
2.311TyrGly: 2.311 ± 0.514
0.867TyrHis: 0.867 ± 0.213
3.177TyrIle: 3.177 ± 0.66
3.466TyrLys: 3.466 ± 0.363
5.488TyrLeu: 5.488 ± 0.702
0.722TyrMet: 0.722 ± 0.626
3.033TyrAsn: 3.033 ± 0.48
2.6TyrPro: 2.6 ± 0.753
0.578TyrGln: 0.578 ± 0.323
2.022TyrArg: 2.022 ± 0.527
2.311TyrSer: 2.311 ± 0.568
2.455TyrThr: 2.455 ± 0.505
3.177TyrVal: 3.177 ± 0.644
0.867TyrTrp: 0.867 ± 0.291
2.6TyrTyr: 2.6 ± 0.497
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6925 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski