Amino acid dipepetide frequency for Streptococcus phage Javan446

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.863AlaAla: 3.863 ± 1.384
0.656AlaCys: 0.656 ± 0.228
3.353AlaAsp: 3.353 ± 0.48
6.049AlaGlu: 6.049 ± 0.692
3.061AlaPhe: 3.061 ± 0.572
4.883AlaGly: 4.883 ± 1.204
0.437AlaHis: 0.437 ± 0.199
5.247AlaIle: 5.247 ± 0.999
7.215AlaLys: 7.215 ± 0.944
6.195AlaLeu: 6.195 ± 0.809
2.478AlaMet: 2.478 ± 0.592
3.717AlaAsn: 3.717 ± 0.493
1.02AlaPro: 1.02 ± 0.246
2.405AlaGln: 2.405 ± 0.454
2.478AlaArg: 2.478 ± 0.368
4.008AlaSer: 4.008 ± 1.019
4.956AlaThr: 4.956 ± 0.849
4.737AlaVal: 4.737 ± 1.236
0.875AlaTrp: 0.875 ± 0.238
1.822AlaTyr: 1.822 ± 0.275
0.0AlaXaa: 0.0 ± 0.0
Cys
0.219CysAla: 0.219 ± 0.116
0.073CysCys: 0.073 ± 0.081
0.437CysAsp: 0.437 ± 0.157
0.51CysGlu: 0.51 ± 0.196
0.219CysPhe: 0.219 ± 0.124
0.437CysGly: 0.437 ± 0.184
0.073CysHis: 0.073 ± 0.079
0.437CysIle: 0.437 ± 0.198
0.656CysLys: 0.656 ± 0.22
0.656CysLeu: 0.656 ± 0.249
0.0CysMet: 0.0 ± 0.0
0.437CysAsn: 0.437 ± 0.205
0.146CysPro: 0.146 ± 0.112
0.146CysGln: 0.146 ± 0.123
0.364CysArg: 0.364 ± 0.165
0.073CysSer: 0.073 ± 0.078
0.146CysThr: 0.146 ± 0.109
0.364CysVal: 0.364 ± 0.155
0.292CysTrp: 0.292 ± 0.159
0.292CysTyr: 0.292 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
3.28AspAla: 3.28 ± 0.553
0.292AspCys: 0.292 ± 0.147
4.154AspAsp: 4.154 ± 0.572
4.3AspGlu: 4.3 ± 0.561
3.498AspPhe: 3.498 ± 0.501
4.81AspGly: 4.81 ± 0.661
1.02AspHis: 1.02 ± 0.28
5.247AspIle: 5.247 ± 0.636
5.685AspLys: 5.685 ± 0.554
6.559AspLeu: 6.559 ± 0.634
1.749AspMet: 1.749 ± 0.296
3.207AspAsn: 3.207 ± 0.492
1.093AspPro: 1.093 ± 0.276
1.603AspGln: 1.603 ± 0.296
2.332AspArg: 2.332 ± 0.399
3.644AspSer: 3.644 ± 0.489
4.008AspThr: 4.008 ± 0.557
4.446AspVal: 4.446 ± 0.454
0.875AspTrp: 0.875 ± 0.278
3.498AspTyr: 3.498 ± 0.591
0.0AspXaa: 0.0 ± 0.0
Glu
4.664GluAla: 4.664 ± 0.671
0.219GluCys: 0.219 ± 0.133
2.988GluAsp: 2.988 ± 0.516
5.83GluGlu: 5.83 ± 1.038
3.425GluPhe: 3.425 ± 0.468
3.425GluGly: 3.425 ± 0.415
1.531GluHis: 1.531 ± 0.355
5.758GluIle: 5.758 ± 0.732
6.049GluLys: 6.049 ± 0.623
7.944GluLeu: 7.944 ± 0.846
2.041GluMet: 2.041 ± 0.376
3.717GluAsn: 3.717 ± 0.636
1.531GluPro: 1.531 ± 0.381
2.842GluGln: 2.842 ± 0.51
2.842GluArg: 2.842 ± 0.43
3.79GluSer: 3.79 ± 0.41
4.008GluThr: 4.008 ± 0.804
4.81GluVal: 4.81 ± 0.725
0.729GluTrp: 0.729 ± 0.243
2.988GluTyr: 2.988 ± 0.517
0.0GluXaa: 0.0 ± 0.0
Phe
2.405PheAla: 2.405 ± 0.574
0.364PheCys: 0.364 ± 0.156
3.061PheAsp: 3.061 ± 0.39
2.769PheGlu: 2.769 ± 0.439
1.458PhePhe: 1.458 ± 0.365
3.644PheGly: 3.644 ± 0.598
0.656PheHis: 0.656 ± 0.2
2.405PheIle: 2.405 ± 0.507
4.664PheLys: 4.664 ± 0.694
3.134PheLeu: 3.134 ± 0.55
0.875PheMet: 0.875 ± 0.26
2.769PheAsn: 2.769 ± 0.503
0.656PhePro: 0.656 ± 0.214
1.166PheGln: 1.166 ± 0.242
2.186PheArg: 2.186 ± 0.364
2.915PheSer: 2.915 ± 0.495
2.988PheThr: 2.988 ± 0.355
3.061PheVal: 3.061 ± 0.455
0.364PheTrp: 0.364 ± 0.137
1.166PheTyr: 1.166 ± 0.29
0.0PheXaa: 0.0 ± 0.0
Gly
4.154GlyAla: 4.154 ± 1.168
0.292GlyCys: 0.292 ± 0.148
3.79GlyAsp: 3.79 ± 0.62
2.915GlyGlu: 2.915 ± 0.472
3.207GlyPhe: 3.207 ± 0.521
3.79GlyGly: 3.79 ± 0.602
1.02GlyHis: 1.02 ± 0.252
4.519GlyIle: 4.519 ± 0.646
7.142GlyLys: 7.142 ± 0.819
5.393GlyLeu: 5.393 ± 1.039
1.968GlyMet: 1.968 ± 0.316
2.988GlyAsn: 2.988 ± 0.412
0.729GlyPro: 0.729 ± 0.263
2.186GlyGln: 2.186 ± 0.465
3.207GlyArg: 3.207 ± 0.455
2.697GlySer: 2.697 ± 0.502
4.081GlyThr: 4.081 ± 0.627
4.737GlyVal: 4.737 ± 0.644
0.583GlyTrp: 0.583 ± 0.225
3.134GlyTyr: 3.134 ± 0.508
0.0GlyXaa: 0.0 ± 0.0
His
1.093HisAla: 1.093 ± 0.331
0.073HisCys: 0.073 ± 0.069
0.875HisAsp: 0.875 ± 0.294
1.093HisGlu: 1.093 ± 0.272
0.802HisPhe: 0.802 ± 0.282
0.947HisGly: 0.947 ± 0.339
0.292HisHis: 0.292 ± 0.182
1.312HisIle: 1.312 ± 0.349
0.947HisLys: 0.947 ± 0.345
0.729HisLeu: 0.729 ± 0.204
0.292HisMet: 0.292 ± 0.144
1.385HisAsn: 1.385 ± 0.313
0.51HisPro: 0.51 ± 0.149
0.875HisGln: 0.875 ± 0.234
0.729HisArg: 0.729 ± 0.228
1.093HisSer: 1.093 ± 0.3
1.02HisThr: 1.02 ± 0.272
1.02HisVal: 1.02 ± 0.261
0.364HisTrp: 0.364 ± 0.17
0.51HisTyr: 0.51 ± 0.191
0.0HisXaa: 0.0 ± 0.0
Ile
5.247IleAla: 5.247 ± 0.704
0.51IleCys: 0.51 ± 0.223
7.142IleAsp: 7.142 ± 0.833
4.446IleGlu: 4.446 ± 0.574
2.332IlePhe: 2.332 ± 0.456
4.3IleGly: 4.3 ± 0.505
1.603IleHis: 1.603 ± 0.388
4.519IleIle: 4.519 ± 0.57
7.142IleLys: 7.142 ± 0.862
5.029IleLeu: 5.029 ± 0.643
1.312IleMet: 1.312 ± 0.331
4.154IleAsn: 4.154 ± 0.548
1.603IlePro: 1.603 ± 0.396
2.769IleGln: 2.769 ± 0.437
3.425IleArg: 3.425 ± 0.551
5.32IleSer: 5.32 ± 0.916
5.175IleThr: 5.175 ± 0.629
3.863IleVal: 3.863 ± 0.479
0.583IleTrp: 0.583 ± 0.199
2.551IleTyr: 2.551 ± 0.515
0.0IleXaa: 0.0 ± 0.0
Lys
6.778LysAla: 6.778 ± 0.749
0.146LysCys: 0.146 ± 0.1
4.883LysAsp: 4.883 ± 0.583
6.997LysGlu: 6.997 ± 1.026
2.842LysPhe: 2.842 ± 0.408
4.883LysGly: 4.883 ± 0.626
1.458LysHis: 1.458 ± 0.368
7.142LysIle: 7.142 ± 0.733
7.798LysLys: 7.798 ± 1.15
7.871LysLeu: 7.871 ± 0.715
2.551LysMet: 2.551 ± 0.426
4.3LysAsn: 4.3 ± 0.508
2.114LysPro: 2.114 ± 0.412
4.956LysGln: 4.956 ± 0.538
3.717LysArg: 3.717 ± 0.609
5.247LysSer: 5.247 ± 0.678
6.122LysThr: 6.122 ± 0.859
6.268LysVal: 6.268 ± 0.802
1.02LysTrp: 1.02 ± 0.227
3.498LysTyr: 3.498 ± 0.554
0.0LysXaa: 0.0 ± 0.0
Leu
6.049LeuAla: 6.049 ± 0.905
0.437LeuCys: 0.437 ± 0.265
6.705LeuAsp: 6.705 ± 0.664
7.58LeuGlu: 7.58 ± 0.713
3.79LeuPhe: 3.79 ± 0.411
4.592LeuGly: 4.592 ± 0.939
0.729LeuHis: 0.729 ± 0.199
5.539LeuIle: 5.539 ± 0.775
7.725LeuLys: 7.725 ± 0.96
6.341LeuLeu: 6.341 ± 0.886
2.114LeuMet: 2.114 ± 0.37
4.592LeuAsn: 4.592 ± 0.57
2.697LeuPro: 2.697 ± 0.415
3.717LeuGln: 3.717 ± 0.564
2.915LeuArg: 2.915 ± 0.571
6.486LeuSer: 6.486 ± 0.615
5.976LeuThr: 5.976 ± 0.593
4.227LeuVal: 4.227 ± 0.515
0.51LeuTrp: 0.51 ± 0.173
1.968LeuTyr: 1.968 ± 0.352
0.0LeuXaa: 0.0 ± 0.0
Met
2.332MetAla: 2.332 ± 0.494
0.073MetCys: 0.073 ± 0.077
1.531MetAsp: 1.531 ± 0.354
1.603MetGlu: 1.603 ± 0.384
0.875MetPhe: 0.875 ± 0.24
1.239MetGly: 1.239 ± 0.299
0.364MetHis: 0.364 ± 0.169
1.822MetIle: 1.822 ± 0.309
1.531MetLys: 1.531 ± 0.321
1.895MetLeu: 1.895 ± 0.444
0.364MetMet: 0.364 ± 0.165
1.749MetAsn: 1.749 ± 0.358
0.51MetPro: 0.51 ± 0.225
1.385MetGln: 1.385 ± 0.334
1.895MetArg: 1.895 ± 0.347
1.822MetSer: 1.822 ± 0.428
1.895MetThr: 1.895 ± 0.399
1.166MetVal: 1.166 ± 0.301
0.146MetTrp: 0.146 ± 0.102
0.656MetTyr: 0.656 ± 0.175
0.0MetXaa: 0.0 ± 0.0
Asn
3.425AsnAla: 3.425 ± 0.576
0.656AsnCys: 0.656 ± 0.249
2.842AsnAsp: 2.842 ± 0.439
3.79AsnGlu: 3.79 ± 0.595
2.478AsnPhe: 2.478 ± 0.41
4.227AsnGly: 4.227 ± 0.457
0.583AsnHis: 0.583 ± 0.201
3.644AsnIle: 3.644 ± 0.689
4.519AsnLys: 4.519 ± 0.709
4.737AsnLeu: 4.737 ± 0.534
1.603AsnMet: 1.603 ± 0.36
2.769AsnAsn: 2.769 ± 0.437
2.041AsnPro: 2.041 ± 0.371
2.041AsnGln: 2.041 ± 0.402
2.332AsnArg: 2.332 ± 0.385
2.915AsnSer: 2.915 ± 0.473
2.551AsnThr: 2.551 ± 0.544
2.332AsnVal: 2.332 ± 0.386
1.093AsnTrp: 1.093 ± 0.278
2.186AsnTyr: 2.186 ± 0.383
0.0AsnXaa: 0.0 ± 0.0
Pro
1.166ProAla: 1.166 ± 0.38
0.073ProCys: 0.073 ± 0.075
1.239ProAsp: 1.239 ± 0.312
1.312ProGlu: 1.312 ± 0.357
1.239ProPhe: 1.239 ± 0.333
0.729ProGly: 0.729 ± 0.195
0.51ProHis: 0.51 ± 0.187
2.041ProIle: 2.041 ± 0.395
2.842ProLys: 2.842 ± 0.491
1.676ProLeu: 1.676 ± 0.382
0.364ProMet: 0.364 ± 0.145
1.458ProAsn: 1.458 ± 0.346
0.875ProPro: 0.875 ± 0.24
0.802ProGln: 0.802 ± 0.23
0.802ProArg: 0.802 ± 0.231
1.895ProSer: 1.895 ± 0.362
1.603ProThr: 1.603 ± 0.321
1.385ProVal: 1.385 ± 0.33
0.364ProTrp: 0.364 ± 0.188
0.875ProTyr: 0.875 ± 0.226
0.0ProXaa: 0.0 ± 0.0
Gln
3.936GlnAla: 3.936 ± 0.839
0.364GlnCys: 0.364 ± 0.159
1.531GlnAsp: 1.531 ± 0.356
3.28GlnGlu: 3.28 ± 0.519
1.531GlnPhe: 1.531 ± 0.337
2.915GlnGly: 2.915 ± 0.421
0.51GlnHis: 0.51 ± 0.172
3.717GlnIle: 3.717 ± 0.46
3.353GlnLys: 3.353 ± 0.5
3.061GlnLeu: 3.061 ± 0.454
0.802GlnMet: 0.802 ± 0.301
1.822GlnAsn: 1.822 ± 0.44
0.947GlnPro: 0.947 ± 0.258
1.749GlnGln: 1.749 ± 0.441
1.239GlnArg: 1.239 ± 0.241
3.79GlnSer: 3.79 ± 0.63
2.551GlnThr: 2.551 ± 0.535
1.749GlnVal: 1.749 ± 0.397
0.51GlnTrp: 0.51 ± 0.198
1.166GlnTyr: 1.166 ± 0.292
0.0GlnXaa: 0.0 ± 0.0
Arg
2.915ArgAla: 2.915 ± 0.452
0.292ArgCys: 0.292 ± 0.122
2.332ArgAsp: 2.332 ± 0.457
2.988ArgGlu: 2.988 ± 0.485
1.895ArgPhe: 1.895 ± 0.36
2.478ArgGly: 2.478 ± 0.464
0.875ArgHis: 0.875 ± 0.243
2.624ArgIle: 2.624 ± 0.399
4.446ArgLys: 4.446 ± 0.625
3.79ArgLeu: 3.79 ± 0.473
1.093ArgMet: 1.093 ± 0.234
2.041ArgAsn: 2.041 ± 0.428
0.802ArgPro: 0.802 ± 0.242
1.676ArgGln: 1.676 ± 0.356
1.531ArgArg: 1.531 ± 0.37
2.114ArgSer: 2.114 ± 0.384
2.478ArgThr: 2.478 ± 0.4
2.332ArgVal: 2.332 ± 0.403
0.583ArgTrp: 0.583 ± 0.237
1.968ArgTyr: 1.968 ± 0.407
0.0ArgXaa: 0.0 ± 0.0
Ser
5.393SerAla: 5.393 ± 1.393
0.292SerCys: 0.292 ± 0.128
4.664SerAsp: 4.664 ± 0.683
3.79SerGlu: 3.79 ± 0.565
2.697SerPhe: 2.697 ± 0.426
3.79SerGly: 3.79 ± 0.664
0.802SerHis: 0.802 ± 0.248
4.373SerIle: 4.373 ± 0.579
5.32SerLys: 5.32 ± 0.696
5.175SerLeu: 5.175 ± 0.73
1.822SerMet: 1.822 ± 0.517
3.061SerAsn: 3.061 ± 0.57
1.458SerPro: 1.458 ± 0.278
3.353SerGln: 3.353 ± 0.724
2.624SerArg: 2.624 ± 0.471
4.519SerSer: 4.519 ± 0.97
3.353SerThr: 3.353 ± 0.593
4.592SerVal: 4.592 ± 0.564
0.583SerTrp: 0.583 ± 0.216
1.895SerTyr: 1.895 ± 0.36
0.0SerXaa: 0.0 ± 0.0
Thr
4.519ThrAla: 4.519 ± 0.801
0.146ThrCys: 0.146 ± 0.128
4.592ThrAsp: 4.592 ± 0.631
3.79ThrGlu: 3.79 ± 0.581
2.769ThrPhe: 2.769 ± 0.541
4.664ThrGly: 4.664 ± 0.667
1.458ThrHis: 1.458 ± 0.289
4.519ThrIle: 4.519 ± 0.43
4.883ThrLys: 4.883 ± 0.794
5.976ThrLeu: 5.976 ± 0.68
1.02ThrMet: 1.02 ± 0.281
3.28ThrAsn: 3.28 ± 0.538
1.895ThrPro: 1.895 ± 0.39
2.478ThrGln: 2.478 ± 0.356
1.895ThrArg: 1.895 ± 0.439
4.446ThrSer: 4.446 ± 0.579
4.3ThrThr: 4.3 ± 0.733
4.664ThrVal: 4.664 ± 0.644
0.583ThrTrp: 0.583 ± 0.213
1.822ThrTyr: 1.822 ± 0.373
0.0ThrXaa: 0.0 ± 0.0
Val
4.737ValAla: 4.737 ± 1.159
0.437ValCys: 0.437 ± 0.153
4.81ValAsp: 4.81 ± 0.464
5.029ValGlu: 5.029 ± 0.676
2.842ValPhe: 2.842 ± 0.47
3.717ValGly: 3.717 ± 0.63
0.947ValHis: 0.947 ± 0.253
4.227ValIle: 4.227 ± 0.558
4.81ValLys: 4.81 ± 0.727
5.029ValLeu: 5.029 ± 0.639
1.385ValMet: 1.385 ± 0.299
3.061ValAsn: 3.061 ± 0.363
1.531ValPro: 1.531 ± 0.323
2.259ValGln: 2.259 ± 0.369
2.769ValArg: 2.769 ± 0.476
4.008ValSer: 4.008 ± 0.636
4.081ValThr: 4.081 ± 0.569
4.081ValVal: 4.081 ± 0.49
0.656ValTrp: 0.656 ± 0.245
1.749ValTyr: 1.749 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
0.802TrpAla: 0.802 ± 0.295
0.146TrpCys: 0.146 ± 0.103
1.093TrpAsp: 1.093 ± 0.274
0.656TrpGlu: 0.656 ± 0.249
0.364TrpPhe: 0.364 ± 0.146
1.312TrpGly: 1.312 ± 0.256
0.146TrpHis: 0.146 ± 0.099
0.729TrpIle: 0.729 ± 0.245
1.166TrpLys: 1.166 ± 0.249
1.02TrpLeu: 1.02 ± 0.345
0.292TrpMet: 0.292 ± 0.146
0.219TrpAsn: 0.219 ± 0.124
0.292TrpPro: 0.292 ± 0.141
0.292TrpGln: 0.292 ± 0.155
0.729TrpArg: 0.729 ± 0.213
0.583TrpSer: 0.583 ± 0.189
0.146TrpThr: 0.146 ± 0.111
0.729TrpVal: 0.729 ± 0.279
0.0TrpTrp: 0.0 ± 0.0
0.292TrpTyr: 0.292 ± 0.141
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.478TyrAla: 2.478 ± 0.381
0.437TyrCys: 0.437 ± 0.186
3.28TyrAsp: 3.28 ± 0.514
2.405TyrGlu: 2.405 ± 0.424
1.385TyrPhe: 1.385 ± 0.375
1.749TyrGly: 1.749 ± 0.363
0.947TyrHis: 0.947 ± 0.267
3.207TyrIle: 3.207 ± 0.445
2.697TyrLys: 2.697 ± 0.539
2.478TyrLeu: 2.478 ± 0.511
0.656TyrMet: 0.656 ± 0.227
2.041TyrAsn: 2.041 ± 0.421
0.802TyrPro: 0.802 ± 0.273
1.895TyrGln: 1.895 ± 0.374
1.239TyrArg: 1.239 ± 0.315
2.186TyrSer: 2.186 ± 0.4
2.259TyrThr: 2.259 ± 0.445
1.603TyrVal: 1.603 ± 0.316
0.292TyrTrp: 0.292 ± 0.136
1.531TyrTyr: 1.531 ± 0.435
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13722 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski