Amino acid dipepetide frequency for Streptococcus phage Javan406

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.635AlaAla: 4.635 ± 1.562
0.35AlaCys: 0.35 ± 0.157
5.16AlaAsp: 5.16 ± 0.764
5.422AlaGlu: 5.422 ± 0.836
3.498AlaPhe: 3.498 ± 0.769
4.985AlaGly: 4.985 ± 1.607
1.312AlaHis: 1.312 ± 0.364
7.346AlaIle: 7.346 ± 1.171
5.334AlaLys: 5.334 ± 0.769
6.209AlaLeu: 6.209 ± 0.756
2.361AlaMet: 2.361 ± 0.662
4.81AlaAsn: 4.81 ± 0.661
2.011AlaPro: 2.011 ± 0.376
3.585AlaGln: 3.585 ± 0.978
2.099AlaArg: 2.099 ± 0.473
4.11AlaSer: 4.11 ± 0.878
5.509AlaThr: 5.509 ± 0.93
4.897AlaVal: 4.897 ± 1.003
0.787AlaTrp: 0.787 ± 0.249
2.536AlaTyr: 2.536 ± 0.432
0.0AlaXaa: 0.0 ± 0.0
Cys
0.262CysAla: 0.262 ± 0.147
0.087CysCys: 0.087 ± 0.085
0.262CysAsp: 0.262 ± 0.146
0.437CysGlu: 0.437 ± 0.185
0.175CysPhe: 0.175 ± 0.111
0.437CysGly: 0.437 ± 0.184
0.175CysHis: 0.175 ± 0.122
0.612CysIle: 0.612 ± 0.216
0.7CysLys: 0.7 ± 0.222
0.087CysLeu: 0.087 ± 0.088
0.087CysMet: 0.087 ± 0.109
0.262CysAsn: 0.262 ± 0.128
0.175CysPro: 0.175 ± 0.123
0.087CysGln: 0.087 ± 0.081
0.35CysArg: 0.35 ± 0.168
0.525CysSer: 0.525 ± 0.219
0.175CysThr: 0.175 ± 0.132
0.262CysVal: 0.262 ± 0.159
0.0CysTrp: 0.0 ± 0.0
0.175CysTyr: 0.175 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
3.935AspAla: 3.935 ± 0.66
0.35AspCys: 0.35 ± 0.188
4.023AspAsp: 4.023 ± 0.806
4.285AspGlu: 4.285 ± 0.741
4.285AspPhe: 4.285 ± 0.536
4.985AspGly: 4.985 ± 0.953
0.7AspHis: 0.7 ± 0.257
3.848AspIle: 3.848 ± 0.726
5.597AspLys: 5.597 ± 0.771
5.334AspLeu: 5.334 ± 0.661
1.574AspMet: 1.574 ± 0.431
4.285AspAsn: 4.285 ± 0.71
1.487AspPro: 1.487 ± 0.322
1.224AspGln: 1.224 ± 0.334
3.323AspArg: 3.323 ± 0.617
4.373AspSer: 4.373 ± 0.701
3.498AspThr: 3.498 ± 0.584
3.935AspVal: 3.935 ± 0.687
0.7AspTrp: 0.7 ± 0.264
3.498AspTyr: 3.498 ± 0.612
0.0AspXaa: 0.0 ± 0.0
Glu
4.81GluAla: 4.81 ± 0.75
0.525GluCys: 0.525 ± 0.248
3.585GluAsp: 3.585 ± 0.709
4.373GluGlu: 4.373 ± 0.78
3.323GluPhe: 3.323 ± 0.663
2.711GluGly: 2.711 ± 0.582
1.049GluHis: 1.049 ± 0.394
5.422GluIle: 5.422 ± 0.816
6.034GluLys: 6.034 ± 0.858
7.521GluLeu: 7.521 ± 0.977
1.924GluMet: 1.924 ± 0.465
4.373GluAsn: 4.373 ± 0.679
1.749GluPro: 1.749 ± 0.475
3.673GluGln: 3.673 ± 0.617
2.798GluArg: 2.798 ± 0.749
3.148GluSer: 3.148 ± 0.669
2.886GluThr: 2.886 ± 0.488
3.585GluVal: 3.585 ± 0.794
1.312GluTrp: 1.312 ± 0.339
2.099GluTyr: 2.099 ± 0.345
0.0GluXaa: 0.0 ± 0.0
Phe
2.536PheAla: 2.536 ± 0.495
0.087PheCys: 0.087 ± 0.078
4.46PheAsp: 4.46 ± 0.721
3.76PheGlu: 3.76 ± 0.717
1.399PhePhe: 1.399 ± 0.397
3.323PheGly: 3.323 ± 0.598
1.137PheHis: 1.137 ± 0.388
2.536PheIle: 2.536 ± 0.666
3.498PheLys: 3.498 ± 0.635
3.061PheLeu: 3.061 ± 0.449
1.137PheMet: 1.137 ± 0.249
2.449PheAsn: 2.449 ± 0.492
1.487PhePro: 1.487 ± 0.415
1.662PheGln: 1.662 ± 0.321
1.662PheArg: 1.662 ± 0.354
1.662PheSer: 1.662 ± 0.427
2.449PheThr: 2.449 ± 0.429
2.186PheVal: 2.186 ± 0.512
0.525PheTrp: 0.525 ± 0.294
1.312PheTyr: 1.312 ± 0.352
0.0PheXaa: 0.0 ± 0.0
Gly
4.46GlyAla: 4.46 ± 1.437
0.35GlyCys: 0.35 ± 0.14
2.536GlyAsp: 2.536 ± 0.383
2.536GlyGlu: 2.536 ± 0.513
1.836GlyPhe: 1.836 ± 0.306
3.585GlyGly: 3.585 ± 0.768
1.049GlyHis: 1.049 ± 0.238
4.635GlyIle: 4.635 ± 0.947
5.684GlyLys: 5.684 ± 0.765
5.684GlyLeu: 5.684 ± 1.129
2.011GlyMet: 2.011 ± 0.632
3.498GlyAsn: 3.498 ± 0.538
1.224GlyPro: 1.224 ± 0.31
2.361GlyGln: 2.361 ± 0.53
2.186GlyArg: 2.186 ± 0.349
3.585GlySer: 3.585 ± 0.596
3.236GlyThr: 3.236 ± 0.548
4.11GlyVal: 4.11 ± 0.624
1.137GlyTrp: 1.137 ± 0.417
2.886GlyTyr: 2.886 ± 0.375
0.0GlyXaa: 0.0 ± 0.0
His
1.312HisAla: 1.312 ± 0.313
0.262HisCys: 0.262 ± 0.152
0.612HisAsp: 0.612 ± 0.229
1.487HisGlu: 1.487 ± 0.447
0.875HisPhe: 0.875 ± 0.203
1.049HisGly: 1.049 ± 0.325
0.525HisHis: 0.525 ± 0.283
0.962HisIle: 0.962 ± 0.34
1.224HisLys: 1.224 ± 0.348
1.399HisLeu: 1.399 ± 0.493
0.262HisMet: 0.262 ± 0.136
1.224HisAsn: 1.224 ± 0.352
0.7HisPro: 0.7 ± 0.231
0.262HisGln: 0.262 ± 0.139
0.437HisArg: 0.437 ± 0.172
0.7HisSer: 0.7 ± 0.255
1.049HisThr: 1.049 ± 0.375
0.962HisVal: 0.962 ± 0.21
0.087HisTrp: 0.087 ± 0.082
0.612HisTyr: 0.612 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
6.559IleAla: 6.559 ± 1.299
0.35IleCys: 0.35 ± 0.183
5.247IleAsp: 5.247 ± 0.777
6.296IleGlu: 6.296 ± 1.008
2.536IlePhe: 2.536 ± 0.394
4.285IleGly: 4.285 ± 0.705
0.962IleHis: 0.962 ± 0.319
4.81IleIle: 4.81 ± 0.67
5.772IleLys: 5.772 ± 0.815
3.848IleLeu: 3.848 ± 0.849
1.137IleMet: 1.137 ± 0.309
5.072IleAsn: 5.072 ± 0.609
1.749IlePro: 1.749 ± 0.391
2.361IleGln: 2.361 ± 0.508
2.886IleArg: 2.886 ± 0.401
6.034IleSer: 6.034 ± 0.79
3.935IleThr: 3.935 ± 0.626
4.46IleVal: 4.46 ± 0.584
0.612IleTrp: 0.612 ± 0.224
2.973IleTyr: 2.973 ± 0.578
0.0IleXaa: 0.0 ± 0.0
Lys
7.258LysAla: 7.258 ± 0.796
0.262LysCys: 0.262 ± 0.171
5.509LysAsp: 5.509 ± 0.676
6.909LysGlu: 6.909 ± 1.11
3.585LysPhe: 3.585 ± 0.535
4.373LysGly: 4.373 ± 0.521
1.399LysHis: 1.399 ± 0.39
5.16LysIle: 5.16 ± 0.821
8.133LysLys: 8.133 ± 1.137
5.684LysLeu: 5.684 ± 0.629
2.099LysMet: 2.099 ± 0.405
4.373LysAsn: 4.373 ± 0.759
2.973LysPro: 2.973 ± 0.563
3.323LysGln: 3.323 ± 0.488
3.585LysArg: 3.585 ± 0.785
4.81LysSer: 4.81 ± 0.743
5.334LysThr: 5.334 ± 0.614
5.597LysVal: 5.597 ± 0.821
1.049LysTrp: 1.049 ± 0.341
2.798LysTyr: 2.798 ± 0.767
0.0LysXaa: 0.0 ± 0.0
Leu
5.684LeuAla: 5.684 ± 0.644
0.437LeuCys: 0.437 ± 0.202
5.684LeuAsp: 5.684 ± 0.754
5.772LeuGlu: 5.772 ± 0.811
2.798LeuPhe: 2.798 ± 0.419
5.422LeuGly: 5.422 ± 0.862
1.487LeuHis: 1.487 ± 0.392
5.247LeuIle: 5.247 ± 0.939
8.57LeuLys: 8.57 ± 1.051
6.734LeuLeu: 6.734 ± 0.948
1.924LeuMet: 1.924 ± 0.403
4.285LeuAsn: 4.285 ± 0.556
2.536LeuPro: 2.536 ± 0.443
2.624LeuGln: 2.624 ± 0.372
3.411LeuArg: 3.411 ± 0.627
6.296LeuSer: 6.296 ± 0.883
6.034LeuThr: 6.034 ± 0.61
3.935LeuVal: 3.935 ± 0.54
0.525LeuTrp: 0.525 ± 0.179
2.536LeuTyr: 2.536 ± 0.555
0.0LeuXaa: 0.0 ± 0.0
Met
2.798MetAla: 2.798 ± 0.731
0.175MetCys: 0.175 ± 0.146
2.011MetAsp: 2.011 ± 0.569
1.224MetGlu: 1.224 ± 0.355
0.875MetPhe: 0.875 ± 0.258
1.137MetGly: 1.137 ± 0.43
0.262MetHis: 0.262 ± 0.151
1.662MetIle: 1.662 ± 0.431
1.574MetLys: 1.574 ± 0.387
1.924MetLeu: 1.924 ± 0.334
0.437MetMet: 0.437 ± 0.175
1.049MetAsn: 1.049 ± 0.259
0.875MetPro: 0.875 ± 0.296
1.924MetGln: 1.924 ± 0.374
1.049MetArg: 1.049 ± 0.257
2.099MetSer: 2.099 ± 0.5
2.449MetThr: 2.449 ± 0.552
1.749MetVal: 1.749 ± 0.519
0.087MetTrp: 0.087 ± 0.079
0.35MetTyr: 0.35 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
3.848AsnAla: 3.848 ± 0.557
0.175AsnCys: 0.175 ± 0.104
3.498AsnAsp: 3.498 ± 0.808
2.886AsnGlu: 2.886 ± 0.613
1.574AsnPhe: 1.574 ± 0.399
3.498AsnGly: 3.498 ± 0.523
0.875AsnHis: 0.875 ± 0.259
4.722AsnIle: 4.722 ± 0.629
3.673AsnLys: 3.673 ± 0.697
4.722AsnLeu: 4.722 ± 0.475
2.186AsnMet: 2.186 ± 0.41
3.498AsnAsn: 3.498 ± 0.722
1.836AsnPro: 1.836 ± 0.415
2.274AsnGln: 2.274 ± 0.416
2.711AsnArg: 2.711 ± 0.595
3.498AsnSer: 3.498 ± 0.583
3.498AsnThr: 3.498 ± 0.533
3.236AsnVal: 3.236 ± 0.485
1.137AsnTrp: 1.137 ± 0.287
3.673AsnTyr: 3.673 ± 0.615
0.0AsnXaa: 0.0 ± 0.0
Pro
2.624ProAla: 2.624 ± 0.436
0.087ProCys: 0.087 ± 0.073
1.836ProAsp: 1.836 ± 0.55
1.924ProGlu: 1.924 ± 0.452
1.049ProPhe: 1.049 ± 0.346
1.312ProGly: 1.312 ± 0.282
0.437ProHis: 0.437 ± 0.183
2.186ProIle: 2.186 ± 0.482
2.186ProLys: 2.186 ± 0.579
2.624ProLeu: 2.624 ± 0.516
0.525ProMet: 0.525 ± 0.182
1.836ProAsn: 1.836 ± 0.406
0.525ProPro: 0.525 ± 0.223
1.574ProGln: 1.574 ± 0.492
1.312ProArg: 1.312 ± 0.4
1.312ProSer: 1.312 ± 0.426
2.711ProThr: 2.711 ± 0.531
2.186ProVal: 2.186 ± 0.554
0.0ProTrp: 0.0 ± 0.0
1.312ProTyr: 1.312 ± 0.339
0.0ProXaa: 0.0 ± 0.0
Gln
3.585GlnAla: 3.585 ± 0.945
0.262GlnCys: 0.262 ± 0.2
1.749GlnAsp: 1.749 ± 0.414
2.624GlnGlu: 2.624 ± 0.532
2.361GlnPhe: 2.361 ± 0.483
2.449GlnGly: 2.449 ± 0.598
0.437GlnHis: 0.437 ± 0.166
3.848GlnIle: 3.848 ± 0.616
3.236GlnLys: 3.236 ± 0.618
3.848GlnLeu: 3.848 ± 0.567
1.662GlnMet: 1.662 ± 0.366
1.312GlnAsn: 1.312 ± 0.267
1.662GlnPro: 1.662 ± 0.515
2.973GlnGln: 2.973 ± 0.953
1.487GlnArg: 1.487 ± 0.34
2.886GlnSer: 2.886 ± 0.48
2.973GlnThr: 2.973 ± 0.588
1.574GlnVal: 1.574 ± 0.482
0.437GlnTrp: 0.437 ± 0.212
0.962GlnTyr: 0.962 ± 0.244
0.0GlnXaa: 0.0 ± 0.0
Arg
2.361ArgAla: 2.361 ± 0.341
0.525ArgCys: 0.525 ± 0.309
3.061ArgAsp: 3.061 ± 0.511
2.536ArgGlu: 2.536 ± 0.363
2.274ArgPhe: 2.274 ± 0.449
1.312ArgGly: 1.312 ± 0.352
1.399ArgHis: 1.399 ± 0.343
2.973ArgIle: 2.973 ± 0.652
3.673ArgLys: 3.673 ± 0.702
4.023ArgLeu: 4.023 ± 0.569
0.962ArgMet: 0.962 ± 0.325
1.924ArgAsn: 1.924 ± 0.437
1.049ArgPro: 1.049 ± 0.357
1.312ArgGln: 1.312 ± 0.323
2.711ArgArg: 2.711 ± 0.581
1.487ArgSer: 1.487 ± 0.331
2.099ArgThr: 2.099 ± 0.377
2.798ArgVal: 2.798 ± 0.49
0.35ArgTrp: 0.35 ± 0.18
1.836ArgTyr: 1.836 ± 0.387
0.0ArgXaa: 0.0 ± 0.0
Ser
5.334SerAla: 5.334 ± 1.323
0.087SerCys: 0.087 ± 0.096
2.798SerAsp: 2.798 ± 0.54
3.148SerGlu: 3.148 ± 0.505
2.711SerPhe: 2.711 ± 0.433
4.373SerGly: 4.373 ± 0.755
0.875SerHis: 0.875 ± 0.293
4.897SerIle: 4.897 ± 0.576
5.859SerLys: 5.859 ± 0.791
4.81SerLeu: 4.81 ± 0.545
1.924SerMet: 1.924 ± 0.506
3.585SerAsn: 3.585 ± 0.529
1.487SerPro: 1.487 ± 0.321
3.236SerGln: 3.236 ± 0.548
2.624SerArg: 2.624 ± 0.49
4.285SerSer: 4.285 ± 0.624
3.148SerThr: 3.148 ± 0.532
3.935SerVal: 3.935 ± 0.856
0.612SerTrp: 0.612 ± 0.26
2.624SerTyr: 2.624 ± 0.634
0.0SerXaa: 0.0 ± 0.0
Thr
5.597ThrAla: 5.597 ± 0.987
0.087ThrCys: 0.087 ± 0.085
5.16ThrAsp: 5.16 ± 0.838
3.848ThrGlu: 3.848 ± 0.613
1.924ThrPhe: 1.924 ± 0.376
3.76ThrGly: 3.76 ± 0.467
0.787ThrHis: 0.787 ± 0.222
4.198ThrIle: 4.198 ± 0.557
3.76ThrLys: 3.76 ± 0.645
6.471ThrLeu: 6.471 ± 0.666
0.962ThrMet: 0.962 ± 0.256
2.886ThrAsn: 2.886 ± 0.413
2.186ThrPro: 2.186 ± 0.567
2.186ThrGln: 2.186 ± 0.498
1.574ThrArg: 1.574 ± 0.427
3.76ThrSer: 3.76 ± 0.667
4.023ThrThr: 4.023 ± 0.788
5.072ThrVal: 5.072 ± 0.713
1.137ThrTrp: 1.137 ± 0.304
2.361ThrTyr: 2.361 ± 0.466
0.0ThrXaa: 0.0 ± 0.0
Val
5.684ValAla: 5.684 ± 1.271
0.437ValCys: 0.437 ± 0.189
4.547ValAsp: 4.547 ± 0.602
3.935ValGlu: 3.935 ± 0.803
2.886ValPhe: 2.886 ± 0.571
2.536ValGly: 2.536 ± 0.604
0.612ValHis: 0.612 ± 0.249
4.023ValIle: 4.023 ± 0.593
5.072ValLys: 5.072 ± 0.553
4.11ValLeu: 4.11 ± 0.723
1.749ValMet: 1.749 ± 0.524
3.76ValAsn: 3.76 ± 0.581
2.274ValPro: 2.274 ± 0.545
2.099ValGln: 2.099 ± 0.404
2.449ValArg: 2.449 ± 0.485
4.285ValSer: 4.285 ± 0.666
3.848ValThr: 3.848 ± 0.598
3.673ValVal: 3.673 ± 0.775
0.35ValTrp: 0.35 ± 0.16
2.186ValTyr: 2.186 ± 0.404
0.0ValXaa: 0.0 ± 0.0
Trp
1.049TrpAla: 1.049 ± 0.283
0.087TrpCys: 0.087 ± 0.094
1.224TrpAsp: 1.224 ± 0.433
0.875TrpGlu: 0.875 ± 0.274
0.437TrpPhe: 0.437 ± 0.173
0.612TrpGly: 0.612 ± 0.262
0.262TrpHis: 0.262 ± 0.135
0.35TrpIle: 0.35 ± 0.294
0.787TrpLys: 0.787 ± 0.227
1.049TrpLeu: 1.049 ± 0.301
0.262TrpMet: 0.262 ± 0.153
0.612TrpAsn: 0.612 ± 0.238
0.262TrpPro: 0.262 ± 0.15
0.7TrpGln: 0.7 ± 0.193
0.35TrpArg: 0.35 ± 0.162
0.962TrpSer: 0.962 ± 0.279
0.35TrpThr: 0.35 ± 0.171
0.612TrpVal: 0.612 ± 0.246
0.087TrpTrp: 0.087 ± 0.096
0.35TrpTyr: 0.35 ± 0.162
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.624TyrAla: 2.624 ± 0.462
0.35TyrCys: 0.35 ± 0.181
2.274TyrAsp: 2.274 ± 0.479
2.624TyrGlu: 2.624 ± 0.544
1.924TyrPhe: 1.924 ± 0.444
2.449TyrGly: 2.449 ± 0.418
0.262TyrHis: 0.262 ± 0.133
2.449TyrIle: 2.449 ± 0.466
3.848TyrLys: 3.848 ± 0.724
2.886TyrLeu: 2.886 ± 0.617
0.437TyrMet: 0.437 ± 0.218
1.924TyrAsn: 1.924 ± 0.454
1.312TyrPro: 1.312 ± 0.351
2.886TyrGln: 2.886 ± 0.633
1.662TyrArg: 1.662 ± 0.355
2.536TyrSer: 2.536 ± 0.465
2.536TyrThr: 2.536 ± 0.546
1.749TyrVal: 1.749 ± 0.394
0.35TyrTrp: 0.35 ± 0.19
2.361TyrTyr: 2.361 ± 0.618
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11436 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski