Amino acid dipeptide frequency for Salmonella phage SE1 (in:P22virus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins and all amino acids were equally frequent, each dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
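The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the pipeline used by the database: the function names, the one-letter input alphabet, and the normalisation by the total number of overlapping pairs are all hypothetical choices, and the uniform 1/400 baseline is the one named in the text.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa is left out of this sketch).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform baseline from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    # Assumption: normalise by the total number of observed pairs.
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(STANDARD, repeat=2)
    }


def classify(per_mille):
    """Label a frequency relative to the uniform 2.5 per mille baseline."""
    if per_mille > EXPECTED_PER_MILLE:
        return "over"
    if per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Toy input, not real phage proteins.
freqs = dipeptide_per_mille(["AAC", "AC"])
```

With the toy input there are three pairs in total (AA, AC, AC), so AC accounts for two thirds of them. Applied to table values, `classify(9.426)` would label AlaAla as over-represented and `classify(0.079)` would label ProCys as under-represented, assuming the colouring uses the uniform baseline.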

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
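As a small worked example of the unit conversion, the following Python sketch turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical; the two input values (AlaAla and ProCys) are taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 9.426   # AlaAla, per mille
pro_cys = 0.079   # ProCys, per mille

ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)
pro_cys_percent = per_mille_to_percent(pro_cys)
```

AlaAla therefore sits at roughly 0.94%, well above the 0.25% uniform expectation, while ProCys is at roughly 0.008%, far below it.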

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.426 ± 1.504
AlaCys: 1.505 ± 0.398
AlaAsp: 5.069 ± 0.727
AlaGlu: 7.208 ± 0.887
AlaPhe: 3.248 ± 0.554
AlaGly: 6.257 ± 0.75
AlaHis: 1.188 ± 0.389
AlaIle: 6.02 ± 0.88
AlaLys: 4.515 ± 0.6
AlaLeu: 8.0 ± 1.072
AlaMet: 3.564 ± 0.715
AlaAsn: 6.416 ± 0.965
AlaPro: 2.059 ± 0.346
AlaGln: 3.327 ± 0.647
AlaArg: 5.069 ± 0.598
AlaSer: 4.356 ± 0.551
AlaThr: 5.941 ± 0.587
AlaVal: 5.307 ± 0.462
AlaTrp: 1.505 ± 0.363
AlaTyr: 2.059 ± 0.456
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.792 ± 0.274
CysCys: 0.554 ± 0.194
CysAsp: 0.475 ± 0.19
CysGlu: 0.713 ± 0.227
CysPhe: 0.792 ± 0.27
CysGly: 1.109 ± 0.281
CysHis: 0.317 ± 0.157
CysIle: 0.713 ± 0.223
CysLys: 1.03 ± 0.366
CysLeu: 0.475 ± 0.213
CysMet: 0.238 ± 0.148
CysAsn: 0.554 ± 0.208
CysPro: 0.554 ± 0.21
CysGln: 0.554 ± 0.216
CysArg: 1.347 ± 0.353
CysSer: 0.95 ± 0.303
CysThr: 0.317 ± 0.203
CysVal: 1.109 ± 0.328
CysTrp: 0.158 ± 0.108
CysTyr: 0.634 ± 0.2
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.574 ± 0.853
AspCys: 0.554 ± 0.202
AspAsp: 4.515 ± 0.557
AspGlu: 3.644 ± 0.57
AspPhe: 2.535 ± 0.401
AspGly: 5.465 ± 0.948
AspHis: 1.109 ± 0.38
AspIle: 3.644 ± 0.543
AspLys: 3.485 ± 0.527
AspLeu: 4.436 ± 0.535
AspMet: 1.347 ± 0.339
AspAsn: 2.218 ± 0.442
AspPro: 1.505 ± 0.347
AspGln: 1.584 ± 0.305
AspArg: 2.455 ± 0.395
AspSer: 3.485 ± 0.549
AspThr: 2.376 ± 0.367
AspVal: 5.069 ± 0.686
AspTrp: 0.95 ± 0.331
AspTyr: 2.535 ± 0.399
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.386 ± 0.692
GluCys: 1.267 ± 0.26
GluAsp: 2.851 ± 0.495
GluGlu: 3.564 ± 0.676
GluPhe: 1.822 ± 0.453
GluGly: 2.931 ± 0.423
GluHis: 1.03 ± 0.303
GluIle: 3.723 ± 0.563
GluLys: 3.723 ± 0.601
GluLeu: 5.624 ± 0.637
GluMet: 2.535 ± 0.481
GluAsn: 3.01 ± 0.483
GluPro: 2.218 ± 0.348
GluGln: 4.04 ± 0.564
GluArg: 4.752 ± 0.725
GluSer: 3.96 ± 0.464
GluThr: 3.723 ± 0.482
GluVal: 3.644 ± 0.59
GluTrp: 1.822 ± 0.405
GluTyr: 1.743 ± 0.464
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.614 ± 0.426
PheCys: 0.634 ± 0.234
PheAsp: 2.614 ± 0.435
PheGlu: 2.455 ± 0.463
PhePhe: 2.059 ± 0.598
PheGly: 1.98 ± 0.35
PheHis: 0.634 ± 0.228
PheIle: 2.772 ± 0.545
PheLys: 1.822 ± 0.397
PheLeu: 2.693 ± 0.484
PheMet: 1.109 ± 0.292
PheAsn: 2.297 ± 0.39
PhePro: 1.267 ± 0.237
PheGln: 1.109 ± 0.36
PheArg: 1.743 ± 0.317
PheSer: 2.693 ± 0.479
PheThr: 2.139 ± 0.39
PheVal: 1.663 ± 0.299
PheTrp: 0.634 ± 0.214
PheTyr: 1.743 ± 0.388
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.624 ± 0.882
GlyCys: 0.634 ± 0.235
GlyAsp: 3.564 ± 0.488
GlyGlu: 4.436 ± 0.523
GlyPhe: 2.693 ± 0.467
GlyGly: 5.149 ± 0.816
GlyHis: 1.03 ± 0.269
GlyIle: 4.356 ± 0.603
GlyLys: 5.149 ± 0.646
GlyLeu: 4.594 ± 0.731
GlyMet: 2.772 ± 0.432
GlyAsn: 3.168 ± 0.373
GlyPro: 1.109 ± 0.25
GlyGln: 3.485 ± 0.643
GlyArg: 4.436 ± 0.632
GlySer: 3.723 ± 0.71
GlyThr: 3.644 ± 0.587
GlyVal: 5.782 ± 0.779
GlyTrp: 1.584 ± 0.388
GlyTyr: 2.139 ± 0.35
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.871 ± 0.217
HisCys: 0.554 ± 0.234
HisAsp: 1.109 ± 0.326
HisGlu: 1.584 ± 0.451
HisPhe: 0.554 ± 0.21
HisGly: 1.347 ± 0.39
HisHis: 0.238 ± 0.156
HisIle: 0.634 ± 0.216
HisLys: 1.03 ± 0.31
HisLeu: 1.822 ± 0.557
HisMet: 0.554 ± 0.186
HisAsn: 0.475 ± 0.186
HisPro: 0.713 ± 0.246
HisGln: 0.95 ± 0.241
HisArg: 1.347 ± 0.288
HisSer: 1.426 ± 0.292
HisThr: 0.713 ± 0.27
HisVal: 0.871 ± 0.263
HisTrp: 0.238 ± 0.123
HisTyr: 0.634 ± 0.23
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.941 ± 0.759
IleCys: 0.871 ± 0.258
IleAsp: 3.485 ± 0.487
IleGlu: 4.752 ± 0.619
IlePhe: 2.059 ± 0.453
IleGly: 4.198 ± 0.546
IleHis: 1.347 ± 0.298
IleIle: 4.277 ± 0.945
IleLys: 3.406 ± 0.647
IleLeu: 3.485 ± 0.707
IleMet: 0.713 ± 0.224
IleAsn: 3.327 ± 0.57
IlePro: 2.535 ± 0.511
IleGln: 2.059 ± 0.39
IleArg: 3.881 ± 0.571
IleSer: 5.228 ± 0.702
IleThr: 4.673 ± 0.573
IleVal: 2.931 ± 0.408
IleTrp: 0.554 ± 0.177
IleTyr: 2.059 ± 0.438
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.02 ± 1.075
LysCys: 0.792 ± 0.254
LysAsp: 3.723 ± 0.61
LysGlu: 3.485 ± 0.55
LysPhe: 1.505 ± 0.36
LysGly: 4.515 ± 0.539
LysHis: 0.554 ± 0.236
LysIle: 3.327 ± 0.469
LysLys: 3.881 ± 0.563
LysLeu: 5.307 ± 0.595
LysMet: 1.98 ± 0.426
LysAsn: 2.297 ± 0.415
LysPro: 3.644 ± 0.486
LysGln: 3.881 ± 0.644
LysArg: 3.723 ± 0.576
LysSer: 4.119 ± 0.557
LysThr: 3.248 ± 0.433
LysVal: 3.01 ± 0.514
LysTrp: 0.713 ± 0.284
LysTyr: 1.98 ± 0.375
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.287 ± 0.748
LeuCys: 0.95 ± 0.267
LeuAsp: 4.356 ± 0.608
LeuGlu: 4.515 ± 0.492
LeuPhe: 3.089 ± 0.59
LeuGly: 4.356 ± 0.697
LeuHis: 1.505 ± 0.359
LeuIle: 5.228 ± 0.828
LeuLys: 5.307 ± 0.643
LeuLeu: 6.02 ± 0.702
LeuMet: 2.218 ± 0.377
LeuAsn: 4.673 ± 0.628
LeuPro: 3.723 ± 0.527
LeuGln: 2.297 ± 0.45
LeuArg: 4.832 ± 0.595
LeuSer: 6.02 ± 0.596
LeuThr: 4.911 ± 0.523
LeuVal: 4.04 ± 0.401
LeuTrp: 0.95 ± 0.304
LeuTyr: 2.614 ± 0.465
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.168 ± 0.489
MetCys: 0.158 ± 0.113
MetAsp: 1.109 ± 0.374
MetGlu: 1.743 ± 0.344
MetPhe: 0.792 ± 0.256
MetGly: 2.059 ± 0.456
MetHis: 0.396 ± 0.197
MetIle: 1.267 ± 0.308
MetLys: 1.822 ± 0.384
MetLeu: 2.218 ± 0.409
MetMet: 0.792 ± 0.25
MetAsn: 1.347 ± 0.413
MetPro: 1.347 ± 0.287
MetGln: 1.109 ± 0.389
MetArg: 1.98 ± 0.442
MetSer: 2.535 ± 0.404
MetThr: 2.059 ± 0.47
MetVal: 1.584 ± 0.313
MetTrp: 0.238 ± 0.135
MetTyr: 0.95 ± 0.3
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.911 ± 0.812
AsnCys: 0.554 ± 0.219
AsnAsp: 2.931 ± 0.423
AsnGlu: 2.851 ± 0.44
AsnPhe: 1.188 ± 0.288
AsnGly: 4.119 ± 0.65
AsnHis: 1.109 ± 0.319
AsnIle: 2.851 ± 0.383
AsnLys: 2.772 ± 0.592
AsnLeu: 3.327 ± 0.618
AsnMet: 1.347 ± 0.288
AsnAsn: 2.297 ± 0.493
AsnPro: 2.614 ± 0.417
AsnGln: 3.089 ± 0.541
AsnArg: 2.376 ± 0.445
AsnSer: 1.98 ± 0.403
AsnThr: 3.168 ± 0.554
AsnVal: 3.248 ± 0.666
AsnTrp: 0.396 ± 0.182
AsnTyr: 0.95 ± 0.225
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.01 ± 0.404
ProCys: 0.079 ± 0.085
ProAsp: 3.089 ± 0.478
ProGlu: 4.515 ± 0.604
ProPhe: 1.188 ± 0.288
ProGly: 3.168 ± 0.507
ProHis: 0.792 ± 0.291
ProIle: 2.614 ± 0.546
ProLys: 2.772 ± 0.505
ProLeu: 2.772 ± 0.493
ProMet: 0.713 ± 0.199
ProAsn: 1.426 ± 0.336
ProPro: 1.267 ± 0.359
ProGln: 1.347 ± 0.287
ProArg: 1.505 ± 0.317
ProSer: 2.535 ± 0.391
ProThr: 1.98 ± 0.369
ProVal: 3.248 ± 0.468
ProTrp: 0.396 ± 0.172
ProTyr: 1.188 ± 0.304
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.752 ± 0.952
GlnCys: 0.634 ± 0.227
GlnAsp: 2.614 ± 0.455
GlnGlu: 1.743 ± 0.45
GlnPhe: 1.347 ± 0.276
GlnGly: 3.248 ± 0.54
GlnHis: 0.792 ± 0.267
GlnIle: 3.327 ± 0.45
GlnLys: 2.693 ± 0.503
GlnLeu: 3.881 ± 0.52
GlnMet: 1.505 ± 0.323
GlnAsn: 1.743 ± 0.698
GlnPro: 1.663 ± 0.363
GlnGln: 3.406 ± 0.755
GlnArg: 3.01 ± 0.545
GlnSer: 2.931 ± 0.539
GlnThr: 1.505 ± 0.347
GlnVal: 1.267 ± 0.388
GlnTrp: 1.188 ± 0.309
GlnTyr: 1.822 ± 0.412
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.356 ± 0.49
ArgCys: 0.317 ± 0.173
ArgAsp: 3.881 ± 0.624
ArgGlu: 4.515 ± 0.717
ArgPhe: 2.139 ± 0.51
ArgGly: 3.089 ± 0.532
ArgHis: 1.426 ± 0.304
ArgIle: 4.277 ± 0.547
ArgLys: 4.04 ± 0.757
ArgLeu: 5.307 ± 0.632
ArgMet: 1.901 ± 0.432
ArgAsn: 3.248 ± 0.478
ArgPro: 2.455 ± 0.388
ArgGln: 2.376 ± 0.525
ArgArg: 3.564 ± 0.703
ArgSer: 3.802 ± 0.466
ArgThr: 2.297 ± 0.425
ArgVal: 2.535 ± 0.482
ArgTrp: 1.188 ± 0.406
ArgTyr: 2.297 ± 0.447
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.941 ± 0.952
SerCys: 1.03 ± 0.418
SerAsp: 3.96 ± 0.588
SerGlu: 2.931 ± 0.433
SerPhe: 2.931 ± 0.482
SerGly: 4.673 ± 0.783
SerHis: 1.426 ± 0.413
SerIle: 3.644 ± 0.556
SerLys: 3.96 ± 0.462
SerLeu: 5.624 ± 0.838
SerMet: 1.743 ± 0.302
SerAsn: 2.772 ± 0.481
SerPro: 3.406 ± 0.576
SerGln: 3.168 ± 0.48
SerArg: 3.881 ± 0.461
SerSer: 4.04 ± 0.87
SerThr: 3.089 ± 0.506
SerVal: 3.406 ± 0.577
SerTrp: 0.871 ± 0.255
SerTyr: 1.743 ± 0.442
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.941 ± 0.702
ThrCys: 0.475 ± 0.174
ThrAsp: 3.01 ± 0.62
ThrGlu: 2.931 ± 0.642
ThrPhe: 2.139 ± 0.437
ThrGly: 4.594 ± 0.584
ThrHis: 0.871 ± 0.22
ThrIle: 3.327 ± 0.503
ThrLys: 3.485 ± 0.635
ThrLeu: 4.198 ± 0.581
ThrMet: 1.188 ± 0.304
ThrAsn: 2.297 ± 0.436
ThrPro: 3.327 ± 0.444
ThrGln: 1.822 ± 0.317
ThrArg: 2.376 ± 0.448
ThrSer: 3.089 ± 0.55
ThrThr: 3.089 ± 0.471
ThrVal: 3.485 ± 0.522
ThrTrp: 0.554 ± 0.219
ThrTyr: 1.822 ± 0.385
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.386 ± 0.566
ValCys: 0.792 ± 0.239
ValAsp: 3.089 ± 0.447
ValGlu: 3.406 ± 0.526
ValPhe: 2.614 ± 0.405
ValGly: 3.881 ± 0.521
ValHis: 0.634 ± 0.187
ValIle: 3.327 ± 0.59
ValLys: 4.04 ± 0.535
ValLeu: 4.99 ± 0.484
ValMet: 1.188 ± 0.371
ValAsn: 3.089 ± 0.504
ValPro: 2.139 ± 0.389
ValGln: 2.614 ± 0.481
ValArg: 3.248 ± 0.451
ValSer: 4.436 ± 0.678
ValThr: 3.01 ± 0.594
ValVal: 4.04 ± 0.504
ValTrp: 0.871 ± 0.267
ValTyr: 2.139 ± 0.513
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.03 ± 0.322
TrpCys: 0.396 ± 0.165
TrpAsp: 1.347 ± 0.337
TrpGlu: 0.713 ± 0.214
TrpPhe: 0.554 ± 0.206
TrpGly: 1.109 ± 0.265
TrpHis: 0.634 ± 0.225
TrpIle: 0.396 ± 0.175
TrpLys: 1.267 ± 0.335
TrpLeu: 1.584 ± 0.516
TrpMet: 0.634 ± 0.204
TrpAsn: 0.238 ± 0.127
TrpPro: 0.634 ± 0.229
TrpGln: 1.03 ± 0.226
TrpArg: 1.109 ± 0.335
TrpSer: 0.713 ± 0.23
TrpThr: 0.713 ± 0.246
TrpVal: 1.109 ± 0.258
TrpTrp: 0.396 ± 0.183
TrpTyr: 0.396 ± 0.174
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.01 ± 0.561
TyrCys: 0.634 ± 0.235
TyrAsp: 2.693 ± 0.609
TyrGlu: 1.663 ± 0.483
TyrPhe: 1.426 ± 0.387
TyrGly: 1.822 ± 0.404
TyrHis: 0.634 ± 0.259
TyrIle: 2.059 ± 0.502
TyrLys: 1.505 ± 0.336
TyrLeu: 2.614 ± 0.374
TyrMet: 0.554 ± 0.225
TyrAsn: 1.188 ± 0.319
TyrPro: 1.426 ± 0.411
TyrGln: 1.822 ± 0.413
TyrArg: 2.376 ± 0.501
TyrSer: 2.218 ± 0.381
TyrThr: 1.426 ± 0.266
TyrVal: 1.584 ± 0.35
TyrTrp: 0.792 ± 0.226
TyrTyr: 0.871 ± 0.251
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (12626 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
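A protein-level bootstrap of this kind might look like the sketch below. It is an illustration only: the function names are hypothetical, and the choice of the sample standard deviation of the replicates as the reported error is an assumption, since the page does not state which spread measure is used. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is returned.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. "AA") in per mille over all proteins."""
    count = 0
    total = 0
    for seq in proteins:
        total += max(len(seq) - 1, 0)  # number of overlapping pairs in seq
        count += sum(1 for a, b in zip(seq, seq[1:]) if a + b == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: report the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["AAC", "ACDA", "CCAA", "AAAA", "DCA"]
error = bootstrap_error(toy, "AA")
```

Resampling at the protein level rather than at the residue level keeps each protein's internal composition intact, so the error reflects variation between proteins in the proteome.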

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski