Amino acid dipeptide frequency for Proteus phage vB_PmiP_RS10pmA

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
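As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues in a list of protein sequences and reports them in per mille. The function name, the one-letter sequence encoding, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    `proteins` is an iterable of one-letter amino acid strings.
    Pairs are never counted across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # A protein of length n contributes n - 1 overlapping pairs.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1

    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    toy_proteome = ["AAC", "AC"]  # hypothetical toy input
    for pair, value in sorted(dipeptide_frequencies(toy_proteome).items()):
        print(f"{pair}: {value:.3f} per mille")
```

Non-standard or unknown residues would simply appear as extra symbols (for example X) in the resulting dictionary.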

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
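The uniform expectation and the per-mille units can be checked with a few lines of arithmetic. The sketch below assumes that "more than expected" simply means a value above 1/400 (2.5‰); the exact threshold used for the colouring is not stated here, and the helper names are hypothetical.

```python
N_STANDARD = 20                                # standard amino acids
N_DIPEPTIDES = N_STANDARD ** 2                 # 400 ordered pairs
EXPECTED_PER_MILLE = 1000 / N_DIPEPTIDES       # 2.5 per mille = 0.25%


def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per-mille value relative to the uniform expectation.

    Assumption: a simple comparison against 2.5 per mille.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Values taken from the table: GluLeu and TrpPro.
    for name, value in [("GluLeu", 7.972), ("TrpPro", 0.0)]:
        print(name, per_mille_to_fraction(value), classify(value))
```

With these assumptions, GluLeu (7.972‰) falls well above the uniform expectation, while TrpPro (0.0‰) is not observed at all.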

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.826 ± 0.522
AlaCys: 0.548 ± 0.208
AlaAsp: 3.347 ± 0.417
AlaGlu: 5.233 ± 0.752
AlaPhe: 2.495 ± 0.416
AlaGly: 4.26 ± 0.595
AlaHis: 1.095 ± 0.283
AlaIle: 4.564 ± 0.672
AlaLys: 4.747 ± 0.744
AlaLeu: 5.538 ± 0.461
AlaMet: 3.104 ± 0.444
AlaAsn: 3.712 ± 0.837
AlaPro: 1.765 ± 0.266
AlaGln: 2.434 ± 0.509
AlaArg: 2.434 ± 0.428
AlaSer: 3.895 ± 0.667
AlaThr: 3.408 ± 0.564
AlaVal: 4.625 ± 0.557
AlaTrp: 0.791 ± 0.207
AlaTyr: 1.46 ± 0.282
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.487 ± 0.158
CysCys: 0.487 ± 0.198
CysAsp: 1.035 ± 0.236
CysGlu: 1.643 ± 0.268
CysPhe: 0.609 ± 0.184
CysGly: 0.852 ± 0.233
CysHis: 0.0 ± 0.0
CysIle: 0.487 ± 0.164
CysLys: 1.4 ± 0.32
CysLeu: 1.095 ± 0.202
CysMet: 0.487 ± 0.169
CysAsn: 0.669 ± 0.184
CysPro: 0.487 ± 0.212
CysGln: 0.487 ± 0.154
CysArg: 0.609 ± 0.234
CysSer: 0.73 ± 0.22
CysThr: 0.669 ± 0.19
CysVal: 0.669 ± 0.203
CysTrp: 0.243 ± 0.112
CysTyr: 0.913 ± 0.28
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.077 ± 0.501
AspCys: 1.217 ± 0.262
AspAsp: 4.381 ± 0.477
AspGlu: 4.442 ± 0.494
AspPhe: 1.704 ± 0.341
AspGly: 5.173 ± 0.533
AspHis: 0.852 ± 0.196
AspIle: 4.321 ± 0.544
AspLys: 4.625 ± 0.464
AspLeu: 4.138 ± 0.653
AspMet: 1.521 ± 0.309
AspAsn: 4.199 ± 0.612
AspPro: 2.373 ± 0.389
AspGln: 1.886 ± 0.33
AspArg: 2.617 ± 0.333
AspSer: 3.225 ± 0.403
AspThr: 2.86 ± 0.427
AspVal: 4.26 ± 0.495
AspTrp: 1.095 ± 0.318
AspTyr: 3.529 ± 0.462
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.051 ± 0.63
GluCys: 1.339 ± 0.25
GluAsp: 2.86 ± 0.371
GluGlu: 4.381 ± 0.502
GluPhe: 2.799 ± 0.502
GluGly: 3.164 ± 0.376
GluHis: 1.035 ± 0.252
GluIle: 5.233 ± 0.431
GluLys: 4.99 ± 0.586
GluLeu: 7.972 ± 0.827
GluMet: 2.252 ± 0.442
GluAsn: 3.834 ± 0.446
GluPro: 2.434 ± 0.493
GluGln: 3.347 ± 0.523
GluArg: 4.199 ± 0.522
GluSer: 3.955 ± 0.585
GluThr: 4.138 ± 0.493
GluVal: 2.982 ± 0.607
GluTrp: 1.278 ± 0.236
GluTyr: 2.86 ± 0.432
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.886 ± 0.399
PheCys: 0.487 ± 0.183
PheAsp: 2.617 ± 0.391
PheGlu: 2.982 ± 0.49
PhePhe: 0.974 ± 0.282
PheGly: 2.86 ± 0.443
PheHis: 0.548 ± 0.191
PheIle: 3.59 ± 0.479
PheLys: 4.321 ± 0.488
PheLeu: 2.373 ± 0.347
PheMet: 0.852 ± 0.229
PheAsn: 3.347 ± 0.498
PhePro: 1.886 ± 0.38
PheGln: 1.035 ± 0.207
PheArg: 0.852 ± 0.197
PheSer: 2.069 ± 0.336
PheThr: 2.373 ± 0.479
PheVal: 2.252 ± 0.375
PheTrp: 0.365 ± 0.153
PheTyr: 2.13 ± 0.397
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.408 ± 0.48
GlyCys: 0.913 ± 0.301
GlyAsp: 4.807 ± 0.508
GlyGlu: 4.503 ± 0.471
GlyPhe: 3.834 ± 0.454
GlyGly: 3.651 ± 0.628
GlyHis: 1.095 ± 0.25
GlyIle: 3.955 ± 0.501
GlyLys: 4.807 ± 0.643
GlyLeu: 4.99 ± 0.49
GlyMet: 1.643 ± 0.29
GlyAsn: 3.895 ± 0.495
GlyPro: 1.217 ± 0.253
GlyGln: 2.678 ± 0.384
GlyArg: 3.225 ± 0.411
GlySer: 3.712 ± 0.484
GlyThr: 3.529 ± 0.473
GlyVal: 5.355 ± 0.55
GlyTrp: 1.156 ± 0.293
GlyTyr: 2.434 ± 0.39
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.791 ± 0.195
HisCys: 0.365 ± 0.144
HisAsp: 1.035 ± 0.257
HisGlu: 0.73 ± 0.293
HisPhe: 0.913 ± 0.226
HisGly: 1.339 ± 0.348
HisHis: 0.609 ± 0.167
HisIle: 0.913 ± 0.229
HisLys: 1.156 ± 0.291
HisLeu: 1.46 ± 0.318
HisMet: 0.304 ± 0.126
HisAsn: 1.095 ± 0.212
HisPro: 0.974 ± 0.21
HisGln: 0.73 ± 0.223
HisArg: 0.73 ± 0.221
HisSer: 1.278 ± 0.268
HisThr: 0.548 ± 0.207
HisVal: 1.339 ± 0.297
HisTrp: 0.426 ± 0.147
HisTyr: 0.669 ± 0.24
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.538 ± 0.557
IleCys: 1.095 ± 0.261
IleAsp: 6.085 ± 0.591
IleGlu: 4.99 ± 0.533
IlePhe: 1.765 ± 0.315
IleGly: 4.868 ± 0.628
IleHis: 1.521 ± 0.292
IleIle: 4.381 ± 0.56
IleLys: 5.477 ± 0.61
IleLeu: 4.26 ± 0.445
IleMet: 1.704 ± 0.291
IleAsn: 5.294 ± 0.598
IlePro: 2.678 ± 0.405
IleGln: 1.886 ± 0.3
IleArg: 3.529 ± 0.462
IleSer: 4.503 ± 0.584
IleThr: 4.625 ± 0.471
IleVal: 4.26 ± 0.59
IleTrp: 0.426 ± 0.132
IleTyr: 2.982 ± 0.458
IleXaa: 0.061 ± 0.051
Lys
LysAla: 5.173 ± 0.659
LysCys: 0.913 ± 0.293
LysAsp: 3.955 ± 0.51
LysGlu: 4.807 ± 0.528
LysPhe: 3.043 ± 0.504
LysGly: 2.982 ± 0.46
LysHis: 1.521 ± 0.392
LysIle: 4.686 ± 0.398
LysLys: 3.712 ± 0.43
LysLeu: 6.268 ± 0.572
LysMet: 3.469 ± 0.42
LysAsn: 3.59 ± 0.488
LysPro: 3.408 ± 0.412
LysGln: 2.982 ± 0.585
LysArg: 3.59 ± 0.501
LysSer: 4.503 ± 0.52
LysThr: 4.077 ± 0.544
LysVal: 4.564 ± 0.629
LysTrp: 1.339 ± 0.324
LysTyr: 2.799 ± 0.343
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.807 ± 0.583
LeuCys: 1.339 ± 0.275
LeuAsp: 4.747 ± 0.469
LeuGlu: 7.181 ± 0.63
LeuPhe: 2.738 ± 0.372
LeuGly: 3.895 ± 0.536
LeuHis: 1.217 ± 0.261
LeuIle: 5.477 ± 0.731
LeuLys: 6.998 ± 0.744
LeuLeu: 5.355 ± 0.677
LeuMet: 2.312 ± 0.389
LeuAsn: 5.842 ± 0.717
LeuPro: 3.955 ± 0.464
LeuGln: 2.191 ± 0.438
LeuArg: 3.286 ± 0.531
LeuSer: 4.686 ± 0.56
LeuThr: 5.598 ± 0.607
LeuVal: 4.26 ± 0.554
LeuTrp: 0.609 ± 0.207
LeuTyr: 2.678 ± 0.44
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.643 ± 0.288
MetCys: 0.304 ± 0.142
MetAsp: 1.217 ± 0.232
MetGlu: 0.852 ± 0.183
MetPhe: 1.095 ± 0.296
MetGly: 1.217 ± 0.262
MetHis: 0.365 ± 0.149
MetIle: 2.556 ± 0.462
MetLys: 2.069 ± 0.331
MetLeu: 2.556 ± 0.346
MetMet: 1.278 ± 0.305
MetAsn: 1.886 ± 0.372
MetPro: 0.669 ± 0.171
MetGln: 1.339 ± 0.222
MetArg: 1.217 ± 0.237
MetSer: 2.373 ± 0.32
MetThr: 2.13 ± 0.303
MetVal: 1.826 ± 0.38
MetTrp: 0.548 ± 0.162
MetTyr: 1.156 ± 0.316
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.773 ± 0.497
AsnCys: 0.974 ± 0.231
AsnAsp: 3.408 ± 0.435
AsnGlu: 3.225 ± 0.485
AsnPhe: 1.947 ± 0.323
AsnGly: 5.477 ± 0.622
AsnHis: 1.339 ± 0.255
AsnIle: 5.781 ± 0.687
AsnLys: 4.077 ± 0.564
AsnLeu: 4.199 ± 0.476
AsnMet: 1.217 ± 0.254
AsnAsn: 4.99 ± 0.634
AsnPro: 2.13 ± 0.389
AsnGln: 2.678 ± 0.504
AsnArg: 2.982 ± 0.48
AsnSer: 3.286 ± 0.476
AsnThr: 3.895 ± 0.466
AsnVal: 3.529 ± 0.431
AsnTrp: 1.217 ± 0.294
AsnTyr: 2.312 ± 0.388
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.886 ± 0.392
ProCys: 0.609 ± 0.175
ProAsp: 2.556 ± 0.479
ProGlu: 3.043 ± 0.428
ProPhe: 1.826 ± 0.363
ProGly: 2.13 ± 0.311
ProHis: 0.548 ± 0.162
ProIle: 2.252 ± 0.498
ProLys: 1.765 ± 0.39
ProLeu: 2.252 ± 0.432
ProMet: 0.73 ± 0.181
ProAsn: 2.678 ± 0.427
ProPro: 0.73 ± 0.172
ProGln: 0.791 ± 0.221
ProArg: 1.582 ± 0.242
ProSer: 2.312 ± 0.344
ProThr: 2.373 ± 0.384
ProVal: 3.043 ± 0.466
ProTrp: 0.487 ± 0.151
ProTyr: 1.765 ± 0.249
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.556 ± 0.567
GlnCys: 0.669 ± 0.178
GlnAsp: 1.46 ± 0.234
GlnGlu: 2.13 ± 0.479
GlnPhe: 1.582 ± 0.268
GlnGly: 1.46 ± 0.279
GlnHis: 0.365 ± 0.146
GlnIle: 2.252 ± 0.366
GlnLys: 1.826 ± 0.495
GlnLeu: 3.955 ± 0.866
GlnMet: 0.974 ± 0.213
GlnAsn: 1.886 ± 0.334
GlnPro: 1.278 ± 0.238
GlnGln: 2.556 ± 0.715
GlnArg: 1.826 ± 0.275
GlnSer: 2.982 ± 0.399
GlnThr: 1.886 ± 0.303
GlnVal: 2.252 ± 0.356
GlnTrp: 0.669 ± 0.172
GlnTyr: 1.46 ± 0.292
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.225 ± 0.4
ArgCys: 0.548 ± 0.188
ArgAsp: 3.164 ± 0.517
ArgGlu: 2.738 ± 0.359
ArgPhe: 2.191 ± 0.311
ArgGly: 3.104 ± 0.353
ArgHis: 0.913 ± 0.219
ArgIle: 3.529 ± 0.491
ArgLys: 2.86 ± 0.339
ArgLeu: 3.895 ± 0.411
ArgMet: 1.095 ± 0.251
ArgAsn: 3.043 ± 0.392
ArgPro: 1.46 ± 0.263
ArgGln: 1.643 ± 0.394
ArgArg: 2.434 ± 0.345
ArgSer: 2.252 ± 0.302
ArgThr: 2.434 ± 0.38
ArgVal: 3.469 ± 0.407
ArgTrp: 0.487 ± 0.201
ArgTyr: 2.312 ± 0.417
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.564 ± 0.742
SerCys: 0.426 ± 0.157
SerAsp: 4.26 ± 0.454
SerGlu: 3.834 ± 0.411
SerPhe: 2.495 ± 0.445
SerGly: 4.442 ± 0.492
SerHis: 1.095 ± 0.253
SerIle: 4.138 ± 0.439
SerLys: 4.564 ± 0.562
SerLeu: 5.538 ± 0.595
SerMet: 1.156 ± 0.239
SerAsn: 3.347 ± 0.477
SerPro: 1.886 ± 0.278
SerGln: 1.947 ± 0.355
SerArg: 2.738 ± 0.415
SerSer: 3.955 ± 0.483
SerThr: 2.921 ± 0.458
SerVal: 5.294 ± 0.565
SerTrp: 1.035 ± 0.173
SerTyr: 1.643 ± 0.353
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.043 ± 0.394
ThrCys: 0.791 ± 0.217
ThrAsp: 3.286 ± 0.354
ThrGlu: 3.529 ± 0.445
ThrPhe: 1.886 ± 0.335
ThrGly: 6.572 ± 0.604
ThrHis: 1.035 ± 0.222
ThrIle: 4.868 ± 0.53
ThrLys: 3.651 ± 0.44
ThrLeu: 5.294 ± 0.525
ThrMet: 1.521 ± 0.274
ThrAsn: 2.373 ± 0.347
ThrPro: 2.556 ± 0.481
ThrGln: 2.13 ± 0.379
ThrArg: 2.312 ± 0.359
ThrSer: 3.164 ± 0.472
ThrThr: 2.495 ± 0.426
ThrVal: 4.321 ± 0.651
ThrTrp: 0.609 ± 0.201
ThrTyr: 2.069 ± 0.347
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.199 ± 0.607
ValCys: 0.304 ± 0.156
ValAsp: 3.895 ± 0.412
ValGlu: 5.173 ± 0.493
ValPhe: 2.921 ± 0.41
ValGly: 4.26 ± 0.481
ValHis: 0.852 ± 0.212
ValIle: 5.416 ± 0.496
ValLys: 4.625 ± 0.553
ValLeu: 4.199 ± 0.518
ValMet: 1.4 ± 0.267
ValAsn: 4.138 ± 0.542
ValPro: 2.008 ± 0.33
ValGln: 1.46 ± 0.248
ValArg: 3.164 ± 0.48
ValSer: 4.868 ± 0.526
ValThr: 4.625 ± 0.67
ValVal: 3.104 ± 0.446
ValTrp: 0.852 ± 0.225
ValTyr: 3.104 ± 0.512
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.913 ± 0.183
TrpCys: 0.183 ± 0.094
TrpAsp: 1.339 ± 0.322
TrpGlu: 1.217 ± 0.299
TrpPhe: 0.852 ± 0.246
TrpGly: 0.487 ± 0.151
TrpHis: 0.243 ± 0.134
TrpIle: 0.73 ± 0.185
TrpLys: 1.095 ± 0.232
TrpLeu: 1.339 ± 0.307
TrpMet: 0.183 ± 0.115
TrpAsn: 0.791 ± 0.235
TrpPro: 0.0 ± 0.0
TrpGln: 0.365 ± 0.135
TrpArg: 0.913 ± 0.221
TrpSer: 0.974 ± 0.209
TrpThr: 0.73 ± 0.245
TrpVal: 0.791 ± 0.224
TrpTrp: 0.365 ± 0.154
TrpTyr: 0.974 ± 0.239
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.434 ± 0.374
TyrCys: 0.548 ± 0.179
TyrAsp: 3.043 ± 0.392
TyrGlu: 3.529 ± 0.461
TyrPhe: 2.13 ± 0.357
TyrGly: 2.617 ± 0.333
TyrHis: 1.156 ± 0.251
TyrIle: 2.738 ± 0.376
TyrLys: 2.617 ± 0.39
TyrLeu: 2.495 ± 0.449
TyrMet: 1.035 ± 0.237
TyrAsn: 1.826 ± 0.347
TyrPro: 1.46 ± 0.393
TyrGln: 1.46 ± 0.293
TyrArg: 2.556 ± 0.378
TyrSer: 2.556 ± 0.425
TyrThr: 2.191 ± 0.336
TyrVal: 2.373 ± 0.426
TyrTrp: 0.487 ± 0.185
TyrTyr: 1.521 ± 0.332
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.061 ± 0.051
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (16434 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
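Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. The sketch below is one possible reading of that note: it draws 100 resampled proteomes, recomputes the per-mille frequency of a single dipeptide in each, and reports the sample standard deviation. The function names, the use of the standard deviation as the error measure, the normalisation by the total number of dipeptides, and the fixed seed are all assumptions for illustration, not the database's documented procedure.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation across resamples.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAA", "CCCC", "ACAC"]  # hypothetical toy input
    value = dipeptide_per_mille(toy_proteome, "AA")
    error = bootstrap_error(toy_proteome, "AA")
    print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins keeps neighbouring residues together, so the estimated error reflects variation between proteins rather than between independent positions.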

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski