Amino acid dipepetide frequency for Salmonella phage IME207

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.049AlaAla: 8.049 ± 1.217
0.614AlaCys: 0.614 ± 0.241
4.843AlaAsp: 4.843 ± 0.671
4.979AlaGlu: 4.979 ± 0.805
2.046AlaPhe: 2.046 ± 0.358
6.821AlaGly: 6.821 ± 0.663
0.477AlaHis: 0.477 ± 0.191
5.866AlaIle: 5.866 ± 0.802
5.593AlaLys: 5.593 ± 0.759
7.162AlaLeu: 7.162 ± 0.893
2.592AlaMet: 2.592 ± 0.508
4.434AlaAsn: 4.434 ± 0.706
2.251AlaPro: 2.251 ± 0.38
3.069AlaGln: 3.069 ± 0.611
5.32AlaArg: 5.32 ± 0.59
5.729AlaSer: 5.729 ± 0.948
5.116AlaThr: 5.116 ± 0.809
5.593AlaVal: 5.593 ± 0.597
1.228AlaTrp: 1.228 ± 0.277
2.251AlaTyr: 2.251 ± 0.396
0.0AlaXaa: 0.0 ± 0.0
Cys
0.955CysAla: 0.955 ± 0.26
0.409CysCys: 0.409 ± 0.162
1.023CysAsp: 1.023 ± 0.264
0.75CysGlu: 0.75 ± 0.229
0.273CysPhe: 0.273 ± 0.138
1.773CysGly: 1.773 ± 0.417
0.409CysHis: 0.409 ± 0.177
0.546CysIle: 0.546 ± 0.186
0.818CysLys: 0.818 ± 0.218
0.955CysLeu: 0.955 ± 0.247
0.341CysMet: 0.341 ± 0.147
0.546CysAsn: 0.546 ± 0.211
0.682CysPro: 0.682 ± 0.196
0.341CysGln: 0.341 ± 0.16
0.887CysArg: 0.887 ± 0.262
0.75CysSer: 0.75 ± 0.251
0.341CysThr: 0.341 ± 0.184
0.682CysVal: 0.682 ± 0.234
0.341CysTrp: 0.341 ± 0.152
0.409CysTyr: 0.409 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
5.729AspAla: 5.729 ± 0.694
0.614AspCys: 0.614 ± 0.237
5.729AspAsp: 5.729 ± 0.715
4.434AspGlu: 4.434 ± 0.621
3.069AspPhe: 3.069 ± 0.51
5.866AspGly: 5.866 ± 0.712
1.16AspHis: 1.16 ± 0.243
3.956AspIle: 3.956 ± 0.5
3.751AspLys: 3.751 ± 0.588
2.865AspLeu: 2.865 ± 0.553
2.251AspMet: 2.251 ± 0.386
2.933AspAsn: 2.933 ± 0.508
1.773AspPro: 1.773 ± 0.332
1.296AspGln: 1.296 ± 0.357
2.114AspArg: 2.114 ± 0.348
2.797AspSer: 2.797 ± 0.421
2.865AspThr: 2.865 ± 0.405
4.434AspVal: 4.434 ± 0.573
0.682AspTrp: 0.682 ± 0.222
3.206AspTyr: 3.206 ± 0.482
0.0AspXaa: 0.0 ± 0.0
Glu
5.116GluAla: 5.116 ± 0.6
0.955GluCys: 0.955 ± 0.272
3.41GluAsp: 3.41 ± 0.536
5.184GluGlu: 5.184 ± 0.671
2.387GluPhe: 2.387 ± 0.376
2.865GluGly: 2.865 ± 0.424
0.887GluHis: 0.887 ± 0.249
4.024GluIle: 4.024 ± 0.496
4.092GluLys: 4.092 ± 0.599
5.934GluLeu: 5.934 ± 0.594
2.592GluMet: 2.592 ± 0.514
2.455GluAsn: 2.455 ± 0.357
2.524GluPro: 2.524 ± 0.366
4.024GluGln: 4.024 ± 0.632
3.41GluArg: 3.41 ± 0.51
3.138GluSer: 3.138 ± 0.446
3.206GluThr: 3.206 ± 0.442
3.751GluVal: 3.751 ± 0.517
1.978GluTrp: 1.978 ± 0.353
2.592GluTyr: 2.592 ± 0.392
0.0GluXaa: 0.0 ± 0.0
Phe
2.114PheAla: 2.114 ± 0.407
1.16PheCys: 1.16 ± 0.261
2.592PheAsp: 2.592 ± 0.414
2.319PheGlu: 2.319 ± 0.422
1.091PhePhe: 1.091 ± 0.259
2.797PheGly: 2.797 ± 0.39
0.818PheHis: 0.818 ± 0.239
3.274PheIle: 3.274 ± 0.478
1.637PheLys: 1.637 ± 0.298
1.978PheLeu: 1.978 ± 0.34
1.091PheMet: 1.091 ± 0.274
2.387PheAsn: 2.387 ± 0.423
1.432PhePro: 1.432 ± 0.271
1.023PheGln: 1.023 ± 0.261
1.501PheArg: 1.501 ± 0.389
1.842PheSer: 1.842 ± 0.314
1.91PheThr: 1.91 ± 0.374
2.319PheVal: 2.319 ± 0.463
0.75PheTrp: 0.75 ± 0.241
1.705PheTyr: 1.705 ± 0.382
0.0PheXaa: 0.0 ± 0.0
Gly
5.661GlyAla: 5.661 ± 0.739
1.091GlyCys: 1.091 ± 0.288
4.502GlyAsp: 4.502 ± 0.754
5.184GlyGlu: 5.184 ± 0.601
2.251GlyPhe: 2.251 ± 0.311
5.457GlyGly: 5.457 ± 0.74
1.228GlyHis: 1.228 ± 0.271
4.502GlyIle: 4.502 ± 0.517
6.139GlyLys: 6.139 ± 0.732
4.57GlyLeu: 4.57 ± 0.549
1.842GlyMet: 1.842 ± 0.374
2.933GlyAsn: 2.933 ± 0.385
0.818GlyPro: 0.818 ± 0.221
2.524GlyGln: 2.524 ± 0.492
3.206GlyArg: 3.206 ± 0.496
4.843GlySer: 4.843 ± 0.52
3.615GlyThr: 3.615 ± 0.676
5.525GlyVal: 5.525 ± 0.775
1.296GlyTrp: 1.296 ± 0.306
3.547GlyTyr: 3.547 ± 0.529
0.0GlyXaa: 0.0 ± 0.0
His
1.228HisAla: 1.228 ± 0.312
0.546HisCys: 0.546 ± 0.176
0.887HisAsp: 0.887 ± 0.234
1.228HisGlu: 1.228 ± 0.27
0.546HisPhe: 0.546 ± 0.186
1.705HisGly: 1.705 ± 0.343
0.546HisHis: 0.546 ± 0.256
0.682HisIle: 0.682 ± 0.292
0.887HisLys: 0.887 ± 0.232
1.501HisLeu: 1.501 ± 0.236
0.682HisMet: 0.682 ± 0.217
0.818HisAsn: 0.818 ± 0.232
0.887HisPro: 0.887 ± 0.246
0.682HisGln: 0.682 ± 0.181
1.228HisArg: 1.228 ± 0.287
1.023HisSer: 1.023 ± 0.263
0.546HisThr: 0.546 ± 0.222
1.705HisVal: 1.705 ± 0.348
0.273HisTrp: 0.273 ± 0.125
1.228HisTyr: 1.228 ± 0.265
0.0HisXaa: 0.0 ± 0.0
Ile
5.116IleAla: 5.116 ± 0.563
0.887IleCys: 0.887 ± 0.269
5.047IleAsp: 5.047 ± 0.562
4.365IleGlu: 4.365 ± 0.604
2.319IlePhe: 2.319 ± 0.406
4.297IleGly: 4.297 ± 0.574
1.569IleHis: 1.569 ± 0.278
4.365IleIle: 4.365 ± 0.662
4.502IleLys: 4.502 ± 0.567
3.479IleLeu: 3.479 ± 0.55
1.91IleMet: 1.91 ± 0.357
3.888IleAsn: 3.888 ± 0.559
2.592IlePro: 2.592 ± 0.445
1.569IleGln: 1.569 ± 0.315
3.888IleArg: 3.888 ± 0.452
4.297IleSer: 4.297 ± 0.585
3.82IleThr: 3.82 ± 0.57
4.297IleVal: 4.297 ± 0.597
0.409IleTrp: 0.409 ± 0.157
1.842IleTyr: 1.842 ± 0.398
0.0IleXaa: 0.0 ± 0.0
Lys
5.252LysAla: 5.252 ± 0.629
1.023LysCys: 1.023 ± 0.292
3.138LysAsp: 3.138 ± 0.561
3.82LysGlu: 3.82 ± 0.543
2.114LysPhe: 2.114 ± 0.461
3.479LysGly: 3.479 ± 0.491
1.364LysHis: 1.364 ± 0.302
3.956LysIle: 3.956 ± 0.56
4.297LysLys: 4.297 ± 0.672
5.116LysLeu: 5.116 ± 0.679
3.069LysMet: 3.069 ± 0.458
2.66LysAsn: 2.66 ± 0.428
2.387LysPro: 2.387 ± 0.413
2.865LysGln: 2.865 ± 0.454
3.547LysArg: 3.547 ± 0.563
4.092LysSer: 4.092 ± 0.479
3.206LysThr: 3.206 ± 0.605
3.956LysVal: 3.956 ± 0.543
1.364LysTrp: 1.364 ± 0.328
2.524LysTyr: 2.524 ± 0.34
0.0LysXaa: 0.0 ± 0.0
Leu
5.866LeuAla: 5.866 ± 0.821
0.818LeuCys: 0.818 ± 0.236
3.615LeuAsp: 3.615 ± 0.472
4.024LeuGlu: 4.024 ± 0.453
2.455LeuPhe: 2.455 ± 0.443
4.706LeuGly: 4.706 ± 0.524
1.091LeuHis: 1.091 ± 0.299
4.297LeuIle: 4.297 ± 0.541
4.57LeuLys: 4.57 ± 0.683
4.161LeuLeu: 4.161 ± 0.649
1.842LeuMet: 1.842 ± 0.3
3.888LeuAsn: 3.888 ± 0.565
3.41LeuPro: 3.41 ± 0.508
3.138LeuGln: 3.138 ± 0.518
4.706LeuArg: 4.706 ± 0.551
4.434LeuSer: 4.434 ± 0.553
4.911LeuThr: 4.911 ± 0.444
3.888LeuVal: 3.888 ± 0.712
1.705LeuTrp: 1.705 ± 0.31
2.933LeuTyr: 2.933 ± 0.431
0.0LeuXaa: 0.0 ± 0.0
Met
2.728MetAla: 2.728 ± 0.404
0.341MetCys: 0.341 ± 0.18
1.637MetAsp: 1.637 ± 0.336
1.432MetGlu: 1.432 ± 0.286
1.501MetPhe: 1.501 ± 0.357
1.842MetGly: 1.842 ± 0.435
0.614MetHis: 0.614 ± 0.215
2.387MetIle: 2.387 ± 0.417
2.455MetLys: 2.455 ± 0.399
1.978MetLeu: 1.978 ± 0.364
0.818MetMet: 0.818 ± 0.211
1.637MetAsn: 1.637 ± 0.314
2.046MetPro: 2.046 ± 0.477
1.16MetGln: 1.16 ± 0.344
1.501MetArg: 1.501 ± 0.265
2.728MetSer: 2.728 ± 0.47
2.251MetThr: 2.251 ± 0.405
1.705MetVal: 1.705 ± 0.415
0.682MetTrp: 0.682 ± 0.209
1.023MetTyr: 1.023 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
5.729AsnAla: 5.729 ± 1.034
0.341AsnCys: 0.341 ± 0.13
3.206AsnAsp: 3.206 ± 0.461
2.66AsnGlu: 2.66 ± 0.363
1.501AsnPhe: 1.501 ± 0.325
4.502AsnGly: 4.502 ± 0.586
1.364AsnHis: 1.364 ± 0.309
2.387AsnIle: 2.387 ± 0.376
3.069AsnLys: 3.069 ± 0.445
2.933AsnLeu: 2.933 ± 0.482
1.023AsnMet: 1.023 ± 0.308
2.592AsnAsn: 2.592 ± 0.501
1.842AsnPro: 1.842 ± 0.402
1.569AsnGln: 1.569 ± 0.288
2.592AsnArg: 2.592 ± 0.412
3.751AsnSer: 3.751 ± 0.562
2.455AsnThr: 2.455 ± 0.411
3.342AsnVal: 3.342 ± 0.634
0.955AsnTrp: 0.955 ± 0.236
1.773AsnTyr: 1.773 ± 0.337
0.0AsnXaa: 0.0 ± 0.0
Pro
3.274ProAla: 3.274 ± 0.572
0.546ProCys: 0.546 ± 0.173
3.274ProAsp: 3.274 ± 0.649
3.41ProGlu: 3.41 ± 0.478
1.569ProPhe: 1.569 ± 0.299
1.842ProGly: 1.842 ± 0.363
0.887ProHis: 0.887 ± 0.224
1.705ProIle: 1.705 ± 0.295
1.842ProLys: 1.842 ± 0.365
2.046ProLeu: 2.046 ± 0.409
1.023ProMet: 1.023 ± 0.236
1.569ProAsn: 1.569 ± 0.358
1.023ProPro: 1.023 ± 0.302
1.364ProGln: 1.364 ± 0.422
1.842ProArg: 1.842 ± 0.432
2.183ProSer: 2.183 ± 0.453
1.296ProThr: 1.296 ± 0.292
3.41ProVal: 3.41 ± 0.539
0.546ProTrp: 0.546 ± 0.217
1.501ProTyr: 1.501 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
3.41GlnAla: 3.41 ± 0.681
0.205GlnCys: 0.205 ± 0.122
1.501GlnAsp: 1.501 ± 0.333
2.387GlnGlu: 2.387 ± 0.46
1.501GlnPhe: 1.501 ± 0.31
1.501GlnGly: 1.501 ± 0.398
1.296GlnHis: 1.296 ± 0.327
2.865GlnIle: 2.865 ± 0.501
2.319GlnLys: 2.319 ± 0.389
3.547GlnLeu: 3.547 ± 0.53
1.773GlnMet: 1.773 ± 0.422
1.091GlnAsn: 1.091 ± 0.277
1.569GlnPro: 1.569 ± 0.375
2.66GlnGln: 2.66 ± 0.674
1.978GlnArg: 1.978 ± 0.383
2.524GlnSer: 2.524 ± 0.435
2.183GlnThr: 2.183 ± 0.484
2.319GlnVal: 2.319 ± 0.421
0.682GlnTrp: 0.682 ± 0.2
1.296GlnTyr: 1.296 ± 0.355
0.0GlnXaa: 0.0 ± 0.0
Arg
4.229ArgAla: 4.229 ± 0.433
1.16ArgCys: 1.16 ± 0.433
2.592ArgAsp: 2.592 ± 0.481
3.41ArgGlu: 3.41 ± 0.434
2.251ArgPhe: 2.251 ± 0.394
3.138ArgGly: 3.138 ± 0.475
0.887ArgHis: 0.887 ± 0.214
3.888ArgIle: 3.888 ± 0.468
3.479ArgLys: 3.479 ± 0.478
4.706ArgLeu: 4.706 ± 0.618
2.114ArgMet: 2.114 ± 0.343
3.274ArgAsn: 3.274 ± 0.423
1.364ArgPro: 1.364 ± 0.296
2.251ArgGln: 2.251 ± 0.512
3.206ArgArg: 3.206 ± 0.617
3.547ArgSer: 3.547 ± 0.466
1.705ArgThr: 1.705 ± 0.359
3.138ArgVal: 3.138 ± 0.456
0.75ArgTrp: 0.75 ± 0.266
3.138ArgTyr: 3.138 ± 0.418
0.0ArgXaa: 0.0 ± 0.0
Ser
4.911SerAla: 4.911 ± 0.691
0.818SerCys: 0.818 ± 0.222
4.229SerAsp: 4.229 ± 0.452
3.888SerGlu: 3.888 ± 0.563
2.728SerPhe: 2.728 ± 0.444
6.207SerGly: 6.207 ± 0.654
0.75SerHis: 0.75 ± 0.215
3.683SerIle: 3.683 ± 0.575
3.82SerLys: 3.82 ± 0.541
5.116SerLeu: 5.116 ± 0.642
1.978SerMet: 1.978 ± 0.419
2.455SerAsn: 2.455 ± 0.424
2.387SerPro: 2.387 ± 0.451
2.524SerGln: 2.524 ± 0.403
3.479SerArg: 3.479 ± 0.491
3.82SerSer: 3.82 ± 0.515
3.82SerThr: 3.82 ± 0.757
3.615SerVal: 3.615 ± 0.438
0.887SerTrp: 0.887 ± 0.244
2.66SerTyr: 2.66 ± 0.499
0.0SerXaa: 0.0 ± 0.0
Thr
4.775ThrAla: 4.775 ± 0.71
0.409ThrCys: 0.409 ± 0.15
3.41ThrAsp: 3.41 ± 0.563
3.751ThrGlu: 3.751 ± 0.5
1.364ThrPhe: 1.364 ± 0.322
5.047ThrGly: 5.047 ± 0.578
0.887ThrHis: 0.887 ± 0.283
3.342ThrIle: 3.342 ± 0.431
3.206ThrLys: 3.206 ± 0.501
3.206ThrLeu: 3.206 ± 0.374
1.637ThrMet: 1.637 ± 0.309
2.797ThrAsn: 2.797 ± 0.447
3.001ThrPro: 3.001 ± 0.456
1.637ThrGln: 1.637 ± 0.363
2.865ThrArg: 2.865 ± 0.45
3.82ThrSer: 3.82 ± 0.607
4.024ThrThr: 4.024 ± 0.611
3.138ThrVal: 3.138 ± 0.457
1.091ThrTrp: 1.091 ± 0.352
1.773ThrTyr: 1.773 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
5.32ValAla: 5.32 ± 0.742
0.409ValCys: 0.409 ± 0.184
3.206ValAsp: 3.206 ± 0.661
4.092ValGlu: 4.092 ± 0.544
2.524ValPhe: 2.524 ± 0.496
3.615ValGly: 3.615 ± 0.548
1.091ValHis: 1.091 ± 0.276
5.184ValIle: 5.184 ± 0.565
3.751ValLys: 3.751 ± 0.552
4.434ValLeu: 4.434 ± 0.511
2.66ValMet: 2.66 ± 0.398
4.365ValAsn: 4.365 ± 0.605
1.637ValPro: 1.637 ± 0.396
2.455ValGln: 2.455 ± 0.363
3.615ValArg: 3.615 ± 0.596
4.911ValSer: 4.911 ± 0.585
4.297ValThr: 4.297 ± 0.921
4.365ValVal: 4.365 ± 0.742
1.296ValTrp: 1.296 ± 0.347
2.387ValTyr: 2.387 ± 0.361
0.0ValXaa: 0.0 ± 0.0
Trp
0.955TrpAla: 0.955 ± 0.271
0.341TrpCys: 0.341 ± 0.151
1.023TrpAsp: 1.023 ± 0.235
0.955TrpGlu: 0.955 ± 0.253
0.682TrpPhe: 0.682 ± 0.2
0.341TrpGly: 0.341 ± 0.177
0.546TrpHis: 0.546 ± 0.227
1.091TrpIle: 1.091 ± 0.254
0.887TrpLys: 0.887 ± 0.272
1.91TrpLeu: 1.91 ± 0.402
0.409TrpMet: 0.409 ± 0.185
1.228TrpAsn: 1.228 ± 0.276
0.614TrpPro: 0.614 ± 0.21
0.955TrpGln: 0.955 ± 0.268
1.432TrpArg: 1.432 ± 0.327
0.887TrpSer: 0.887 ± 0.233
1.16TrpThr: 1.16 ± 0.236
1.773TrpVal: 1.773 ± 0.298
0.273TrpTrp: 0.273 ± 0.126
0.477TrpTyr: 0.477 ± 0.179
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.615TyrAla: 3.615 ± 0.37
0.546TyrCys: 0.546 ± 0.176
2.728TyrAsp: 2.728 ± 0.425
2.251TyrGlu: 2.251 ± 0.392
1.569TyrPhe: 1.569 ± 0.373
2.933TyrGly: 2.933 ± 0.539
0.955TyrHis: 0.955 ± 0.252
2.455TyrIle: 2.455 ± 0.385
2.114TyrLys: 2.114 ± 0.415
2.865TyrLeu: 2.865 ± 0.511
0.75TyrMet: 0.75 ± 0.23
1.637TyrAsn: 1.637 ± 0.324
1.91TyrPro: 1.91 ± 0.339
1.501TyrGln: 1.501 ± 0.339
1.91TyrArg: 1.91 ± 0.39
2.66TyrSer: 2.66 ± 0.473
2.455TyrThr: 2.455 ± 0.477
2.66TyrVal: 2.66 ± 0.416
0.75TyrTrp: 0.75 ± 0.184
1.432TyrTyr: 1.432 ± 0.428
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (14662 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski