Amino acid dipepetide frequency for Arthrobacter phage Auxilium

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.97AlaAla: 16.97 ± 1.693
1.148AlaCys: 1.148 ± 0.298
7.719AlaAsp: 7.719 ± 0.798
9.569AlaGlu: 9.569 ± 1.01
2.871AlaPhe: 2.871 ± 0.392
10.654AlaGly: 10.654 ± 1.218
2.488AlaHis: 2.488 ± 0.419
4.976AlaIle: 4.976 ± 0.548
5.933AlaLys: 5.933 ± 0.643
9.633AlaLeu: 9.633 ± 0.854
3.445AlaMet: 3.445 ± 0.606
3.126AlaAsn: 3.126 ± 0.448
4.019AlaPro: 4.019 ± 0.572
5.104AlaGln: 5.104 ± 0.597
7.656AlaArg: 7.656 ± 0.783
5.231AlaSer: 5.231 ± 0.686
6.188AlaThr: 6.188 ± 0.538
7.847AlaVal: 7.847 ± 0.63
2.297AlaTrp: 2.297 ± 0.431
2.169AlaTyr: 2.169 ± 0.345
0.0AlaXaa: 0.0 ± 0.0
Cys
0.574CysAla: 0.574 ± 0.203
0.191CysCys: 0.191 ± 0.115
0.957CysAsp: 0.957 ± 0.291
1.212CysGlu: 1.212 ± 0.458
0.191CysPhe: 0.191 ± 0.118
0.893CysGly: 0.893 ± 0.283
0.255CysHis: 0.255 ± 0.139
0.383CysIle: 0.383 ± 0.168
0.957CysLys: 0.957 ± 0.271
0.638CysLeu: 0.638 ± 0.217
0.191CysMet: 0.191 ± 0.124
0.255CysAsn: 0.255 ± 0.133
1.404CysPro: 1.404 ± 0.367
0.319CysGln: 0.319 ± 0.165
0.128CysArg: 0.128 ± 0.11
0.766CysSer: 0.766 ± 0.249
0.893CysThr: 0.893 ± 0.271
0.574CysVal: 0.574 ± 0.203
0.191CysTrp: 0.191 ± 0.111
0.191CysTyr: 0.191 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
7.847AspAla: 7.847 ± 0.776
0.829AspCys: 0.829 ± 0.242
3.636AspAsp: 3.636 ± 0.558
4.274AspGlu: 4.274 ± 0.555
2.169AspPhe: 2.169 ± 0.365
6.826AspGly: 6.826 ± 0.778
1.34AspHis: 1.34 ± 0.312
2.616AspIle: 2.616 ± 0.383
2.488AspLys: 2.488 ± 0.343
5.359AspLeu: 5.359 ± 0.569
1.467AspMet: 1.467 ± 0.271
1.659AspAsn: 1.659 ± 0.319
3.892AspPro: 3.892 ± 0.559
1.467AspGln: 1.467 ± 0.269
3.764AspArg: 3.764 ± 0.48
2.743AspSer: 2.743 ± 0.515
2.935AspThr: 2.935 ± 0.412
4.593AspVal: 4.593 ± 0.609
1.404AspTrp: 1.404 ± 0.282
1.786AspTyr: 1.786 ± 0.341
0.0AspXaa: 0.0 ± 0.0
Glu
9.123GluAla: 9.123 ± 0.95
0.638GluCys: 0.638 ± 0.207
3.445GluAsp: 3.445 ± 0.446
3.445GluGlu: 3.445 ± 0.53
1.531GluPhe: 1.531 ± 0.237
4.593GluGly: 4.593 ± 0.485
1.786GluHis: 1.786 ± 0.334
2.488GluIle: 2.488 ± 0.438
2.679GluLys: 2.679 ± 0.51
4.657GluLeu: 4.657 ± 0.535
1.404GluMet: 1.404 ± 0.256
1.914GluAsn: 1.914 ± 0.31
3.317GluPro: 3.317 ± 0.538
3.381GluGln: 3.381 ± 0.447
4.785GluArg: 4.785 ± 0.483
3.7GluSer: 3.7 ± 0.515
4.338GluThr: 4.338 ± 0.418
4.338GluVal: 4.338 ± 0.53
2.424GluTrp: 2.424 ± 0.514
1.978GluTyr: 1.978 ± 0.379
0.0GluXaa: 0.0 ± 0.0
Phe
2.36PheAla: 2.36 ± 0.443
0.638PheCys: 0.638 ± 0.23
2.616PheAsp: 2.616 ± 0.346
1.212PheGlu: 1.212 ± 0.364
0.766PhePhe: 0.766 ± 0.217
2.679PheGly: 2.679 ± 0.346
0.957PheHis: 0.957 ± 0.228
1.021PheIle: 1.021 ± 0.246
1.34PheLys: 1.34 ± 0.328
1.722PheLeu: 1.722 ± 0.402
0.574PheMet: 0.574 ± 0.178
0.829PheAsn: 0.829 ± 0.195
0.957PhePro: 0.957 ± 0.256
1.085PheGln: 1.085 ± 0.225
1.659PheArg: 1.659 ± 0.289
1.659PheSer: 1.659 ± 0.286
2.105PheThr: 2.105 ± 0.378
1.659PheVal: 1.659 ± 0.381
0.766PheTrp: 0.766 ± 0.264
0.766PheTyr: 0.766 ± 0.243
0.0PheXaa: 0.0 ± 0.0
Gly
7.847GlyAla: 7.847 ± 1.272
0.893GlyCys: 0.893 ± 0.244
4.721GlyAsp: 4.721 ± 0.496
4.53GlyGlu: 4.53 ± 0.506
2.616GlyPhe: 2.616 ± 0.454
6.38GlyGly: 6.38 ± 0.892
1.595GlyHis: 1.595 ± 0.32
3.892GlyIle: 3.892 ± 0.598
4.083GlyLys: 4.083 ± 0.442
6.635GlyLeu: 6.635 ± 0.884
2.36GlyMet: 2.36 ± 0.31
3.19GlyAsn: 3.19 ± 0.415
3.828GlyPro: 3.828 ± 0.681
2.935GlyGln: 2.935 ± 0.415
5.04GlyArg: 5.04 ± 0.609
4.53GlySer: 4.53 ± 0.601
5.359GlyThr: 5.359 ± 0.712
4.912GlyVal: 4.912 ± 0.527
1.978GlyTrp: 1.978 ± 0.415
3.062GlyTyr: 3.062 ± 0.392
0.0GlyXaa: 0.0 ± 0.0
His
2.552HisAla: 2.552 ± 0.392
0.319HisCys: 0.319 ± 0.151
1.722HisAsp: 1.722 ± 0.334
1.914HisGlu: 1.914 ± 0.364
0.702HisPhe: 0.702 ± 0.249
1.595HisGly: 1.595 ± 0.287
0.702HisHis: 0.702 ± 0.299
0.893HisIle: 0.893 ± 0.212
0.766HisLys: 0.766 ± 0.237
1.85HisLeu: 1.85 ± 0.359
0.51HisMet: 0.51 ± 0.167
0.638HisAsn: 0.638 ± 0.203
1.34HisPro: 1.34 ± 0.287
0.638HisGln: 0.638 ± 0.193
1.531HisArg: 1.531 ± 0.363
0.957HisSer: 0.957 ± 0.359
1.404HisThr: 1.404 ± 0.255
1.722HisVal: 1.722 ± 0.333
0.51HisTrp: 0.51 ± 0.206
0.893HisTyr: 0.893 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
5.04IleAla: 5.04 ± 0.547
0.51IleCys: 0.51 ± 0.191
2.935IleAsp: 2.935 ± 0.433
2.998IleGlu: 2.998 ± 0.415
1.085IlePhe: 1.085 ± 0.23
3.062IleGly: 3.062 ± 0.487
0.893IleHis: 0.893 ± 0.224
1.659IleIle: 1.659 ± 0.312
2.233IleLys: 2.233 ± 0.427
2.807IleLeu: 2.807 ± 0.502
0.574IleMet: 0.574 ± 0.235
1.212IleAsn: 1.212 ± 0.287
2.488IlePro: 2.488 ± 0.353
1.531IleGln: 1.531 ± 0.28
3.317IleArg: 3.317 ± 0.495
3.636IleSer: 3.636 ± 0.49
3.828IleThr: 3.828 ± 0.613
2.935IleVal: 2.935 ± 0.424
0.128IleTrp: 0.128 ± 0.095
1.212IleTyr: 1.212 ± 0.304
0.0IleXaa: 0.0 ± 0.0
Lys
5.997LysAla: 5.997 ± 0.959
0.574LysCys: 0.574 ± 0.265
2.679LysAsp: 2.679 ± 0.353
2.105LysGlu: 2.105 ± 0.372
1.34LysPhe: 1.34 ± 0.27
3.19LysGly: 3.19 ± 0.415
1.085LysHis: 1.085 ± 0.309
2.169LysIle: 2.169 ± 0.395
2.297LysLys: 2.297 ± 0.392
3.19LysLeu: 3.19 ± 0.44
1.786LysMet: 1.786 ± 0.284
1.659LysAsn: 1.659 ± 0.289
3.19LysPro: 3.19 ± 0.499
2.233LysGln: 2.233 ± 0.322
3.764LysArg: 3.764 ± 0.589
1.914LysSer: 1.914 ± 0.331
2.998LysThr: 2.998 ± 0.322
3.126LysVal: 3.126 ± 0.462
0.51LysTrp: 0.51 ± 0.161
0.893LysTyr: 0.893 ± 0.291
0.0LysXaa: 0.0 ± 0.0
Leu
10.526LeuAla: 10.526 ± 1.026
0.702LeuCys: 0.702 ± 0.281
4.466LeuAsp: 4.466 ± 0.609
4.211LeuGlu: 4.211 ± 0.586
1.85LeuPhe: 1.85 ± 0.439
6.188LeuGly: 6.188 ± 0.638
1.595LeuHis: 1.595 ± 0.367
3.955LeuIle: 3.955 ± 0.433
3.955LeuLys: 3.955 ± 0.591
5.678LeuLeu: 5.678 ± 0.591
1.404LeuMet: 1.404 ± 0.337
2.679LeuAsn: 2.679 ± 0.538
4.848LeuPro: 4.848 ± 0.624
2.297LeuGln: 2.297 ± 0.402
5.104LeuArg: 5.104 ± 0.545
4.274LeuSer: 4.274 ± 0.634
5.742LeuThr: 5.742 ± 0.762
4.593LeuVal: 4.593 ± 0.548
1.404LeuTrp: 1.404 ± 0.3
1.85LeuTyr: 1.85 ± 0.332
0.0LeuXaa: 0.0 ± 0.0
Met
3.573MetAla: 3.573 ± 0.471
0.128MetCys: 0.128 ± 0.092
1.722MetAsp: 1.722 ± 0.349
1.212MetGlu: 1.212 ± 0.264
0.638MetPhe: 0.638 ± 0.202
1.276MetGly: 1.276 ± 0.459
0.638MetHis: 0.638 ± 0.196
0.829MetIle: 0.829 ± 0.235
0.893MetLys: 0.893 ± 0.212
1.722MetLeu: 1.722 ± 0.297
0.319MetMet: 0.319 ± 0.142
1.021MetAsn: 1.021 ± 0.25
1.595MetPro: 1.595 ± 0.357
0.574MetGln: 0.574 ± 0.202
1.085MetArg: 1.085 ± 0.296
2.488MetSer: 2.488 ± 0.356
2.488MetThr: 2.488 ± 0.472
1.659MetVal: 1.659 ± 0.318
0.191MetTrp: 0.191 ± 0.102
0.447MetTyr: 0.447 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
3.764AsnAla: 3.764 ± 0.59
0.064AsnCys: 0.064 ± 0.078
1.531AsnAsp: 1.531 ± 0.363
1.531AsnGlu: 1.531 ± 0.368
0.638AsnPhe: 0.638 ± 0.153
3.381AsnGly: 3.381 ± 0.54
1.467AsnHis: 1.467 ± 0.377
0.957AsnIle: 0.957 ± 0.248
1.276AsnLys: 1.276 ± 0.283
2.297AsnLeu: 2.297 ± 0.382
0.447AsnMet: 0.447 ± 0.212
1.021AsnAsn: 1.021 ± 0.25
2.935AsnPro: 2.935 ± 0.554
1.085AsnGln: 1.085 ± 0.237
1.595AsnArg: 1.595 ± 0.29
2.105AsnSer: 2.105 ± 0.363
1.786AsnThr: 1.786 ± 0.317
2.041AsnVal: 2.041 ± 0.321
0.447AsnTrp: 0.447 ± 0.167
0.638AsnTyr: 0.638 ± 0.201
0.0AsnXaa: 0.0 ± 0.0
Pro
5.359ProAla: 5.359 ± 0.609
0.829ProCys: 0.829 ± 0.24
4.147ProAsp: 4.147 ± 0.631
5.167ProGlu: 5.167 ± 0.773
1.531ProPhe: 1.531 ± 0.309
4.466ProGly: 4.466 ± 0.733
1.021ProHis: 1.021 ± 0.25
2.424ProIle: 2.424 ± 0.376
2.743ProLys: 2.743 ± 0.432
3.828ProLeu: 3.828 ± 0.517
1.276ProMet: 1.276 ± 0.271
1.85ProAsn: 1.85 ± 0.373
3.254ProPro: 3.254 ± 0.7
1.659ProGln: 1.659 ± 0.396
3.19ProArg: 3.19 ± 0.521
2.488ProSer: 2.488 ± 0.456
2.616ProThr: 2.616 ± 0.437
3.892ProVal: 3.892 ± 0.584
1.021ProTrp: 1.021 ± 0.236
1.085ProTyr: 1.085 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
4.338GlnAla: 4.338 ± 0.632
0.638GlnCys: 0.638 ± 0.202
2.488GlnAsp: 2.488 ± 0.504
2.233GlnGlu: 2.233 ± 0.416
1.021GlnPhe: 1.021 ± 0.235
3.19GlnGly: 3.19 ± 0.533
0.702GlnHis: 0.702 ± 0.182
1.722GlnIle: 1.722 ± 0.245
1.595GlnLys: 1.595 ± 0.302
2.871GlnLeu: 2.871 ± 0.419
1.212GlnMet: 1.212 ± 0.298
0.638GlnAsn: 0.638 ± 0.167
1.467GlnPro: 1.467 ± 0.364
1.722GlnGln: 1.722 ± 0.328
2.297GlnArg: 2.297 ± 0.398
1.978GlnSer: 1.978 ± 0.307
2.424GlnThr: 2.424 ± 0.436
2.169GlnVal: 2.169 ± 0.363
0.766GlnTrp: 0.766 ± 0.238
0.957GlnTyr: 0.957 ± 0.259
0.0GlnXaa: 0.0 ± 0.0
Arg
6.507ArgAla: 6.507 ± 0.622
0.957ArgCys: 0.957 ± 0.304
3.955ArgAsp: 3.955 ± 0.617
4.083ArgGlu: 4.083 ± 0.439
1.978ArgPhe: 1.978 ± 0.393
4.338ArgGly: 4.338 ± 0.5
1.914ArgHis: 1.914 ± 0.445
4.402ArgIle: 4.402 ± 0.459
2.679ArgLys: 2.679 ± 0.313
5.678ArgLeu: 5.678 ± 0.657
2.233ArgMet: 2.233 ± 0.316
2.169ArgAsn: 2.169 ± 0.384
2.679ArgPro: 2.679 ± 0.416
1.85ArgGln: 1.85 ± 0.329
4.657ArgArg: 4.657 ± 0.674
3.573ArgSer: 3.573 ± 0.486
3.573ArgThr: 3.573 ± 0.426
3.892ArgVal: 3.892 ± 0.576
1.595ArgTrp: 1.595 ± 0.288
2.424ArgTyr: 2.424 ± 0.41
0.0ArgXaa: 0.0 ± 0.0
Ser
5.359SerAla: 5.359 ± 0.555
0.319SerCys: 0.319 ± 0.13
3.254SerAsp: 3.254 ± 0.414
3.828SerGlu: 3.828 ± 0.443
1.148SerPhe: 1.148 ± 0.236
5.678SerGly: 5.678 ± 0.629
0.957SerHis: 0.957 ± 0.221
2.616SerIle: 2.616 ± 0.465
3.062SerLys: 3.062 ± 0.439
5.231SerLeu: 5.231 ± 0.559
1.467SerMet: 1.467 ± 0.325
1.212SerAsn: 1.212 ± 0.247
2.807SerPro: 2.807 ± 0.429
2.105SerGln: 2.105 ± 0.315
3.7SerArg: 3.7 ± 0.468
1.914SerSer: 1.914 ± 0.333
3.126SerThr: 3.126 ± 0.386
3.062SerVal: 3.062 ± 0.45
0.957SerTrp: 0.957 ± 0.28
1.978SerTyr: 1.978 ± 0.283
0.0SerXaa: 0.0 ± 0.0
Thr
7.783ThrAla: 7.783 ± 0.565
0.574ThrCys: 0.574 ± 0.188
3.509ThrAsp: 3.509 ± 0.536
5.231ThrGlu: 5.231 ± 0.565
2.105ThrPhe: 2.105 ± 0.327
4.211ThrGly: 4.211 ± 0.546
1.595ThrHis: 1.595 ± 0.328
2.743ThrIle: 2.743 ± 0.486
2.488ThrLys: 2.488 ± 0.388
4.402ThrLeu: 4.402 ± 0.657
1.212ThrMet: 1.212 ± 0.27
2.233ThrAsn: 2.233 ± 0.35
3.7ThrPro: 3.7 ± 0.515
2.297ThrGln: 2.297 ± 0.393
3.509ThrArg: 3.509 ± 0.53
3.19ThrSer: 3.19 ± 0.478
4.466ThrThr: 4.466 ± 0.546
5.933ThrVal: 5.933 ± 0.684
1.021ThrTrp: 1.021 ± 0.273
1.914ThrTyr: 1.914 ± 0.315
0.0ThrXaa: 0.0 ± 0.0
Val
8.102ValAla: 8.102 ± 0.711
0.574ValCys: 0.574 ± 0.195
4.211ValAsp: 4.211 ± 0.723
4.019ValGlu: 4.019 ± 0.519
1.978ValPhe: 1.978 ± 0.303
3.892ValGly: 3.892 ± 0.596
1.276ValHis: 1.276 ± 0.325
2.169ValIle: 2.169 ± 0.464
3.509ValLys: 3.509 ± 0.514
5.742ValLeu: 5.742 ± 0.609
1.722ValMet: 1.722 ± 0.293
2.297ValAsn: 2.297 ± 0.317
4.274ValPro: 4.274 ± 0.477
2.233ValGln: 2.233 ± 0.337
4.338ValArg: 4.338 ± 0.524
4.019ValSer: 4.019 ± 0.53
5.04ValThr: 5.04 ± 0.556
4.976ValVal: 4.976 ± 0.624
0.638ValTrp: 0.638 ± 0.2
2.041ValTyr: 2.041 ± 0.319
0.0ValXaa: 0.0 ± 0.0
Trp
2.488TrpAla: 2.488 ± 0.535
0.383TrpCys: 0.383 ± 0.175
1.85TrpAsp: 1.85 ± 0.323
0.893TrpGlu: 0.893 ± 0.229
0.447TrpPhe: 0.447 ± 0.166
1.404TrpGly: 1.404 ± 0.371
0.574TrpHis: 0.574 ± 0.275
0.893TrpIle: 0.893 ± 0.241
0.574TrpLys: 0.574 ± 0.191
1.659TrpLeu: 1.659 ± 0.315
0.319TrpMet: 0.319 ± 0.176
0.51TrpAsn: 0.51 ± 0.189
0.766TrpPro: 0.766 ± 0.258
0.893TrpGln: 0.893 ± 0.217
1.276TrpArg: 1.276 ± 0.267
1.34TrpSer: 1.34 ± 0.291
0.829TrpThr: 0.829 ± 0.22
1.085TrpVal: 1.085 ± 0.245
0.319TrpTrp: 0.319 ± 0.168
0.255TrpTyr: 0.255 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.935TyrAla: 2.935 ± 0.415
0.255TyrCys: 0.255 ± 0.154
1.531TyrAsp: 1.531 ± 0.292
2.297TyrGlu: 2.297 ± 0.276
0.766TyrPhe: 0.766 ± 0.253
2.488TyrGly: 2.488 ± 0.404
0.319TyrHis: 0.319 ± 0.132
1.276TyrIle: 1.276 ± 0.31
1.148TyrLys: 1.148 ± 0.241
1.722TyrLeu: 1.722 ± 0.406
0.51TyrMet: 0.51 ± 0.19
0.893TyrAsn: 0.893 ± 0.231
1.085TyrPro: 1.085 ± 0.23
1.021TyrGln: 1.021 ± 0.246
2.743TyrArg: 2.743 ± 0.4
1.276TyrSer: 1.276 ± 0.245
2.041TyrThr: 2.041 ± 0.343
2.041TyrVal: 2.041 ± 0.332
0.191TyrTrp: 0.191 ± 0.111
0.574TyrTyr: 0.574 ± 0.189
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (15676 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski