Amino acid dipepetide frequency for Acinetobacter phage vB_AbaM_IME284

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.254AlaAla: 5.254 ± 0.868
0.761AlaCys: 0.761 ± 0.25
2.97AlaAsp: 2.97 ± 0.471
4.036AlaGlu: 4.036 ± 0.536
2.817AlaPhe: 2.817 ± 0.471
4.188AlaGly: 4.188 ± 0.682
1.447AlaHis: 1.447 ± 0.352
5.178AlaIle: 5.178 ± 0.632
5.482AlaLys: 5.482 ± 0.684
6.777AlaLeu: 6.777 ± 0.711
2.36AlaMet: 2.36 ± 0.606
4.34AlaAsn: 4.34 ± 0.673
2.589AlaPro: 2.589 ± 0.383
3.503AlaGln: 3.503 ± 0.555
2.132AlaArg: 2.132 ± 0.394
3.731AlaSer: 3.731 ± 0.513
5.178AlaThr: 5.178 ± 0.884
4.036AlaVal: 4.036 ± 0.62
0.99AlaTrp: 0.99 ± 0.241
2.437AlaTyr: 2.437 ± 0.386
0.0AlaXaa: 0.0 ± 0.0
Cys
0.838CysAla: 0.838 ± 0.237
0.076CysCys: 0.076 ± 0.071
0.761CysAsp: 0.761 ± 0.26
1.066CysGlu: 1.066 ± 0.317
0.533CysPhe: 0.533 ± 0.193
0.99CysGly: 0.99 ± 0.255
0.305CysHis: 0.305 ± 0.175
0.457CysIle: 0.457 ± 0.213
1.371CysLys: 1.371 ± 0.346
1.447CysLeu: 1.447 ± 0.315
0.076CysMet: 0.076 ± 0.075
0.533CysAsn: 0.533 ± 0.208
0.152CysPro: 0.152 ± 0.107
0.305CysGln: 0.305 ± 0.151
0.457CysArg: 0.457 ± 0.156
1.066CysSer: 1.066 ± 0.289
0.076CysThr: 0.076 ± 0.072
1.066CysVal: 1.066 ± 0.254
0.228CysTrp: 0.228 ± 0.143
0.228CysTyr: 0.228 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
4.645AspAla: 4.645 ± 0.524
0.685AspCys: 0.685 ± 0.212
4.34AspAsp: 4.34 ± 0.667
4.949AspGlu: 4.949 ± 0.725
2.817AspPhe: 2.817 ± 0.437
4.492AspGly: 4.492 ± 0.537
0.99AspHis: 0.99 ± 0.273
3.122AspIle: 3.122 ± 0.502
4.492AspLys: 4.492 ± 0.847
4.721AspLeu: 4.721 ± 0.512
0.914AspMet: 0.914 ± 0.232
2.437AspAsn: 2.437 ± 0.444
2.208AspPro: 2.208 ± 0.412
2.893AspGln: 2.893 ± 0.477
2.284AspArg: 2.284 ± 0.365
4.112AspSer: 4.112 ± 0.508
2.589AspThr: 2.589 ± 0.415
3.655AspVal: 3.655 ± 0.475
1.599AspTrp: 1.599 ± 0.411
2.665AspTyr: 2.665 ± 0.475
0.0AspXaa: 0.0 ± 0.0
Glu
4.645GluAla: 4.645 ± 0.559
0.685GluCys: 0.685 ± 0.218
4.112GluAsp: 4.112 ± 0.944
4.188GluGlu: 4.188 ± 0.67
3.655GluPhe: 3.655 ± 0.587
4.036GluGly: 4.036 ± 0.517
1.294GluHis: 1.294 ± 0.284
5.102GluIle: 5.102 ± 0.708
4.721GluLys: 4.721 ± 0.62
6.015GluLeu: 6.015 ± 0.852
1.98GluMet: 1.98 ± 0.401
2.97GluAsn: 2.97 ± 0.517
1.599GluPro: 1.599 ± 0.419
2.97GluGln: 2.97 ± 0.42
2.208GluArg: 2.208 ± 0.379
5.406GluSer: 5.406 ± 0.675
2.513GluThr: 2.513 ± 0.524
5.178GluVal: 5.178 ± 0.6
0.914GluTrp: 0.914 ± 0.22
2.741GluTyr: 2.741 ± 0.574
0.0GluXaa: 0.0 ± 0.0
Phe
3.731PheAla: 3.731 ± 0.617
1.142PheCys: 1.142 ± 0.232
3.122PheAsp: 3.122 ± 0.467
2.741PheGlu: 2.741 ± 0.44
1.675PhePhe: 1.675 ± 0.387
3.35PheGly: 3.35 ± 0.544
0.609PheHis: 0.609 ± 0.197
3.655PheIle: 3.655 ± 0.524
3.883PheLys: 3.883 ± 0.577
3.046PheLeu: 3.046 ± 0.405
1.066PheMet: 1.066 ± 0.278
2.437PheAsn: 2.437 ± 0.429
0.99PhePro: 0.99 ± 0.327
1.599PheGln: 1.599 ± 0.384
1.827PheArg: 1.827 ± 0.353
2.208PheSer: 2.208 ± 0.335
2.513PheThr: 2.513 ± 0.421
3.122PheVal: 3.122 ± 0.472
0.761PheTrp: 0.761 ± 0.314
2.36PheTyr: 2.36 ± 0.437
0.0PheXaa: 0.0 ± 0.0
Gly
5.178GlyAla: 5.178 ± 0.877
0.685GlyCys: 0.685 ± 0.239
3.426GlyAsp: 3.426 ± 0.476
4.416GlyGlu: 4.416 ± 0.511
5.559GlyPhe: 5.559 ± 0.628
5.254GlyGly: 5.254 ± 0.799
1.294GlyHis: 1.294 ± 0.269
4.569GlyIle: 4.569 ± 0.656
4.949GlyLys: 4.949 ± 0.584
6.548GlyLeu: 6.548 ± 0.667
2.589GlyMet: 2.589 ± 0.355
3.503GlyAsn: 3.503 ± 0.496
0.457GlyPro: 0.457 ± 0.234
2.208GlyGln: 2.208 ± 0.454
1.904GlyArg: 1.904 ± 0.38
4.645GlySer: 4.645 ± 0.61
3.198GlyThr: 3.198 ± 0.579
5.863GlyVal: 5.863 ± 0.783
0.99GlyTrp: 0.99 ± 0.275
3.122GlyTyr: 3.122 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
1.066HisAla: 1.066 ± 0.293
0.381HisCys: 0.381 ± 0.157
1.371HisAsp: 1.371 ± 0.319
1.371HisGlu: 1.371 ± 0.314
0.533HisPhe: 0.533 ± 0.175
1.142HisGly: 1.142 ± 0.279
0.228HisHis: 0.228 ± 0.134
1.371HisIle: 1.371 ± 0.269
1.371HisLys: 1.371 ± 0.289
1.294HisLeu: 1.294 ± 0.304
0.457HisMet: 0.457 ± 0.168
1.218HisAsn: 1.218 ± 0.317
0.609HisPro: 0.609 ± 0.229
0.609HisGln: 0.609 ± 0.253
0.457HisArg: 0.457 ± 0.169
0.99HisSer: 0.99 ± 0.252
0.99HisThr: 0.99 ± 0.307
0.685HisVal: 0.685 ± 0.247
0.076HisTrp: 0.076 ± 0.065
0.761HisTyr: 0.761 ± 0.211
0.0HisXaa: 0.0 ± 0.0
Ile
4.645IleAla: 4.645 ± 0.546
1.066IleCys: 1.066 ± 0.309
4.264IleAsp: 4.264 ± 0.551
6.015IleGlu: 6.015 ± 0.809
2.208IlePhe: 2.208 ± 0.411
5.026IleGly: 5.026 ± 0.63
1.218IleHis: 1.218 ± 0.387
3.426IleIle: 3.426 ± 0.493
6.32IleLys: 6.32 ± 0.717
4.416IleLeu: 4.416 ± 0.55
1.371IleMet: 1.371 ± 0.293
3.731IleAsn: 3.731 ± 0.536
4.112IlePro: 4.112 ± 0.551
2.132IleGln: 2.132 ± 0.331
2.437IleArg: 2.437 ± 0.395
4.873IleSer: 4.873 ± 0.627
4.492IleThr: 4.492 ± 0.694
4.416IleVal: 4.416 ± 0.554
0.609IleTrp: 0.609 ± 0.185
1.904IleTyr: 1.904 ± 0.375
0.0IleXaa: 0.0 ± 0.0
Lys
6.32LysAla: 6.32 ± 0.831
0.99LysCys: 0.99 ± 0.359
4.416LysAsp: 4.416 ± 0.53
4.949LysGlu: 4.949 ± 0.664
2.97LysPhe: 2.97 ± 0.604
4.721LysGly: 4.721 ± 0.571
0.914LysHis: 0.914 ± 0.25
5.33LysIle: 5.33 ± 0.737
5.482LysLys: 5.482 ± 0.861
6.092LysLeu: 6.092 ± 0.587
2.817LysMet: 2.817 ± 0.42
3.655LysAsn: 3.655 ± 0.468
2.284LysPro: 2.284 ± 0.42
2.437LysGln: 2.437 ± 0.473
3.883LysArg: 3.883 ± 0.547
4.721LysSer: 4.721 ± 0.622
4.797LysThr: 4.797 ± 0.544
6.092LysVal: 6.092 ± 0.701
0.99LysTrp: 0.99 ± 0.235
2.513LysTyr: 2.513 ± 0.423
0.0LysXaa: 0.0 ± 0.0
Leu
6.092LeuAla: 6.092 ± 0.714
0.609LeuCys: 0.609 ± 0.209
5.254LeuAsp: 5.254 ± 0.642
6.092LeuGlu: 6.092 ± 0.758
2.97LeuPhe: 2.97 ± 0.534
5.863LeuGly: 5.863 ± 0.602
1.751LeuHis: 1.751 ± 0.355
5.102LeuIle: 5.102 ± 0.715
6.777LeuLys: 6.777 ± 0.835
6.777LeuLeu: 6.777 ± 0.705
2.132LeuMet: 2.132 ± 0.414
6.472LeuAsn: 6.472 ± 0.632
2.056LeuPro: 2.056 ± 0.419
2.36LeuGln: 2.36 ± 0.343
2.817LeuArg: 2.817 ± 0.424
5.787LeuSer: 5.787 ± 0.592
4.645LeuThr: 4.645 ± 0.5
4.797LeuVal: 4.797 ± 0.684
0.838LeuTrp: 0.838 ± 0.307
2.513LeuTyr: 2.513 ± 0.433
0.0LeuXaa: 0.0 ± 0.0
Met
1.599MetAla: 1.599 ± 0.339
0.457MetCys: 0.457 ± 0.185
1.447MetAsp: 1.447 ± 0.471
1.447MetGlu: 1.447 ± 0.344
0.99MetPhe: 0.99 ± 0.258
2.284MetGly: 2.284 ± 0.456
0.381MetHis: 0.381 ± 0.165
1.523MetIle: 1.523 ± 0.289
2.437MetLys: 2.437 ± 0.364
1.98MetLeu: 1.98 ± 0.365
0.609MetMet: 0.609 ± 0.221
2.893MetAsn: 2.893 ± 0.505
1.218MetPro: 1.218 ± 0.311
1.675MetGln: 1.675 ± 0.305
1.066MetArg: 1.066 ± 0.29
2.36MetSer: 2.36 ± 0.389
1.827MetThr: 1.827 ± 0.298
1.142MetVal: 1.142 ± 0.311
0.305MetTrp: 0.305 ± 0.18
0.457MetTyr: 0.457 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
3.579AsnAla: 3.579 ± 0.62
0.533AsnCys: 0.533 ± 0.187
3.959AsnAsp: 3.959 ± 0.588
3.046AsnGlu: 3.046 ± 0.372
1.827AsnPhe: 1.827 ± 0.418
5.254AsnGly: 5.254 ± 0.71
1.142AsnHis: 1.142 ± 0.274
3.731AsnIle: 3.731 ± 0.502
3.426AsnLys: 3.426 ± 0.611
4.873AsnLeu: 4.873 ± 0.625
1.523AsnMet: 1.523 ± 0.37
3.046AsnAsn: 3.046 ± 0.624
2.893AsnPro: 2.893 ± 0.486
2.208AsnGln: 2.208 ± 0.386
1.827AsnArg: 1.827 ± 0.379
4.416AsnSer: 4.416 ± 0.767
3.274AsnThr: 3.274 ± 0.447
3.122AsnVal: 3.122 ± 0.496
0.914AsnTrp: 0.914 ± 0.256
2.208AsnTyr: 2.208 ± 0.374
0.0AsnXaa: 0.0 ± 0.0
Pro
1.904ProAla: 1.904 ± 0.307
0.457ProCys: 0.457 ± 0.195
2.132ProAsp: 2.132 ± 0.371
2.817ProGlu: 2.817 ± 0.379
1.523ProPhe: 1.523 ± 0.344
0.076ProGly: 0.076 ± 0.067
0.457ProHis: 0.457 ± 0.203
2.665ProIle: 2.665 ± 0.466
2.513ProLys: 2.513 ± 0.398
2.284ProLeu: 2.284 ± 0.419
1.371ProMet: 1.371 ± 0.373
2.741ProAsn: 2.741 ± 0.442
0.761ProPro: 0.761 ± 0.233
1.599ProGln: 1.599 ± 0.333
0.838ProArg: 0.838 ± 0.238
2.132ProSer: 2.132 ± 0.428
2.132ProThr: 2.132 ± 0.397
1.447ProVal: 1.447 ± 0.314
0.305ProTrp: 0.305 ± 0.157
1.751ProTyr: 1.751 ± 0.397
0.0ProXaa: 0.0 ± 0.0
Gln
3.426GlnAla: 3.426 ± 0.534
0.381GlnCys: 0.381 ± 0.16
2.36GlnAsp: 2.36 ± 0.482
2.132GlnGlu: 2.132 ± 0.394
2.513GlnPhe: 2.513 ± 0.512
2.97GlnGly: 2.97 ± 0.513
0.609GlnHis: 0.609 ± 0.215
2.513GlnIle: 2.513 ± 0.468
3.122GlnLys: 3.122 ± 0.534
3.046GlnLeu: 3.046 ± 0.446
0.99GlnMet: 0.99 ± 0.303
2.208GlnAsn: 2.208 ± 0.434
1.066GlnPro: 1.066 ± 0.292
1.904GlnGln: 1.904 ± 0.39
1.218GlnArg: 1.218 ± 0.355
2.513GlnSer: 2.513 ± 0.526
2.132GlnThr: 2.132 ± 0.365
2.208GlnVal: 2.208 ± 0.374
0.761GlnTrp: 0.761 ± 0.227
1.827GlnTyr: 1.827 ± 0.412
0.0GlnXaa: 0.0 ± 0.0
Arg
1.371ArgAla: 1.371 ± 0.316
0.761ArgCys: 0.761 ± 0.24
1.827ArgAsp: 1.827 ± 0.441
2.589ArgGlu: 2.589 ± 0.581
1.98ArgPhe: 1.98 ± 0.361
2.437ArgGly: 2.437 ± 0.362
1.142ArgHis: 1.142 ± 0.292
3.122ArgIle: 3.122 ± 0.505
3.198ArgLys: 3.198 ± 0.462
2.513ArgLeu: 2.513 ± 0.361
0.761ArgMet: 0.761 ± 0.222
1.218ArgAsn: 1.218 ± 0.224
1.066ArgPro: 1.066 ± 0.298
1.142ArgGln: 1.142 ± 0.324
1.371ArgArg: 1.371 ± 0.426
2.589ArgSer: 2.589 ± 0.475
2.36ArgThr: 2.36 ± 0.42
2.284ArgVal: 2.284 ± 0.402
0.609ArgTrp: 0.609 ± 0.191
1.371ArgTyr: 1.371 ± 0.313
0.0ArgXaa: 0.0 ± 0.0
Ser
3.807SerAla: 3.807 ± 0.517
0.457SerCys: 0.457 ± 0.2
3.579SerAsp: 3.579 ± 0.53
4.492SerGlu: 4.492 ± 0.903
3.883SerPhe: 3.883 ± 0.584
5.102SerGly: 5.102 ± 0.552
1.218SerHis: 1.218 ± 0.274
5.787SerIle: 5.787 ± 0.572
5.482SerLys: 5.482 ± 0.665
5.711SerLeu: 5.711 ± 0.535
1.675SerMet: 1.675 ± 0.433
3.35SerAsn: 3.35 ± 0.452
1.675SerPro: 1.675 ± 0.378
2.741SerGln: 2.741 ± 0.358
2.132SerArg: 2.132 ± 0.317
3.731SerSer: 3.731 ± 0.531
3.046SerThr: 3.046 ± 0.519
4.112SerVal: 4.112 ± 0.621
0.914SerTrp: 0.914 ± 0.341
2.208SerTyr: 2.208 ± 0.39
0.0SerXaa: 0.0 ± 0.0
Thr
3.959ThrAla: 3.959 ± 0.619
0.685ThrCys: 0.685 ± 0.208
3.35ThrAsp: 3.35 ± 0.505
2.284ThrGlu: 2.284 ± 0.428
1.904ThrPhe: 1.904 ± 0.399
4.645ThrGly: 4.645 ± 0.505
0.761ThrHis: 0.761 ± 0.224
3.883ThrIle: 3.883 ± 0.614
3.122ThrLys: 3.122 ± 0.457
4.645ThrLeu: 4.645 ± 0.682
1.98ThrMet: 1.98 ± 0.351
2.893ThrAsn: 2.893 ± 0.48
2.589ThrPro: 2.589 ± 0.398
1.98ThrGln: 1.98 ± 0.354
1.98ThrArg: 1.98 ± 0.41
2.589ThrSer: 2.589 ± 0.389
3.426ThrThr: 3.426 ± 0.629
4.873ThrVal: 4.873 ± 0.836
1.447ThrTrp: 1.447 ± 0.336
1.98ThrTyr: 1.98 ± 0.362
0.0ThrXaa: 0.0 ± 0.0
Val
4.569ValAla: 4.569 ± 0.684
0.457ValCys: 0.457 ± 0.194
4.036ValAsp: 4.036 ± 0.43
4.873ValGlu: 4.873 ± 0.681
2.513ValPhe: 2.513 ± 0.391
4.721ValGly: 4.721 ± 0.629
0.609ValHis: 0.609 ± 0.197
4.873ValIle: 4.873 ± 0.831
4.797ValLys: 4.797 ± 0.509
5.026ValLeu: 5.026 ± 0.709
1.98ValMet: 1.98 ± 0.347
4.036ValAsn: 4.036 ± 0.6
1.751ValPro: 1.751 ± 0.289
2.97ValGln: 2.97 ± 0.475
2.056ValArg: 2.056 ± 0.31
3.731ValSer: 3.731 ± 0.506
3.579ValThr: 3.579 ± 0.562
3.731ValVal: 3.731 ± 0.542
1.142ValTrp: 1.142 ± 0.324
3.503ValTyr: 3.503 ± 0.556
0.0ValXaa: 0.0 ± 0.0
Trp
1.218TrpAla: 1.218 ± 0.331
0.228TrpCys: 0.228 ± 0.127
0.761TrpAsp: 0.761 ± 0.254
0.761TrpGlu: 0.761 ± 0.171
1.066TrpPhe: 1.066 ± 0.292
0.761TrpGly: 0.761 ± 0.205
0.152TrpHis: 0.152 ± 0.106
1.066TrpIle: 1.066 ± 0.309
0.838TrpLys: 0.838 ± 0.224
1.142TrpLeu: 1.142 ± 0.286
0.457TrpMet: 0.457 ± 0.157
0.838TrpAsn: 0.838 ± 0.252
0.076TrpPro: 0.076 ± 0.073
0.914TrpGln: 0.914 ± 0.238
0.99TrpArg: 0.99 ± 0.263
1.218TrpSer: 1.218 ± 0.348
0.914TrpThr: 0.914 ± 0.259
1.294TrpVal: 1.294 ± 0.311
0.152TrpTrp: 0.152 ± 0.101
0.305TrpTyr: 0.305 ± 0.132
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.36TyrAla: 2.36 ± 0.386
0.533TyrCys: 0.533 ± 0.192
3.122TyrAsp: 3.122 ± 0.538
2.665TyrGlu: 2.665 ± 0.599
1.904TyrPhe: 1.904 ± 0.415
3.046TyrGly: 3.046 ± 0.574
0.533TyrHis: 0.533 ± 0.268
2.208TyrIle: 2.208 ± 0.391
2.589TyrLys: 2.589 ± 0.354
3.35TyrLeu: 3.35 ± 0.572
0.914TyrMet: 0.914 ± 0.22
2.284TyrAsn: 2.284 ± 0.485
1.751TyrPro: 1.751 ± 0.467
1.751TyrGln: 1.751 ± 0.359
1.827TyrArg: 1.827 ± 0.407
2.284TyrSer: 2.284 ± 0.429
1.294TyrThr: 1.294 ± 0.336
1.827TyrVal: 1.827 ± 0.346
0.609TyrTrp: 0.609 ± 0.223
0.838TyrTyr: 0.838 ± 0.189
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (13134 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski