Amino acid dipepetide frequency for Moraxella phage Mcat3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.796AlaAla: 4.796 ± 0.923
1.028AlaCys: 1.028 ± 0.54
5.823AlaAsp: 5.823 ± 0.572
3.768AlaGlu: 3.768 ± 0.658
3.197AlaPhe: 3.197 ± 0.739
4.339AlaGly: 4.339 ± 0.74
1.941AlaHis: 1.941 ± 0.34
6.166AlaIle: 6.166 ± 0.667
8.107AlaLys: 8.107 ± 1.298
7.993AlaLeu: 7.993 ± 0.851
2.969AlaMet: 2.969 ± 0.665
3.882AlaAsn: 3.882 ± 0.877
2.284AlaPro: 2.284 ± 0.688
4.681AlaGln: 4.681 ± 0.834
3.996AlaArg: 3.996 ± 0.841
5.367AlaSer: 5.367 ± 0.774
6.965AlaThr: 6.965 ± 1.455
6.28AlaVal: 6.28 ± 0.645
1.713AlaTrp: 1.713 ± 0.441
2.969AlaTyr: 2.969 ± 0.569
0.0AlaXaa: 0.0 ± 0.0
Cys
0.571CysAla: 0.571 ± 0.272
0.114CysCys: 0.114 ± 0.145
0.685CysAsp: 0.685 ± 0.245
1.142CysGlu: 1.142 ± 0.402
0.457CysPhe: 0.457 ± 0.217
1.028CysGly: 1.028 ± 0.459
0.457CysHis: 0.457 ± 0.239
0.343CysIle: 0.343 ± 0.179
0.457CysLys: 0.457 ± 0.231
1.256CysLeu: 1.256 ± 0.429
0.114CysMet: 0.114 ± 0.145
0.457CysAsn: 0.457 ± 0.263
0.457CysPro: 0.457 ± 0.242
0.343CysGln: 0.343 ± 0.199
0.228CysArg: 0.228 ± 0.165
0.571CysSer: 0.571 ± 0.205
0.228CysThr: 0.228 ± 0.158
0.228CysVal: 0.228 ± 0.163
0.114CysTrp: 0.114 ± 0.102
0.571CysTyr: 0.571 ± 0.324
0.0CysXaa: 0.0 ± 0.0
Asp
3.768AspAla: 3.768 ± 0.62
0.571AspCys: 0.571 ± 0.322
6.965AspAsp: 6.965 ± 0.944
6.28AspGlu: 6.28 ± 0.802
2.855AspPhe: 2.855 ± 0.548
4.681AspGly: 4.681 ± 0.739
0.913AspHis: 0.913 ± 0.388
3.768AspIle: 3.768 ± 0.602
5.937AspLys: 5.937 ± 0.758
5.024AspLeu: 5.024 ± 0.924
1.37AspMet: 1.37 ± 0.413
3.768AspAsn: 3.768 ± 0.608
1.941AspPro: 1.941 ± 0.513
1.142AspGln: 1.142 ± 0.419
2.512AspArg: 2.512 ± 0.572
3.197AspSer: 3.197 ± 0.634
3.996AspThr: 3.996 ± 0.649
2.74AspVal: 2.74 ± 0.613
0.799AspTrp: 0.799 ± 0.329
2.855AspTyr: 2.855 ± 0.644
0.0AspXaa: 0.0 ± 0.0
Glu
3.54GluAla: 3.54 ± 0.54
0.571GluCys: 0.571 ± 0.277
1.028GluAsp: 1.028 ± 0.332
1.37GluGlu: 1.37 ± 0.39
2.626GluPhe: 2.626 ± 0.535
2.284GluGly: 2.284 ± 0.496
2.169GluHis: 2.169 ± 0.453
4.681GluIle: 4.681 ± 0.881
4.567GluLys: 4.567 ± 0.99
7.65GluLeu: 7.65 ± 0.841
1.599GluMet: 1.599 ± 0.446
2.855GluAsn: 2.855 ± 0.398
1.713GluPro: 1.713 ± 0.413
3.996GluGln: 3.996 ± 0.55
3.311GluArg: 3.311 ± 0.618
3.654GluSer: 3.654 ± 0.721
2.284GluThr: 2.284 ± 0.389
3.425GluVal: 3.425 ± 0.61
0.685GluTrp: 0.685 ± 0.326
2.74GluTyr: 2.74 ± 0.662
0.0GluXaa: 0.0 ± 0.0
Phe
3.425PheAla: 3.425 ± 0.723
0.685PheCys: 0.685 ± 0.327
2.969PheAsp: 2.969 ± 0.531
2.284PheGlu: 2.284 ± 0.524
1.028PhePhe: 1.028 ± 0.229
2.855PheGly: 2.855 ± 0.464
0.571PheHis: 0.571 ± 0.26
2.512PheIle: 2.512 ± 0.456
2.398PheLys: 2.398 ± 0.472
3.083PheLeu: 3.083 ± 0.713
1.37PheMet: 1.37 ± 0.36
1.713PheAsn: 1.713 ± 0.379
1.028PhePro: 1.028 ± 0.271
0.343PheGln: 0.343 ± 0.194
1.142PheArg: 1.142 ± 0.292
2.055PheSer: 2.055 ± 0.393
1.599PheThr: 1.599 ± 0.375
2.169PheVal: 2.169 ± 0.371
0.457PheTrp: 0.457 ± 0.205
1.713PheTyr: 1.713 ± 0.398
0.0PheXaa: 0.0 ± 0.0
Gly
4.111GlyAla: 4.111 ± 0.763
0.571GlyCys: 0.571 ± 0.33
4.111GlyAsp: 4.111 ± 0.555
5.709GlyGlu: 5.709 ± 0.768
2.055GlyPhe: 2.055 ± 0.563
5.252GlyGly: 5.252 ± 0.703
1.142GlyHis: 1.142 ± 0.351
5.138GlyIle: 5.138 ± 0.691
5.481GlyLys: 5.481 ± 0.598
5.595GlyLeu: 5.595 ± 0.715
2.284GlyMet: 2.284 ± 0.594
3.425GlyAsn: 3.425 ± 0.723
0.343GlyPro: 0.343 ± 0.35
2.055GlyGln: 2.055 ± 0.48
3.425GlyArg: 3.425 ± 0.522
2.512GlySer: 2.512 ± 0.485
2.74GlyThr: 2.74 ± 0.679
4.339GlyVal: 4.339 ± 0.637
0.571GlyTrp: 0.571 ± 0.304
2.74GlyTyr: 2.74 ± 0.657
0.0GlyXaa: 0.0 ± 0.0
His
3.083HisAla: 3.083 ± 0.596
0.114HisCys: 0.114 ± 0.111
1.484HisAsp: 1.484 ± 0.338
1.941HisGlu: 1.941 ± 0.349
1.028HisPhe: 1.028 ± 0.362
1.827HisGly: 1.827 ± 0.483
1.028HisHis: 1.028 ± 0.265
1.484HisIle: 1.484 ± 0.361
1.256HisLys: 1.256 ± 0.264
1.713HisLeu: 1.713 ± 0.479
0.343HisMet: 0.343 ± 0.162
1.142HisAsn: 1.142 ± 0.43
0.799HisPro: 0.799 ± 0.23
0.913HisGln: 0.913 ± 0.25
1.028HisArg: 1.028 ± 0.357
1.256HisSer: 1.256 ± 0.336
2.284HisThr: 2.284 ± 0.655
0.913HisVal: 0.913 ± 0.244
0.114HisTrp: 0.114 ± 0.108
0.799HisTyr: 0.799 ± 0.379
0.0HisXaa: 0.0 ± 0.0
Ile
7.079IleAla: 7.079 ± 0.925
0.799IleCys: 0.799 ± 0.297
5.823IleAsp: 5.823 ± 0.625
3.197IleGlu: 3.197 ± 0.8
2.74IlePhe: 2.74 ± 0.545
5.024IleGly: 5.024 ± 0.662
0.913IleHis: 0.913 ± 0.327
3.882IleIle: 3.882 ± 0.649
5.024IleLys: 5.024 ± 0.824
4.111IleLeu: 4.111 ± 0.644
1.028IleMet: 1.028 ± 0.359
4.225IleAsn: 4.225 ± 0.882
3.197IlePro: 3.197 ± 0.572
2.055IleGln: 2.055 ± 0.441
3.768IleArg: 3.768 ± 0.814
4.453IleSer: 4.453 ± 0.661
4.453IleThr: 4.453 ± 0.536
2.855IleVal: 2.855 ± 0.649
0.343IleTrp: 0.343 ± 0.239
2.74IleTyr: 2.74 ± 0.612
0.0IleXaa: 0.0 ± 0.0
Lys
7.308LysAla: 7.308 ± 0.637
0.571LysCys: 0.571 ± 0.224
4.91LysAsp: 4.91 ± 0.662
4.681LysGlu: 4.681 ± 0.718
2.398LysPhe: 2.398 ± 0.465
3.768LysGly: 3.768 ± 0.643
1.37LysHis: 1.37 ± 0.335
5.024LysIle: 5.024 ± 0.774
5.481LysLys: 5.481 ± 0.667
5.367LysLeu: 5.367 ± 0.606
2.284LysMet: 2.284 ± 0.479
3.083LysAsn: 3.083 ± 0.411
3.083LysPro: 3.083 ± 0.654
4.339LysGln: 4.339 ± 0.667
3.54LysArg: 3.54 ± 0.653
4.796LysSer: 4.796 ± 0.673
5.252LysThr: 5.252 ± 0.92
4.796LysVal: 4.796 ± 0.766
0.685LysTrp: 0.685 ± 0.385
1.827LysTyr: 1.827 ± 0.44
0.0LysXaa: 0.0 ± 0.0
Leu
7.536LeuAla: 7.536 ± 1.448
1.028LeuCys: 1.028 ± 0.457
7.422LeuAsp: 7.422 ± 1.243
4.111LeuGlu: 4.111 ± 0.851
3.197LeuPhe: 3.197 ± 0.635
6.394LeuGly: 6.394 ± 0.91
2.512LeuHis: 2.512 ± 0.643
6.052LeuIle: 6.052 ± 0.788
5.709LeuLys: 5.709 ± 0.827
6.965LeuLeu: 6.965 ± 0.934
1.713LeuMet: 1.713 ± 0.403
4.225LeuAsn: 4.225 ± 0.61
3.54LeuPro: 3.54 ± 0.671
3.197LeuGln: 3.197 ± 0.504
3.54LeuArg: 3.54 ± 0.554
7.65LeuSer: 7.65 ± 0.772
5.937LeuThr: 5.937 ± 0.689
5.823LeuVal: 5.823 ± 0.815
0.457LeuTrp: 0.457 ± 0.198
1.827LeuTyr: 1.827 ± 0.506
0.0LeuXaa: 0.0 ± 0.0
Met
3.197MetAla: 3.197 ± 0.761
0.228MetCys: 0.228 ± 0.147
1.599MetAsp: 1.599 ± 0.336
0.343MetGlu: 0.343 ± 0.251
0.913MetPhe: 0.913 ± 0.28
1.599MetGly: 1.599 ± 0.396
0.228MetHis: 0.228 ± 0.18
1.37MetIle: 1.37 ± 0.341
1.028MetLys: 1.028 ± 0.255
1.941MetLeu: 1.941 ± 0.545
0.685MetMet: 0.685 ± 0.27
1.941MetAsn: 1.941 ± 0.477
1.142MetPro: 1.142 ± 0.485
1.827MetGln: 1.827 ± 0.403
0.913MetArg: 0.913 ± 0.44
2.969MetSer: 2.969 ± 0.552
2.169MetThr: 2.169 ± 0.513
0.685MetVal: 0.685 ± 0.284
0.457MetTrp: 0.457 ± 0.225
0.457MetTyr: 0.457 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
5.138AsnAla: 5.138 ± 1.073
0.343AsnCys: 0.343 ± 0.183
2.512AsnAsp: 2.512 ± 0.456
2.512AsnGlu: 2.512 ± 0.42
1.484AsnPhe: 1.484 ± 0.482
2.855AsnGly: 2.855 ± 0.516
1.028AsnHis: 1.028 ± 0.265
2.512AsnIle: 2.512 ± 0.46
3.311AsnLys: 3.311 ± 0.567
3.425AsnLeu: 3.425 ± 0.685
1.256AsnMet: 1.256 ± 0.34
3.311AsnAsn: 3.311 ± 0.643
3.197AsnPro: 3.197 ± 0.882
3.54AsnGln: 3.54 ± 0.756
1.713AsnArg: 1.713 ± 0.325
3.311AsnSer: 3.311 ± 0.695
3.083AsnThr: 3.083 ± 0.668
2.169AsnVal: 2.169 ± 0.496
0.685AsnTrp: 0.685 ± 0.259
2.398AsnTyr: 2.398 ± 0.508
0.0AsnXaa: 0.0 ± 0.0
Pro
3.197ProAla: 3.197 ± 0.695
0.228ProCys: 0.228 ± 0.142
1.484ProAsp: 1.484 ± 0.341
1.256ProGlu: 1.256 ± 0.43
1.37ProPhe: 1.37 ± 0.37
0.571ProGly: 0.571 ± 0.483
0.571ProHis: 0.571 ± 0.189
1.941ProIle: 1.941 ± 0.534
4.111ProLys: 4.111 ± 0.658
3.768ProLeu: 3.768 ± 0.616
0.685ProMet: 0.685 ± 0.272
2.626ProAsn: 2.626 ± 0.444
1.599ProPro: 1.599 ± 0.487
1.713ProGln: 1.713 ± 0.473
1.256ProArg: 1.256 ± 0.286
2.398ProSer: 2.398 ± 0.622
2.74ProThr: 2.74 ± 0.661
2.055ProVal: 2.055 ± 0.44
0.114ProTrp: 0.114 ± 0.117
1.256ProTyr: 1.256 ± 0.384
0.0ProXaa: 0.0 ± 0.0
Gln
5.595GlnAla: 5.595 ± 1.011
0.343GlnCys: 0.343 ± 0.202
2.169GlnAsp: 2.169 ± 0.434
2.169GlnGlu: 2.169 ± 0.432
1.37GlnPhe: 1.37 ± 0.359
2.398GlnGly: 2.398 ± 0.579
0.685GlnHis: 0.685 ± 0.256
3.996GlnIle: 3.996 ± 0.782
4.225GlnLys: 4.225 ± 0.439
3.768GlnLeu: 3.768 ± 0.822
1.484GlnMet: 1.484 ± 0.363
3.425GlnAsn: 3.425 ± 0.643
1.484GlnPro: 1.484 ± 0.492
1.599GlnGln: 1.599 ± 0.568
1.713GlnArg: 1.713 ± 0.54
3.311GlnSer: 3.311 ± 0.644
2.855GlnThr: 2.855 ± 0.401
1.827GlnVal: 1.827 ± 0.462
0.343GlnTrp: 0.343 ± 0.227
0.913GlnTyr: 0.913 ± 0.313
0.0GlnXaa: 0.0 ± 0.0
Arg
4.111ArgAla: 4.111 ± 0.727
0.114ArgCys: 0.114 ± 0.107
2.398ArgAsp: 2.398 ± 0.459
2.512ArgGlu: 2.512 ± 0.504
2.055ArgPhe: 2.055 ± 0.634
1.941ArgGly: 1.941 ± 0.654
1.484ArgHis: 1.484 ± 0.352
3.083ArgIle: 3.083 ± 0.666
2.74ArgLys: 2.74 ± 0.579
4.796ArgLeu: 4.796 ± 0.73
1.37ArgMet: 1.37 ± 0.365
1.028ArgAsn: 1.028 ± 0.328
1.941ArgPro: 1.941 ± 0.504
2.398ArgGln: 2.398 ± 0.524
1.713ArgArg: 1.713 ± 0.308
2.512ArgSer: 2.512 ± 0.412
2.512ArgThr: 2.512 ± 0.577
2.626ArgVal: 2.626 ± 0.398
0.343ArgTrp: 0.343 ± 0.191
1.941ArgTyr: 1.941 ± 0.454
0.0ArgXaa: 0.0 ± 0.0
Ser
5.481SerAla: 5.481 ± 0.886
0.799SerCys: 0.799 ± 0.352
3.882SerAsp: 3.882 ± 0.681
3.996SerGlu: 3.996 ± 0.641
1.713SerPhe: 1.713 ± 0.447
4.111SerGly: 4.111 ± 0.885
2.74SerHis: 2.74 ± 0.685
4.453SerIle: 4.453 ± 0.698
3.425SerLys: 3.425 ± 1.019
5.937SerLeu: 5.937 ± 0.862
1.484SerMet: 1.484 ± 0.272
2.284SerAsn: 2.284 ± 0.484
1.142SerPro: 1.142 ± 0.437
3.768SerGln: 3.768 ± 0.679
2.626SerArg: 2.626 ± 0.453
2.855SerSer: 2.855 ± 0.636
3.311SerThr: 3.311 ± 0.548
5.367SerVal: 5.367 ± 0.967
0.685SerTrp: 0.685 ± 0.245
1.599SerTyr: 1.599 ± 0.461
0.0SerXaa: 0.0 ± 0.0
Thr
6.394ThrAla: 6.394 ± 0.987
0.457ThrCys: 0.457 ± 0.218
5.252ThrAsp: 5.252 ± 0.687
3.54ThrGlu: 3.54 ± 0.538
1.484ThrPhe: 1.484 ± 0.408
5.481ThrGly: 5.481 ± 0.966
2.512ThrHis: 2.512 ± 0.637
3.882ThrIle: 3.882 ± 0.542
5.367ThrLys: 5.367 ± 0.719
6.166ThrLeu: 6.166 ± 0.746
1.256ThrMet: 1.256 ± 0.391
2.055ThrAsn: 2.055 ± 0.361
3.311ThrPro: 3.311 ± 0.617
2.398ThrGln: 2.398 ± 0.442
1.599ThrArg: 1.599 ± 0.526
3.197ThrSer: 3.197 ± 0.72
3.654ThrThr: 3.654 ± 0.589
3.197ThrVal: 3.197 ± 0.517
0.913ThrTrp: 0.913 ± 0.287
0.799ThrTyr: 0.799 ± 0.383
0.0ThrXaa: 0.0 ± 0.0
Val
5.252ValAla: 5.252 ± 0.919
0.799ValCys: 0.799 ± 0.294
2.398ValAsp: 2.398 ± 0.561
2.855ValGlu: 2.855 ± 0.659
1.941ValPhe: 1.941 ± 0.49
4.796ValGly: 4.796 ± 0.743
0.913ValHis: 0.913 ± 0.351
4.681ValIle: 4.681 ± 0.995
3.996ValLys: 3.996 ± 0.808
5.138ValLeu: 5.138 ± 0.767
1.256ValMet: 1.256 ± 0.359
2.512ValAsn: 2.512 ± 0.619
1.599ValPro: 1.599 ± 0.487
2.284ValGln: 2.284 ± 0.422
3.654ValArg: 3.654 ± 1.249
4.111ValSer: 4.111 ± 0.712
3.882ValThr: 3.882 ± 0.735
2.969ValVal: 2.969 ± 0.589
0.799ValTrp: 0.799 ± 0.227
1.37ValTyr: 1.37 ± 0.338
0.0ValXaa: 0.0 ± 0.0
Trp
1.256TrpAla: 1.256 ± 0.359
0.343TrpCys: 0.343 ± 0.228
0.228TrpAsp: 0.228 ± 0.138
0.913TrpGlu: 0.913 ± 0.352
0.114TrpPhe: 0.114 ± 0.107
0.571TrpGly: 0.571 ± 0.281
0.228TrpHis: 0.228 ± 0.157
0.457TrpIle: 0.457 ± 0.209
0.571TrpLys: 0.571 ± 0.232
1.599TrpLeu: 1.599 ± 0.392
0.0TrpMet: 0.0 ± 0.123
0.457TrpAsn: 0.457 ± 0.149
0.0TrpPro: 0.0 ± 0.0
1.599TrpGln: 1.599 ± 0.599
0.343TrpArg: 0.343 ± 0.215
0.228TrpSer: 0.228 ± 0.167
0.685TrpThr: 0.685 ± 0.196
0.799TrpVal: 0.799 ± 0.382
0.343TrpTrp: 0.343 ± 0.202
0.343TrpTyr: 0.343 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.425TyrAla: 3.425 ± 0.57
0.114TyrCys: 0.114 ± 0.115
1.941TyrAsp: 1.941 ± 0.388
2.169TyrGlu: 2.169 ± 0.401
1.37TyrPhe: 1.37 ± 0.561
1.827TyrGly: 1.827 ± 0.497
1.142TyrHis: 1.142 ± 0.427
2.284TyrIle: 2.284 ± 0.63
1.37TyrLys: 1.37 ± 0.356
3.54TyrLeu: 3.54 ± 0.752
1.028TyrMet: 1.028 ± 0.335
1.37TyrAsn: 1.37 ± 0.444
1.142TyrPro: 1.142 ± 0.4
1.484TyrGln: 1.484 ± 0.401
1.599TyrArg: 1.599 ± 0.386
1.256TyrSer: 1.256 ± 0.386
2.284TyrThr: 2.284 ± 0.509
1.941TyrVal: 1.941 ± 0.393
0.571TyrTrp: 0.571 ± 0.271
1.028TyrTyr: 1.028 ± 0.41
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (8759 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski