Amino acid dipepetide frequency for uncultured phage_MedDCM-OCT-S39-C11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.037AlaAla: 10.037 ± 1.497
0.235AlaCys: 0.235 ± 0.182
6.508AlaAsp: 6.508 ± 0.925
6.195AlaGlu: 6.195 ± 0.922
3.685AlaPhe: 3.685 ± 0.462
8.155AlaGly: 8.155 ± 0.879
0.627AlaHis: 0.627 ± 0.218
5.332AlaIle: 5.332 ± 0.699
5.097AlaLys: 5.097 ± 0.917
8.233AlaLeu: 8.233 ± 1.027
2.196AlaMet: 2.196 ± 0.461
4.391AlaAsn: 4.391 ± 0.886
3.921AlaPro: 3.921 ± 0.662
5.332AlaGln: 5.332 ± 1.001
5.646AlaArg: 5.646 ± 1.193
5.803AlaSer: 5.803 ± 0.593
5.41AlaThr: 5.41 ± 0.751
5.803AlaVal: 5.803 ± 0.725
1.49AlaTrp: 1.49 ± 0.406
2.352AlaTyr: 2.352 ± 0.49
0.0AlaXaa: 0.0 ± 0.0
Cys
0.47CysAla: 0.47 ± 0.211
0.157CysCys: 0.157 ± 0.131
0.157CysAsp: 0.157 ± 0.109
0.314CysGlu: 0.314 ± 0.178
0.078CysPhe: 0.078 ± 0.081
0.314CysGly: 0.314 ± 0.129
0.157CysHis: 0.157 ± 0.108
0.392CysIle: 0.392 ± 0.24
0.314CysLys: 0.314 ± 0.217
0.784CysLeu: 0.784 ± 0.248
0.235CysMet: 0.235 ± 0.16
0.157CysAsn: 0.157 ± 0.106
0.314CysPro: 0.314 ± 0.168
0.392CysGln: 0.392 ± 0.198
1.098CysArg: 1.098 ± 0.462
0.549CysSer: 0.549 ± 0.258
0.627CysThr: 0.627 ± 0.246
0.47CysVal: 0.47 ± 0.245
0.314CysTrp: 0.314 ± 0.169
0.314CysTyr: 0.314 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
5.254AspAla: 5.254 ± 0.694
0.47AspCys: 0.47 ± 0.209
3.607AspAsp: 3.607 ± 0.579
3.921AspGlu: 3.921 ± 0.606
1.96AspPhe: 1.96 ± 0.493
6.116AspGly: 6.116 ± 1.441
0.627AspHis: 0.627 ± 0.191
3.215AspIle: 3.215 ± 0.622
3.529AspLys: 3.529 ± 0.467
5.254AspLeu: 5.254 ± 0.642
1.411AspMet: 1.411 ± 0.37
3.137AspAsn: 3.137 ± 0.753
3.215AspPro: 3.215 ± 0.439
3.372AspGln: 3.372 ± 0.505
3.529AspArg: 3.529 ± 0.816
3.45AspSer: 3.45 ± 0.435
3.764AspThr: 3.764 ± 0.895
4.548AspVal: 4.548 ± 0.591
1.019AspTrp: 1.019 ± 0.364
2.588AspTyr: 2.588 ± 0.595
0.0AspXaa: 0.0 ± 0.0
Glu
6.665GluAla: 6.665 ± 0.864
0.47GluCys: 0.47 ± 0.16
2.901GluAsp: 2.901 ± 0.45
3.058GluGlu: 3.058 ± 0.638
2.274GluPhe: 2.274 ± 0.592
3.999GluGly: 3.999 ± 0.497
0.392GluHis: 0.392 ± 0.252
3.685GluIle: 3.685 ± 0.595
2.823GluLys: 2.823 ± 0.625
5.724GluLeu: 5.724 ± 0.868
1.568GluMet: 1.568 ± 0.569
1.568GluAsn: 1.568 ± 0.351
3.45GluPro: 3.45 ± 0.736
3.215GluGln: 3.215 ± 0.441
3.764GluArg: 3.764 ± 0.627
3.137GluSer: 3.137 ± 0.575
3.685GluThr: 3.685 ± 0.549
4.156GluVal: 4.156 ± 0.544
1.019GluTrp: 1.019 ± 0.329
1.568GluTyr: 1.568 ± 0.328
0.0GluXaa: 0.0 ± 0.0
Phe
3.215PheAla: 3.215 ± 0.479
0.157PheCys: 0.157 ± 0.094
3.529PheAsp: 3.529 ± 0.722
1.49PheGlu: 1.49 ± 0.366
0.863PhePhe: 0.863 ± 0.276
2.196PheGly: 2.196 ± 0.401
0.157PheHis: 0.157 ± 0.096
1.098PheIle: 1.098 ± 0.307
1.176PheLys: 1.176 ± 0.319
2.588PheLeu: 2.588 ± 0.449
0.627PheMet: 0.627 ± 0.255
2.509PheAsn: 2.509 ± 0.458
0.941PhePro: 0.941 ± 0.237
1.49PheGln: 1.49 ± 0.344
2.039PheArg: 2.039 ± 0.497
2.039PheSer: 2.039 ± 0.476
2.431PheThr: 2.431 ± 0.613
2.352PheVal: 2.352 ± 0.35
0.549PheTrp: 0.549 ± 0.204
0.784PheTyr: 0.784 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
7.528GlyAla: 7.528 ± 0.932
0.627GlyCys: 0.627 ± 0.232
3.999GlyAsp: 3.999 ± 0.562
4.626GlyGlu: 4.626 ± 0.715
2.666GlyPhe: 2.666 ± 0.502
7.92GlyGly: 7.92 ± 1.594
0.784GlyHis: 0.784 ± 0.255
3.607GlyIle: 3.607 ± 0.599
3.764GlyLys: 3.764 ± 0.659
6.351GlyLeu: 6.351 ± 0.99
1.725GlyMet: 1.725 ± 0.478
4.94GlyAsn: 4.94 ± 1.018
2.509GlyPro: 2.509 ± 0.425
3.842GlyGln: 3.842 ± 0.385
3.921GlyArg: 3.921 ± 0.642
7.92GlySer: 7.92 ± 1.618
5.567GlyThr: 5.567 ± 0.813
6.195GlyVal: 6.195 ± 0.902
1.568GlyTrp: 1.568 ± 0.354
2.666GlyTyr: 2.666 ± 0.402
0.0GlyXaa: 0.0 ± 0.0
His
0.47HisAla: 0.47 ± 0.227
0.235HisCys: 0.235 ± 0.127
1.019HisAsp: 1.019 ± 0.365
0.627HisGlu: 0.627 ± 0.264
0.706HisPhe: 0.706 ± 0.243
0.392HisGly: 0.392 ± 0.167
0.078HisHis: 0.078 ± 0.087
0.314HisIle: 0.314 ± 0.129
0.314HisLys: 0.314 ± 0.146
1.098HisLeu: 1.098 ± 0.397
0.235HisMet: 0.235 ± 0.131
0.314HisAsn: 0.314 ± 0.148
0.314HisPro: 0.314 ± 0.176
0.392HisGln: 0.392 ± 0.208
0.392HisArg: 0.392 ± 0.214
0.784HisSer: 0.784 ± 0.333
0.627HisThr: 0.627 ± 0.246
0.627HisVal: 0.627 ± 0.232
0.314HisTrp: 0.314 ± 0.16
0.157HisTyr: 0.157 ± 0.129
0.0HisXaa: 0.0 ± 0.0
Ile
4.94IleAla: 4.94 ± 0.637
0.235IleCys: 0.235 ± 0.152
4.077IleAsp: 4.077 ± 0.462
3.215IleGlu: 3.215 ± 0.558
1.098IlePhe: 1.098 ± 0.289
4.391IleGly: 4.391 ± 0.448
0.549IleHis: 0.549 ± 0.229
1.176IleIle: 1.176 ± 0.178
2.588IleLys: 2.588 ± 0.459
2.588IleLeu: 2.588 ± 0.438
0.314IleMet: 0.314 ± 0.117
2.666IleAsn: 2.666 ± 0.453
2.588IlePro: 2.588 ± 0.465
2.509IleGln: 2.509 ± 0.538
2.588IleArg: 2.588 ± 0.288
3.45IleSer: 3.45 ± 0.66
2.901IleThr: 2.901 ± 0.593
2.98IleVal: 2.98 ± 0.463
0.549IleTrp: 0.549 ± 0.244
1.255IleTyr: 1.255 ± 0.411
0.0IleXaa: 0.0 ± 0.0
Lys
5.803LysAla: 5.803 ± 1.124
0.314LysCys: 0.314 ± 0.238
3.293LysAsp: 3.293 ± 0.599
2.352LysGlu: 2.352 ± 0.741
1.568LysPhe: 1.568 ± 0.398
4.47LysGly: 4.47 ± 0.993
0.47LysHis: 0.47 ± 0.215
1.803LysIle: 1.803 ± 0.365
1.96LysLys: 1.96 ± 0.365
5.41LysLeu: 5.41 ± 0.72
0.627LysMet: 0.627 ± 0.263
1.882LysAsn: 1.882 ± 0.334
1.647LysPro: 1.647 ± 0.263
2.509LysGln: 2.509 ± 0.387
2.98LysArg: 2.98 ± 0.657
2.509LysSer: 2.509 ± 0.396
3.293LysThr: 3.293 ± 0.464
3.529LysVal: 3.529 ± 0.495
0.627LysTrp: 0.627 ± 0.219
1.882LysTyr: 1.882 ± 0.376
0.0LysXaa: 0.0 ± 0.0
Leu
7.998LeuAla: 7.998 ± 1.357
0.784LeuCys: 0.784 ± 0.27
6.038LeuAsp: 6.038 ± 0.721
4.862LeuGlu: 4.862 ± 0.774
2.039LeuPhe: 2.039 ± 0.302
6.195LeuGly: 6.195 ± 1.28
0.549LeuHis: 0.549 ± 0.21
3.764LeuIle: 3.764 ± 0.639
4.862LeuLys: 4.862 ± 0.754
5.881LeuLeu: 5.881 ± 0.851
2.196LeuMet: 2.196 ± 0.672
4.156LeuAsn: 4.156 ± 0.626
3.45LeuPro: 3.45 ± 0.541
4.234LeuGln: 4.234 ± 0.909
4.234LeuArg: 4.234 ± 0.453
6.822LeuSer: 6.822 ± 0.609
6.038LeuThr: 6.038 ± 0.757
5.332LeuVal: 5.332 ± 0.833
0.863LeuTrp: 0.863 ± 0.266
2.431LeuTyr: 2.431 ± 0.57
0.0LeuXaa: 0.0 ± 0.0
Met
3.137MetAla: 3.137 ± 0.812
0.0MetCys: 0.0 ± 0.0
0.941MetAsp: 0.941 ± 0.273
1.333MetGlu: 1.333 ± 0.4
0.47MetPhe: 0.47 ± 0.176
1.411MetGly: 1.411 ± 0.384
0.078MetHis: 0.078 ± 0.075
0.627MetIle: 0.627 ± 0.239
1.019MetLys: 1.019 ± 0.337
2.509MetLeu: 2.509 ± 0.447
0.706MetMet: 0.706 ± 0.233
0.784MetAsn: 0.784 ± 0.288
1.49MetPro: 1.49 ± 0.327
1.411MetGln: 1.411 ± 0.475
1.255MetArg: 1.255 ± 0.342
2.352MetSer: 2.352 ± 0.427
1.176MetThr: 1.176 ± 0.346
1.176MetVal: 1.176 ± 0.404
0.392MetTrp: 0.392 ± 0.237
0.784MetTyr: 0.784 ± 0.286
0.0MetXaa: 0.0 ± 0.0
Asn
4.391AsnAla: 4.391 ± 0.851
0.314AsnCys: 0.314 ± 0.165
2.274AsnAsp: 2.274 ± 0.53
2.588AsnGlu: 2.588 ± 0.568
1.647AsnPhe: 1.647 ± 0.404
5.175AsnGly: 5.175 ± 1.467
0.627AsnHis: 0.627 ± 0.222
1.568AsnIle: 1.568 ± 0.355
1.803AsnLys: 1.803 ± 0.471
3.764AsnLeu: 3.764 ± 0.623
0.863AsnMet: 0.863 ± 0.306
2.509AsnAsn: 2.509 ± 0.652
3.058AsnPro: 3.058 ± 0.707
2.274AsnGln: 2.274 ± 0.536
2.431AsnArg: 2.431 ± 0.554
3.764AsnSer: 3.764 ± 0.722
4.705AsnThr: 4.705 ± 0.991
2.431AsnVal: 2.431 ± 0.367
0.47AsnTrp: 0.47 ± 0.159
2.039AsnTyr: 2.039 ± 0.366
0.0AsnXaa: 0.0 ± 0.0
Pro
3.764ProAla: 3.764 ± 0.991
0.235ProCys: 0.235 ± 0.159
2.039ProAsp: 2.039 ± 0.504
3.529ProGlu: 3.529 ± 0.725
1.725ProPhe: 1.725 ± 0.493
4.234ProGly: 4.234 ± 0.68
0.549ProHis: 0.549 ± 0.186
1.96ProIle: 1.96 ± 0.475
2.196ProLys: 2.196 ± 0.557
3.764ProLeu: 3.764 ± 0.708
0.784ProMet: 0.784 ± 0.277
1.647ProAsn: 1.647 ± 0.338
1.568ProPro: 1.568 ± 0.455
1.568ProGln: 1.568 ± 0.6
1.333ProArg: 1.333 ± 0.475
3.842ProSer: 3.842 ± 0.446
4.47ProThr: 4.47 ± 0.696
2.744ProVal: 2.744 ± 0.489
0.863ProTrp: 0.863 ± 0.344
1.333ProTyr: 1.333 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
5.489GlnAla: 5.489 ± 1.261
0.314GlnCys: 0.314 ± 0.154
2.901GlnAsp: 2.901 ± 0.589
2.588GlnGlu: 2.588 ± 0.485
1.333GlnPhe: 1.333 ± 0.29
3.137GlnGly: 3.137 ± 0.593
0.549GlnHis: 0.549 ± 0.242
2.666GlnIle: 2.666 ± 0.525
2.352GlnLys: 2.352 ± 0.429
5.41GlnLeu: 5.41 ± 0.742
1.176GlnMet: 1.176 ± 0.374
1.725GlnAsn: 1.725 ± 0.297
1.803GlnPro: 1.803 ± 0.393
3.529GlnGln: 3.529 ± 0.585
4.47GlnArg: 4.47 ± 0.907
3.215GlnSer: 3.215 ± 0.43
2.352GlnThr: 2.352 ± 0.544
2.98GlnVal: 2.98 ± 0.328
0.784GlnTrp: 0.784 ± 0.255
1.411GlnTyr: 1.411 ± 0.341
0.0GlnXaa: 0.0 ± 0.0
Arg
5.881ArgAla: 5.881 ± 0.882
0.157ArgCys: 0.157 ± 0.096
3.529ArgAsp: 3.529 ± 0.52
3.215ArgGlu: 3.215 ± 0.705
2.352ArgPhe: 2.352 ± 0.43
3.607ArgGly: 3.607 ± 0.614
0.627ArgHis: 0.627 ± 0.265
2.744ArgIle: 2.744 ± 0.467
1.725ArgLys: 1.725 ± 0.503
4.626ArgLeu: 4.626 ± 0.728
2.196ArgMet: 2.196 ± 0.697
2.588ArgAsn: 2.588 ± 0.344
2.274ArgPro: 2.274 ± 0.373
2.352ArgGln: 2.352 ± 0.741
2.744ArgArg: 2.744 ± 0.672
3.764ArgSer: 3.764 ± 0.49
2.588ArgThr: 2.588 ± 0.663
3.529ArgVal: 3.529 ± 0.56
0.941ArgTrp: 0.941 ± 0.239
1.882ArgTyr: 1.882 ± 0.467
0.0ArgXaa: 0.0 ± 0.0
Ser
5.803SerAla: 5.803 ± 1.287
0.627SerCys: 0.627 ± 0.277
4.077SerAsp: 4.077 ± 0.846
4.862SerGlu: 4.862 ± 0.643
1.882SerPhe: 1.882 ± 0.371
6.587SerGly: 6.587 ± 2.088
0.784SerHis: 0.784 ± 0.338
3.842SerIle: 3.842 ± 0.596
3.685SerLys: 3.685 ± 0.621
5.724SerLeu: 5.724 ± 0.678
2.196SerMet: 2.196 ± 0.523
3.529SerAsn: 3.529 ± 0.608
2.274SerPro: 2.274 ± 0.612
3.293SerGln: 3.293 ± 0.614
3.372SerArg: 3.372 ± 0.584
4.626SerSer: 4.626 ± 1.205
5.175SerThr: 5.175 ± 0.675
3.921SerVal: 3.921 ± 0.548
1.568SerTrp: 1.568 ± 0.369
2.509SerTyr: 2.509 ± 0.567
0.0SerXaa: 0.0 ± 0.0
Thr
6.195ThrAla: 6.195 ± 0.862
0.784ThrCys: 0.784 ± 0.231
4.391ThrAsp: 4.391 ± 0.513
3.293ThrGlu: 3.293 ± 0.494
2.352ThrPhe: 2.352 ± 0.417
6.665ThrGly: 6.665 ± 0.774
0.706ThrHis: 0.706 ± 0.226
3.685ThrIle: 3.685 ± 0.88
3.529ThrLys: 3.529 ± 0.598
5.332ThrLeu: 5.332 ± 0.85
1.333ThrMet: 1.333 ± 0.311
2.823ThrAsn: 2.823 ± 0.627
4.94ThrPro: 4.94 ± 0.643
2.274ThrGln: 2.274 ± 0.402
1.96ThrArg: 1.96 ± 0.434
5.018ThrSer: 5.018 ± 1.254
5.254ThrThr: 5.254 ± 1.08
4.94ThrVal: 4.94 ± 0.885
0.627ThrTrp: 0.627 ± 0.265
2.509ThrTyr: 2.509 ± 0.451
0.0ThrXaa: 0.0 ± 0.0
Val
6.508ValAla: 6.508 ± 0.619
0.863ValCys: 0.863 ± 0.308
6.038ValAsp: 6.038 ± 0.922
4.391ValGlu: 4.391 ± 0.621
2.196ValPhe: 2.196 ± 0.447
4.313ValGly: 4.313 ± 0.467
0.47ValHis: 0.47 ± 0.219
2.666ValIle: 2.666 ± 0.51
2.98ValLys: 2.98 ± 0.461
4.234ValLeu: 4.234 ± 0.49
1.411ValMet: 1.411 ± 0.323
4.313ValAsn: 4.313 ± 0.708
2.901ValPro: 2.901 ± 0.446
3.372ValGln: 3.372 ± 0.424
3.058ValArg: 3.058 ± 0.565
3.685ValSer: 3.685 ± 0.558
5.332ValThr: 5.332 ± 0.68
3.685ValVal: 3.685 ± 0.625
1.019ValTrp: 1.019 ± 0.332
1.568ValTyr: 1.568 ± 0.318
0.0ValXaa: 0.0 ± 0.0
Trp
1.333TrpAla: 1.333 ± 0.403
0.392TrpCys: 0.392 ± 0.181
0.706TrpAsp: 0.706 ± 0.178
1.019TrpGlu: 1.019 ± 0.246
0.47TrpPhe: 0.47 ± 0.171
1.019TrpGly: 1.019 ± 0.309
0.549TrpHis: 0.549 ± 0.273
0.549TrpIle: 0.549 ± 0.201
0.863TrpLys: 0.863 ± 0.36
1.255TrpLeu: 1.255 ± 0.291
0.549TrpMet: 0.549 ± 0.209
0.863TrpAsn: 0.863 ± 0.277
0.549TrpPro: 0.549 ± 0.253
1.098TrpGln: 1.098 ± 0.379
0.47TrpArg: 0.47 ± 0.222
1.255TrpSer: 1.255 ± 0.646
1.255TrpThr: 1.255 ± 0.297
1.255TrpVal: 1.255 ± 0.389
0.235TrpTrp: 0.235 ± 0.178
0.078TrpTyr: 0.078 ± 0.067
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.647TyrAla: 1.647 ± 0.453
0.235TyrCys: 0.235 ± 0.15
1.96TyrAsp: 1.96 ± 0.385
1.647TyrGlu: 1.647 ± 0.338
0.863TyrPhe: 0.863 ± 0.324
2.431TyrGly: 2.431 ± 0.638
0.078TyrHis: 0.078 ± 0.089
1.96TyrIle: 1.96 ± 0.415
2.352TyrLys: 2.352 ± 0.397
2.196TyrLeu: 2.196 ± 0.525
0.627TyrMet: 0.627 ± 0.243
2.196TyrAsn: 2.196 ± 0.542
0.941TyrPro: 0.941 ± 0.297
1.725TyrGln: 1.725 ± 0.353
2.039TyrArg: 2.039 ± 0.374
2.352TyrSer: 2.352 ± 0.365
1.96TyrThr: 1.96 ± 0.47
2.352TyrVal: 2.352 ± 0.573
0.549TyrTrp: 0.549 ± 0.309
0.941TyrTyr: 0.941 ± 0.343
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 32 proteins (12754 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski