Amino acid dipepetide frequency for uncultured phage MedDCM-OCT-S04-C26

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.511AlaAla: 9.511 ± 1.108
0.827AlaCys: 0.827 ± 0.285
4.032AlaAsp: 4.032 ± 0.648
4.652AlaGlu: 4.652 ± 0.86
5.479AlaPhe: 5.479 ± 0.801
6.616AlaGly: 6.616 ± 0.954
1.241AlaHis: 1.241 ± 0.361
4.239AlaIle: 4.239 ± 0.816
4.859AlaLys: 4.859 ± 0.627
5.996AlaLeu: 5.996 ± 0.698
1.551AlaMet: 1.551 ± 0.46
4.756AlaAsn: 4.756 ± 0.537
2.585AlaPro: 2.585 ± 0.478
2.481AlaGln: 2.481 ± 0.623
3.722AlaArg: 3.722 ± 0.606
5.996AlaSer: 5.996 ± 0.884
8.581AlaThr: 8.581 ± 1.996
6.099AlaVal: 6.099 ± 0.895
0.724AlaTrp: 0.724 ± 0.247
2.068AlaTyr: 2.068 ± 0.433
0.0AlaXaa: 0.0 ± 0.0
Cys
0.517CysAla: 0.517 ± 0.226
0.0CysCys: 0.0 ± 0.0
0.31CysAsp: 0.31 ± 0.157
0.517CysGlu: 0.517 ± 0.326
0.827CysPhe: 0.827 ± 0.281
0.62CysGly: 0.62 ± 0.275
0.103CysHis: 0.103 ± 0.097
0.207CysIle: 0.207 ± 0.129
0.62CysLys: 0.62 ± 0.243
0.414CysLeu: 0.414 ± 0.221
0.0CysMet: 0.0 ± 0.0
0.827CysAsn: 0.827 ± 0.249
0.414CysPro: 0.414 ± 0.227
0.517CysGln: 0.517 ± 0.219
0.827CysArg: 0.827 ± 0.31
0.62CysSer: 0.62 ± 0.235
0.827CysThr: 0.827 ± 0.375
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.517CysTyr: 0.517 ± 0.231
0.0CysXaa: 0.0 ± 0.0
Asp
4.445AspAla: 4.445 ± 0.652
0.31AspCys: 0.31 ± 0.183
2.998AspAsp: 2.998 ± 0.546
2.791AspGlu: 2.791 ± 0.53
3.308AspPhe: 3.308 ± 0.602
4.239AspGly: 4.239 ± 0.777
1.034AspHis: 1.034 ± 0.326
4.549AspIle: 4.549 ± 0.768
3.722AspLys: 3.722 ± 0.614
4.239AspLeu: 4.239 ± 0.674
1.241AspMet: 1.241 ± 0.321
2.171AspAsn: 2.171 ± 0.401
1.861AspPro: 1.861 ± 0.343
2.378AspGln: 2.378 ± 0.384
1.861AspArg: 1.861 ± 0.545
5.169AspSer: 5.169 ± 0.759
4.445AspThr: 4.445 ± 0.674
2.791AspVal: 2.791 ± 0.554
0.517AspTrp: 0.517 ± 0.261
1.344AspTyr: 1.344 ± 0.323
0.0AspXaa: 0.0 ± 0.0
Glu
4.652GluAla: 4.652 ± 0.868
0.517GluCys: 0.517 ± 0.238
1.551GluAsp: 1.551 ± 0.323
2.688GluGlu: 2.688 ± 0.644
2.274GluPhe: 2.274 ± 0.496
2.585GluGly: 2.585 ± 0.539
0.517GluHis: 0.517 ± 0.267
3.825GluIle: 3.825 ± 0.663
4.239GluLys: 4.239 ± 0.839
7.34GluLeu: 7.34 ± 1.118
1.654GluMet: 1.654 ± 0.447
2.068GluAsn: 2.068 ± 0.361
1.861GluPro: 1.861 ± 0.443
3.101GluGln: 3.101 ± 0.717
2.688GluArg: 2.688 ± 0.648
4.032GluSer: 4.032 ± 0.642
3.205GluThr: 3.205 ± 0.601
3.412GluVal: 3.412 ± 0.853
0.724GluTrp: 0.724 ± 0.239
1.861GluTyr: 1.861 ± 0.37
0.0GluXaa: 0.0 ± 0.0
Phe
2.585PheAla: 2.585 ± 0.53
0.103PheCys: 0.103 ± 0.118
3.412PheAsp: 3.412 ± 0.537
2.895PheGlu: 2.895 ± 0.548
0.93PhePhe: 0.93 ± 0.32
2.791PheGly: 2.791 ± 0.44
0.517PheHis: 0.517 ± 0.226
2.585PheIle: 2.585 ± 0.494
2.791PheLys: 2.791 ± 0.597
2.068PheLeu: 2.068 ± 0.475
1.137PheMet: 1.137 ± 0.369
2.895PheAsn: 2.895 ± 0.639
2.171PhePro: 2.171 ± 0.417
0.827PheGln: 0.827 ± 0.286
2.274PheArg: 2.274 ± 0.468
2.688PheSer: 2.688 ± 0.428
3.825PheThr: 3.825 ± 0.538
2.998PheVal: 2.998 ± 0.532
0.62PheTrp: 0.62 ± 0.222
1.034PheTyr: 1.034 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
5.376GlyAla: 5.376 ± 0.807
0.62GlyCys: 0.62 ± 0.291
4.032GlyAsp: 4.032 ± 0.749
3.515GlyGlu: 3.515 ± 0.6
1.861GlyPhe: 1.861 ± 0.475
6.513GlyGly: 6.513 ± 1.182
0.827GlyHis: 0.827 ± 0.27
4.756GlyIle: 4.756 ± 0.712
3.722GlyLys: 3.722 ± 0.632
4.859GlyLeu: 4.859 ± 0.855
1.447GlyMet: 1.447 ± 0.431
4.342GlyAsn: 4.342 ± 0.866
1.447GlyPro: 1.447 ± 0.438
2.585GlyGln: 2.585 ± 0.542
2.378GlyArg: 2.378 ± 0.438
7.857GlySer: 7.857 ± 1.207
7.133GlyThr: 7.133 ± 1.221
3.928GlyVal: 3.928 ± 0.696
0.31GlyTrp: 0.31 ± 0.16
1.757GlyTyr: 1.757 ± 0.473
0.0GlyXaa: 0.0 ± 0.0
His
1.241HisAla: 1.241 ± 0.424
0.103HisCys: 0.103 ± 0.104
0.724HisAsp: 0.724 ± 0.28
0.414HisGlu: 0.414 ± 0.193
0.724HisPhe: 0.724 ± 0.245
0.517HisGly: 0.517 ± 0.219
0.31HisHis: 0.31 ± 0.173
0.414HisIle: 0.414 ± 0.201
0.93HisLys: 0.93 ± 0.387
1.654HisLeu: 1.654 ± 0.392
0.31HisMet: 0.31 ± 0.171
1.137HisAsn: 1.137 ± 0.372
0.62HisPro: 0.62 ± 0.222
0.207HisGln: 0.207 ± 0.147
1.034HisArg: 1.034 ± 0.325
0.724HisSer: 0.724 ± 0.278
0.62HisThr: 0.62 ± 0.24
0.93HisVal: 0.93 ± 0.306
0.0HisTrp: 0.0 ± 0.0
1.344HisTyr: 1.344 ± 0.307
0.0HisXaa: 0.0 ± 0.0
Ile
4.239IleAla: 4.239 ± 0.742
0.724IleCys: 0.724 ± 0.279
4.445IleAsp: 4.445 ± 0.525
3.618IleGlu: 3.618 ± 0.541
2.274IlePhe: 2.274 ± 0.42
2.585IleGly: 2.585 ± 0.672
0.93IleHis: 0.93 ± 0.285
4.549IleIle: 4.549 ± 0.587
4.859IleLys: 4.859 ± 0.778
3.515IleLeu: 3.515 ± 0.736
1.447IleMet: 1.447 ± 0.335
3.205IleAsn: 3.205 ± 0.491
3.308IlePro: 3.308 ± 0.638
2.688IleGln: 2.688 ± 0.488
2.791IleArg: 2.791 ± 0.523
4.962IleSer: 4.962 ± 0.752
7.443IleThr: 7.443 ± 0.996
3.412IleVal: 3.412 ± 0.546
0.517IleTrp: 0.517 ± 0.284
1.964IleTyr: 1.964 ± 0.499
0.0IleXaa: 0.0 ± 0.0
Lys
5.789LysAla: 5.789 ± 1.015
0.724LysCys: 0.724 ± 0.291
3.722LysAsp: 3.722 ± 0.738
4.032LysGlu: 4.032 ± 0.659
2.378LysPhe: 2.378 ± 0.694
4.342LysGly: 4.342 ± 0.601
0.724LysHis: 0.724 ± 0.324
4.549LysIle: 4.549 ± 0.554
5.789LysLys: 5.789 ± 1.046
5.169LysLeu: 5.169 ± 0.748
1.447LysMet: 1.447 ± 0.361
3.825LysAsn: 3.825 ± 0.612
2.481LysPro: 2.481 ± 0.687
2.998LysGln: 2.998 ± 0.584
3.101LysArg: 3.101 ± 0.498
3.205LysSer: 3.205 ± 0.52
5.272LysThr: 5.272 ± 1.21
2.791LysVal: 2.791 ± 0.569
1.137LysTrp: 1.137 ± 0.426
2.688LysTyr: 2.688 ± 0.585
0.0LysXaa: 0.0 ± 0.0
Leu
7.133LeuAla: 7.133 ± 1.061
0.93LeuCys: 0.93 ± 0.401
4.756LeuAsp: 4.756 ± 0.765
5.169LeuGlu: 5.169 ± 0.779
2.791LeuPhe: 2.791 ± 0.511
3.618LeuGly: 3.618 ± 0.554
0.827LeuHis: 0.827 ± 0.3
4.342LeuIle: 4.342 ± 0.78
5.169LeuLys: 5.169 ± 0.772
5.169LeuLeu: 5.169 ± 0.754
1.034LeuMet: 1.034 ± 0.304
5.169LeuAsn: 5.169 ± 0.604
3.101LeuPro: 3.101 ± 0.64
3.412LeuGln: 3.412 ± 0.838
3.308LeuArg: 3.308 ± 0.604
5.169LeuSer: 5.169 ± 0.917
6.203LeuThr: 6.203 ± 0.869
4.342LeuVal: 4.342 ± 0.627
0.414LeuTrp: 0.414 ± 0.22
1.861LeuTyr: 1.861 ± 0.434
0.0LeuXaa: 0.0 ± 0.0
Met
3.515MetAla: 3.515 ± 0.622
0.207MetCys: 0.207 ± 0.143
1.034MetAsp: 1.034 ± 0.337
1.344MetGlu: 1.344 ± 0.403
0.31MetPhe: 0.31 ± 0.192
0.724MetGly: 0.724 ± 0.316
0.31MetHis: 0.31 ± 0.176
0.93MetIle: 0.93 ± 0.365
1.447MetLys: 1.447 ± 0.408
1.241MetLeu: 1.241 ± 0.34
0.62MetMet: 0.62 ± 0.234
1.551MetAsn: 1.551 ± 0.461
0.62MetPro: 0.62 ± 0.243
0.827MetGln: 0.827 ± 0.298
0.517MetArg: 0.517 ± 0.226
2.068MetSer: 2.068 ± 0.455
1.964MetThr: 1.964 ± 0.495
1.034MetVal: 1.034 ± 0.318
0.103MetTrp: 0.103 ± 0.121
0.31MetTyr: 0.31 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
4.962AsnAla: 4.962 ± 0.907
0.62AsnCys: 0.62 ± 0.261
2.274AsnAsp: 2.274 ± 0.411
2.791AsnGlu: 2.791 ± 0.425
2.068AsnPhe: 2.068 ± 0.441
4.652AsnGly: 4.652 ± 0.506
0.724AsnHis: 0.724 ± 0.255
4.032AsnIle: 4.032 ± 0.575
3.308AsnLys: 3.308 ± 0.573
4.859AsnLeu: 4.859 ± 0.682
1.034AsnMet: 1.034 ± 0.348
3.515AsnAsn: 3.515 ± 0.51
2.585AsnPro: 2.585 ± 0.684
3.205AsnGln: 3.205 ± 0.615
2.378AsnArg: 2.378 ± 0.483
3.515AsnSer: 3.515 ± 0.671
3.825AsnThr: 3.825 ± 0.721
3.515AsnVal: 3.515 ± 0.648
0.62AsnTrp: 0.62 ± 0.264
1.861AsnTyr: 1.861 ± 0.486
0.0AsnXaa: 0.0 ± 0.0
Pro
2.481ProAla: 2.481 ± 0.475
0.103ProCys: 0.103 ± 0.1
2.378ProAsp: 2.378 ± 0.471
3.101ProGlu: 3.101 ± 0.728
1.241ProPhe: 1.241 ± 0.343
0.0ProGly: 0.0 ± 0.0
0.93ProHis: 0.93 ± 0.298
2.791ProIle: 2.791 ± 0.613
2.274ProLys: 2.274 ± 0.485
2.481ProLeu: 2.481 ± 0.618
1.137ProMet: 1.137 ± 0.324
2.791ProAsn: 2.791 ± 0.54
1.861ProPro: 1.861 ± 0.429
1.551ProGln: 1.551 ± 0.451
1.654ProArg: 1.654 ± 0.338
4.859ProSer: 4.859 ± 0.562
3.308ProThr: 3.308 ± 0.742
3.101ProVal: 3.101 ± 0.558
0.0ProTrp: 0.0 ± 0.0
0.93ProTyr: 0.93 ± 0.275
0.0ProXaa: 0.0 ± 0.0
Gln
3.412GlnAla: 3.412 ± 0.578
0.207GlnCys: 0.207 ± 0.132
2.274GlnAsp: 2.274 ± 0.399
2.274GlnGlu: 2.274 ± 0.581
1.654GlnPhe: 1.654 ± 0.424
2.895GlnGly: 2.895 ± 0.642
0.827GlnHis: 0.827 ± 0.276
2.998GlnIle: 2.998 ± 0.488
3.825GlnLys: 3.825 ± 0.71
3.928GlnLeu: 3.928 ± 0.731
0.517GlnMet: 0.517 ± 0.21
1.861GlnAsn: 1.861 ± 0.464
1.344GlnPro: 1.344 ± 0.371
1.757GlnGln: 1.757 ± 0.397
1.447GlnArg: 1.447 ± 0.418
2.171GlnSer: 2.171 ± 0.499
3.412GlnThr: 3.412 ± 0.555
2.171GlnVal: 2.171 ± 0.454
0.414GlnTrp: 0.414 ± 0.2
1.447GlnTyr: 1.447 ± 0.474
0.0GlnXaa: 0.0 ± 0.0
Arg
3.205ArgAla: 3.205 ± 0.698
0.207ArgCys: 0.207 ± 0.173
1.757ArgAsp: 1.757 ± 0.412
2.068ArgGlu: 2.068 ± 0.631
2.378ArgPhe: 2.378 ± 0.491
1.551ArgGly: 1.551 ± 0.397
0.724ArgHis: 0.724 ± 0.262
2.791ArgIle: 2.791 ± 0.631
3.205ArgLys: 3.205 ± 0.72
3.308ArgLeu: 3.308 ± 0.504
0.517ArgMet: 0.517 ± 0.207
1.757ArgAsn: 1.757 ± 0.38
1.034ArgPro: 1.034 ± 0.291
1.447ArgGln: 1.447 ± 0.383
1.861ArgArg: 1.861 ± 0.429
3.205ArgSer: 3.205 ± 0.683
3.618ArgThr: 3.618 ± 0.611
4.342ArgVal: 4.342 ± 0.654
0.414ArgTrp: 0.414 ± 0.219
1.964ArgTyr: 1.964 ± 0.533
0.0ArgXaa: 0.0 ± 0.0
Ser
8.064SerAla: 8.064 ± 1.118
0.827SerCys: 0.827 ± 0.325
4.859SerAsp: 4.859 ± 0.903
3.205SerGlu: 3.205 ± 0.559
3.205SerPhe: 3.205 ± 0.508
9.925SerGly: 9.925 ± 1.663
1.034SerHis: 1.034 ± 0.434
5.169SerIle: 5.169 ± 0.899
3.412SerLys: 3.412 ± 0.625
5.376SerLeu: 5.376 ± 0.731
1.654SerMet: 1.654 ± 0.509
3.618SerAsn: 3.618 ± 0.685
3.618SerPro: 3.618 ± 0.505
2.274SerGln: 2.274 ± 0.59
2.481SerArg: 2.481 ± 0.621
7.857SerSer: 7.857 ± 1.461
5.996SerThr: 5.996 ± 0.829
4.652SerVal: 4.652 ± 0.706
0.827SerTrp: 0.827 ± 0.286
1.861SerTyr: 1.861 ± 0.413
0.0SerXaa: 0.0 ± 0.0
Thr
8.27ThrAla: 8.27 ± 1.306
0.62ThrCys: 0.62 ± 0.258
5.066ThrAsp: 5.066 ± 0.764
3.618ThrGlu: 3.618 ± 0.608
4.652ThrPhe: 4.652 ± 0.681
7.237ThrGly: 7.237 ± 1.341
0.724ThrHis: 0.724 ± 0.244
5.789ThrIle: 5.789 ± 0.762
4.652ThrLys: 4.652 ± 0.638
5.169ThrLeu: 5.169 ± 0.905
1.654ThrMet: 1.654 ± 0.383
4.549ThrAsn: 4.549 ± 0.716
4.342ThrPro: 4.342 ± 0.693
3.722ThrGln: 3.722 ± 0.562
2.378ThrArg: 2.378 ± 0.517
7.133ThrSer: 7.133 ± 1.005
7.857ThrThr: 7.857 ± 1.235
7.237ThrVal: 7.237 ± 1.022
0.517ThrTrp: 0.517 ± 0.23
2.171ThrTyr: 2.171 ± 0.464
0.0ThrXaa: 0.0 ± 0.0
Val
3.308ValAla: 3.308 ± 0.591
0.414ValCys: 0.414 ± 0.241
3.722ValAsp: 3.722 ± 0.592
3.825ValGlu: 3.825 ± 0.664
1.654ValPhe: 1.654 ± 0.385
4.652ValGly: 4.652 ± 0.652
1.034ValHis: 1.034 ± 0.318
2.998ValIle: 2.998 ± 0.582
4.962ValLys: 4.962 ± 0.854
4.445ValLeu: 4.445 ± 0.533
1.551ValMet: 1.551 ± 0.319
4.342ValAsn: 4.342 ± 0.609
2.068ValPro: 2.068 ± 0.481
2.585ValGln: 2.585 ± 0.424
2.171ValArg: 2.171 ± 0.415
5.272ValSer: 5.272 ± 0.656
6.72ValThr: 6.72 ± 1.091
4.445ValVal: 4.445 ± 0.741
0.414ValTrp: 0.414 ± 0.169
2.274ValTyr: 2.274 ± 0.435
0.0ValXaa: 0.0 ± 0.0
Trp
0.93TrpAla: 0.93 ± 0.261
0.103TrpCys: 0.103 ± 0.105
0.31TrpAsp: 0.31 ± 0.171
0.517TrpGlu: 0.517 ± 0.252
0.103TrpPhe: 0.103 ± 0.104
0.517TrpGly: 0.517 ± 0.31
0.0TrpHis: 0.0 ± 0.0
0.724TrpIle: 0.724 ± 0.297
0.62TrpLys: 0.62 ± 0.288
0.517TrpLeu: 0.517 ± 0.239
0.103TrpMet: 0.103 ± 0.105
0.414TrpAsn: 0.414 ± 0.238
0.207TrpPro: 0.207 ± 0.13
0.62TrpGln: 0.62 ± 0.219
0.62TrpArg: 0.62 ± 0.268
1.551TrpSer: 1.551 ± 0.395
0.62TrpThr: 0.62 ± 0.255
0.207TrpVal: 0.207 ± 0.157
0.103TrpTrp: 0.103 ± 0.102
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.378TyrAla: 2.378 ± 0.472
0.517TyrCys: 0.517 ± 0.249
1.757TyrAsp: 1.757 ± 0.462
1.757TyrGlu: 1.757 ± 0.415
1.034TyrPhe: 1.034 ± 0.297
2.688TyrGly: 2.688 ± 0.515
0.62TyrHis: 0.62 ± 0.237
1.241TyrIle: 1.241 ± 0.347
1.861TyrLys: 1.861 ± 0.462
2.171TyrLeu: 2.171 ± 0.478
0.517TyrMet: 0.517 ± 0.212
1.551TyrAsn: 1.551 ± 0.374
1.447TyrPro: 1.447 ± 0.389
1.654TyrGln: 1.654 ± 0.409
1.861TyrArg: 1.861 ± 0.505
1.861TyrSer: 1.861 ± 0.37
2.481TyrThr: 2.481 ± 0.492
1.447TyrVal: 1.447 ± 0.377
0.414TyrTrp: 0.414 ± 0.193
0.827TyrTyr: 0.827 ± 0.254
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (9674 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski