Amino acid dipeptide frequency for uncultured phage_MedDCM-OCT-S37-C6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally common, each dipeptide would be expected at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
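The calculation described above can be sketched in a few lines of Python. This is a minimal sketch, not the code used to build the table: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing in for Xaa), and the choice to count overlapping adjacent pairs within each protein and normalise by the total number of pairs are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: pairs are overlapping neighbours inside one protein and
    the denominator is the total number of such pairs.
    """
    alphabet = STANDARD + "X"
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Adjacent pairs never span two different proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input: the second sequence contains a non-standard residue (U).
freqs = dipeptide_per_mille(["MAAK", "AAUG"])
```

With real data, the input would be the list of protein sequences of the proteome; the toy sequences here are only for illustration.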

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
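As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400 (0.25%, i.e. 2.5‰). The helper names are hypothetical, and comparing against a fixed 2.5‰ threshold without considering the error estimate is an assumption about how the red and blue marking could be reproduced.

```python
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def representation(value_per_mille, expected=UNIFORM_EXPECTATION):
    """Classify a dipeptide relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"   # would be marked in red
    if value_per_mille < expected:
        return "under"  # would be marked in blue
    return "as expected"


ala_ala = 13.754  # AlaAla value from the table, in per mille
cys_his = 0.081   # CysHis value from the table, in per mille

ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_class = representation(ala_ala)
cys_his_class = representation(cys_his)
```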

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.754 ± 2.48
AlaCys: 0.728 ± 0.256
AlaAsp: 6.877 ± 0.506
AlaGlu: 6.634 ± 0.822
AlaPhe: 3.964 ± 0.548
AlaGly: 9.061 ± 0.9
AlaHis: 1.942 ± 0.458
AlaIle: 4.45 ± 0.589
AlaLys: 4.854 ± 0.667
AlaLeu: 9.061 ± 0.705
AlaMet: 2.346 ± 0.432
AlaAsn: 5.583 ± 1.512
AlaPro: 4.126 ± 0.873
AlaGln: 5.583 ± 0.856
AlaArg: 6.068 ± 0.782
AlaSer: 6.23 ± 1.134
AlaThr: 8.091 ± 1.784
AlaVal: 6.958 ± 0.643
AlaTrp: 2.023 ± 0.42
AlaTyr: 2.427 ± 0.567
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.809 ± 0.289
CysCys: 0.243 ± 0.159
CysAsp: 0.728 ± 0.245
CysGlu: 0.485 ± 0.231
CysPhe: 0.324 ± 0.157
CysGly: 0.728 ± 0.299
CysHis: 0.081 ± 0.085
CysIle: 0.081 ± 0.083
CysLys: 0.566 ± 0.218
CysLeu: 0.89 ± 0.298
CysMet: 0.485 ± 0.216
CysAsn: 0.081 ± 0.075
CysPro: 0.485 ± 0.195
CysGln: 0.324 ± 0.172
CysArg: 1.133 ± 0.41
CysSer: 0.324 ± 0.148
CysThr: 0.647 ± 0.201
CysVal: 0.809 ± 0.303
CysTrp: 0.0 ± 0.0
CysTyr: 0.243 ± 0.16
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.553 ± 0.677
AspCys: 0.485 ± 0.201
AspAsp: 3.883 ± 0.489
AspGlu: 3.398 ± 0.766
AspPhe: 2.427 ± 0.478
AspGly: 5.097 ± 0.886
AspHis: 1.618 ± 0.402
AspIle: 2.832 ± 0.426
AspLys: 2.184 ± 0.483
AspLeu: 4.531 ± 0.511
AspMet: 1.456 ± 0.34
AspAsn: 1.375 ± 0.332
AspPro: 3.883 ± 0.562
AspGln: 2.427 ± 0.543
AspArg: 3.398 ± 0.451
AspSer: 3.317 ± 0.741
AspThr: 2.346 ± 0.516
AspVal: 3.641 ± 0.493
AspTrp: 0.647 ± 0.188
AspTyr: 2.023 ± 0.351
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.906 ± 0.802
GluCys: 0.405 ± 0.174
GluAsp: 2.184 ± 0.402
GluGlu: 3.964 ± 0.901
GluPhe: 1.294 ± 0.305
GluGly: 4.045 ± 0.664
GluHis: 0.405 ± 0.231
GluIle: 3.155 ± 0.551
GluLys: 2.589 ± 0.397
GluLeu: 7.039 ± 1.126
GluMet: 1.537 ± 0.456
GluAsn: 1.861 ± 0.313
GluPro: 2.184 ± 0.394
GluGln: 3.883 ± 0.68
GluArg: 3.883 ± 0.634
GluSer: 2.832 ± 0.421
GluThr: 2.67 ± 0.659
GluVal: 3.641 ± 0.569
GluTrp: 0.647 ± 0.236
GluTyr: 2.265 ± 0.499
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.074 ± 0.59
PheCys: 0.405 ± 0.228
PheAsp: 2.994 ± 0.602
PheGlu: 2.184 ± 0.457
PhePhe: 0.647 ± 0.202
PheGly: 2.104 ± 0.374
PheHis: 0.728 ± 0.218
PheIle: 1.133 ± 0.221
PheLys: 1.537 ± 0.34
PheLeu: 1.861 ± 0.379
PheMet: 0.728 ± 0.241
PheAsn: 2.104 ± 0.454
PhePro: 1.133 ± 0.235
PheGln: 0.89 ± 0.231
PheArg: 1.456 ± 0.373
PheSer: 2.751 ± 0.471
PheThr: 2.832 ± 0.516
PheVal: 1.861 ± 0.435
PheTrp: 0.162 ± 0.141
PheTyr: 1.78 ± 0.467
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.744 ± 0.81
GlyCys: 0.809 ± 0.303
GlyAsp: 4.288 ± 0.553
GlyGlu: 2.265 ± 0.429
GlyPhe: 3.074 ± 0.749
GlyGly: 6.472 ± 0.895
GlyHis: 0.647 ± 0.207
GlyIle: 3.074 ± 0.585
GlyLys: 4.369 ± 0.592
GlyLeu: 6.553 ± 0.754
GlyMet: 2.104 ± 0.452
GlyAsn: 5.178 ± 0.925
GlyPro: 2.427 ± 0.558
GlyGln: 4.531 ± 0.481
GlyArg: 3.722 ± 0.498
GlySer: 6.553 ± 1.057
GlyThr: 5.259 ± 1.38
GlyVal: 5.583 ± 0.419
GlyTrp: 1.214 ± 0.336
GlyTyr: 2.346 ± 0.355
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.699 ± 0.4
HisCys: 0.405 ± 0.236
HisAsp: 0.809 ± 0.261
HisGlu: 0.971 ± 0.209
HisPhe: 0.485 ± 0.166
HisGly: 1.052 ± 0.271
HisHis: 0.243 ± 0.125
HisIle: 0.971 ± 0.263
HisLys: 0.809 ± 0.247
HisLeu: 2.023 ± 0.457
HisMet: 0.162 ± 0.151
HisAsn: 0.647 ± 0.223
HisPro: 0.566 ± 0.184
HisGln: 0.566 ± 0.242
HisArg: 0.971 ± 0.32
HisSer: 1.052 ± 0.274
HisThr: 1.214 ± 0.255
HisVal: 1.052 ± 0.239
HisTrp: 0.243 ± 0.137
HisTyr: 0.405 ± 0.212
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.369 ± 0.544
IleCys: 0.243 ± 0.139
IleAsp: 4.045 ± 0.668
IleGlu: 3.074 ± 0.576
IlePhe: 1.456 ± 0.367
IleGly: 3.317 ± 0.366
IleHis: 0.485 ± 0.221
IleIle: 0.89 ± 0.346
IleLys: 2.104 ± 0.415
IleLeu: 2.994 ± 0.516
IleMet: 0.485 ± 0.219
IleAsn: 1.861 ± 0.432
IlePro: 1.861 ± 0.311
IleGln: 2.508 ± 0.713
IleArg: 2.508 ± 0.36
IleSer: 2.427 ± 0.448
IleThr: 3.074 ± 0.575
IleVal: 1.942 ± 0.411
IleTrp: 0.405 ± 0.194
IleTyr: 1.214 ± 0.317
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.583 ± 1.042
LysCys: 0.243 ± 0.126
LysAsp: 1.78 ± 0.414
LysGlu: 2.832 ± 0.512
LysPhe: 1.375 ± 0.356
LysGly: 3.155 ± 0.569
LysHis: 0.809 ± 0.205
LysIle: 2.427 ± 0.356
LysLys: 2.832 ± 0.659
LysLeu: 4.045 ± 0.823
LysMet: 0.89 ± 0.242
LysAsn: 2.265 ± 0.325
LysPro: 2.832 ± 0.5
LysGln: 2.751 ± 0.512
LysArg: 2.589 ± 0.496
LysSer: 3.398 ± 0.491
LysThr: 3.074 ± 0.656
LysVal: 2.184 ± 0.5
LysTrp: 0.728 ± 0.229
LysTyr: 0.89 ± 0.219
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.032 ± 1.233
LeuCys: 1.052 ± 0.332
LeuAsp: 5.583 ± 0.689
LeuGlu: 4.935 ± 0.699
LeuPhe: 1.861 ± 0.302
LeuGly: 6.149 ± 0.646
LeuHis: 1.133 ± 0.355
LeuIle: 2.589 ± 0.46
LeuLys: 3.479 ± 0.712
LeuLeu: 6.877 ± 1.052
LeuMet: 2.184 ± 0.412
LeuAsn: 4.369 ± 0.611
LeuPro: 3.964 ± 0.87
LeuGln: 4.935 ± 0.839
LeuArg: 5.987 ± 0.715
LeuSer: 5.663 ± 0.83
LeuThr: 6.068 ± 0.723
LeuVal: 4.612 ± 0.54
LeuTrp: 0.89 ± 0.287
LeuTyr: 2.023 ± 0.365
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.913 ± 0.526
MetCys: 0.405 ± 0.212
MetAsp: 1.537 ± 0.32
MetGlu: 1.537 ± 0.349
MetPhe: 0.324 ± 0.177
MetGly: 2.265 ± 0.575
MetHis: 0.324 ± 0.141
MetIle: 0.89 ± 0.253
MetLys: 1.699 ± 0.494
MetLeu: 1.618 ± 0.289
MetMet: 0.324 ± 0.163
MetAsn: 0.89 ± 0.236
MetPro: 1.78 ± 0.308
MetGln: 1.861 ± 0.502
MetArg: 1.456 ± 0.408
MetSer: 1.375 ± 0.375
MetThr: 2.023 ± 0.422
MetVal: 1.052 ± 0.371
MetTrp: 0.566 ± 0.278
MetTyr: 0.324 ± 0.173
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.583 ± 1.27
AsnCys: 0.485 ± 0.222
AsnAsp: 1.618 ± 0.413
AsnGlu: 2.508 ± 0.35
AsnPhe: 1.294 ± 0.299
AsnGly: 4.045 ± 1.031
AsnHis: 1.052 ± 0.35
AsnIle: 1.78 ± 0.405
AsnLys: 2.427 ± 0.407
AsnLeu: 3.398 ± 0.612
AsnMet: 1.052 ± 0.278
AsnAsn: 2.184 ± 0.87
AsnPro: 2.427 ± 0.505
AsnGln: 2.265 ± 0.407
AsnArg: 2.751 ± 0.429
AsnSer: 3.317 ± 0.828
AsnThr: 3.155 ± 0.945
AsnVal: 2.994 ± 0.47
AsnTrp: 0.728 ± 0.222
AsnTyr: 1.456 ± 0.369
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.612 ± 0.549
ProCys: 0.324 ± 0.167
ProAsp: 2.751 ± 0.426
ProGlu: 3.964 ± 0.746
ProPhe: 1.618 ± 0.382
ProGly: 3.317 ± 0.526
ProHis: 0.485 ± 0.201
ProIle: 1.618 ± 0.367
ProLys: 2.023 ± 0.483
ProLeu: 3.236 ± 0.477
ProMet: 1.052 ± 0.345
ProAsn: 1.375 ± 0.298
ProPro: 1.78 ± 0.474
ProGln: 1.942 ± 0.51
ProArg: 1.78 ± 0.385
ProSer: 3.236 ± 0.503
ProThr: 3.641 ± 0.943
ProVal: 4.045 ± 0.621
ProTrp: 0.566 ± 0.244
ProTyr: 1.942 ± 0.481
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.392 ± 1.048
GlnCys: 0.485 ± 0.166
GlnAsp: 1.861 ± 0.415
GlnGlu: 2.751 ± 0.569
GlnPhe: 1.699 ± 0.472
GlnGly: 2.023 ± 0.433
GlnHis: 0.89 ± 0.312
GlnIle: 2.508 ± 0.491
GlnLys: 2.265 ± 0.468
GlnLeu: 5.016 ± 0.743
GlnMet: 2.104 ± 0.45
GlnAsn: 2.104 ± 0.397
GlnPro: 1.861 ± 0.341
GlnGln: 3.964 ± 0.834
GlnArg: 3.479 ± 0.638
GlnSer: 2.913 ± 0.519
GlnThr: 3.317 ± 0.686
GlnVal: 3.641 ± 0.57
GlnTrp: 0.485 ± 0.223
GlnTyr: 1.537 ± 0.542
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.906 ± 0.729
ArgCys: 0.566 ± 0.287
ArgAsp: 3.155 ± 0.481
ArgGlu: 2.994 ± 0.546
ArgPhe: 1.942 ± 0.36
ArgGly: 3.883 ± 0.467
ArgHis: 0.809 ± 0.291
ArgIle: 2.265 ± 0.525
ArgLys: 3.722 ± 0.846
ArgLeu: 6.068 ± 1.056
ArgMet: 2.508 ± 0.434
ArgAsn: 2.508 ± 0.513
ArgPro: 2.427 ± 0.385
ArgGln: 3.398 ± 0.579
ArgArg: 3.883 ± 0.795
ArgSer: 2.265 ± 0.295
ArgThr: 2.67 ± 0.461
ArgVal: 3.803 ± 0.666
ArgTrp: 0.89 ± 0.297
ArgTyr: 1.133 ± 0.349
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.715 ± 1.946
SerCys: 0.566 ± 0.249
SerAsp: 3.803 ± 0.518
SerGlu: 2.346 ± 0.511
SerPhe: 3.155 ± 0.677
SerGly: 5.583 ± 1.173
SerHis: 1.052 ± 0.35
SerIle: 2.832 ± 0.487
SerLys: 2.023 ± 0.342
SerLeu: 5.663 ± 0.966
SerMet: 1.456 ± 0.444
SerAsn: 3.479 ± 0.595
SerPro: 2.913 ± 0.53
SerGln: 2.832 ± 0.54
SerArg: 3.236 ± 0.396
SerSer: 6.634 ± 1.913
SerThr: 5.259 ± 1.391
SerVal: 3.56 ± 0.737
SerTrp: 0.89 ± 0.28
SerTyr: 2.265 ± 0.44
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.304 ± 2.163
ThrCys: 0.324 ± 0.147
ThrAsp: 3.317 ± 0.507
ThrGlu: 3.155 ± 0.638
ThrPhe: 1.618 ± 0.318
ThrGly: 5.421 ± 1.006
ThrHis: 1.456 ± 0.343
ThrIle: 3.479 ± 0.547
ThrLys: 2.913 ± 0.464
ThrLeu: 5.421 ± 0.578
ThrMet: 1.052 ± 0.315
ThrAsn: 3.398 ± 0.908
ThrPro: 3.964 ± 0.66
ThrGln: 2.427 ± 0.496
ThrArg: 2.508 ± 0.489
ThrSer: 5.016 ± 1.584
ThrThr: 7.039 ± 1.493
ThrVal: 4.126 ± 0.932
ThrTrp: 1.214 ± 0.425
ThrTyr: 2.104 ± 0.438
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.201 ± 0.91
ValCys: 0.809 ± 0.346
ValAsp: 3.883 ± 0.507
ValGlu: 3.803 ± 0.574
ValPhe: 2.751 ± 0.403
ValGly: 5.259 ± 0.814
ValHis: 1.133 ± 0.299
ValIle: 2.589 ± 0.379
ValLys: 2.589 ± 0.407
ValLeu: 4.369 ± 0.695
ValMet: 2.023 ± 0.591
ValAsn: 3.074 ± 0.925
ValPro: 3.074 ± 0.656
ValGln: 2.265 ± 0.508
ValArg: 3.398 ± 0.464
ValSer: 3.964 ± 0.675
ValThr: 3.883 ± 0.545
ValVal: 5.663 ± 0.795
ValTrp: 1.133 ± 0.342
ValTyr: 1.618 ± 0.446
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.294 ± 0.391
TrpCys: 0.081 ± 0.082
TrpAsp: 0.809 ± 0.23
TrpGlu: 0.647 ± 0.198
TrpPhe: 0.324 ± 0.17
TrpGly: 0.728 ± 0.185
TrpHis: 0.485 ± 0.15
TrpIle: 0.809 ± 0.273
TrpLys: 0.162 ± 0.136
TrpLeu: 1.294 ± 0.328
TrpMet: 0.405 ± 0.198
TrpAsn: 0.728 ± 0.245
TrpPro: 0.809 ± 0.246
TrpGln: 0.809 ± 0.246
TrpArg: 0.809 ± 0.295
TrpSer: 1.133 ± 0.33
TrpThr: 1.375 ± 0.393
TrpVal: 1.294 ± 0.332
TrpTrp: 0.162 ± 0.104
TrpTyr: 0.162 ± 0.152
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.317 ± 0.553
TyrCys: 0.405 ± 0.179
TyrAsp: 1.618 ± 0.268
TyrGlu: 1.942 ± 0.443
TyrPhe: 0.728 ± 0.271
TyrGly: 2.589 ± 0.547
TyrHis: 0.566 ± 0.17
TyrIle: 0.89 ± 0.283
TyrLys: 1.537 ± 0.257
TyrLeu: 2.427 ± 0.464
TyrMet: 0.728 ± 0.212
TyrAsn: 1.456 ± 0.304
TyrPro: 0.809 ± 0.229
TyrGln: 1.133 ± 0.388
TyrArg: 1.78 ± 0.282
TyrSer: 1.861 ± 0.365
TyrThr: 1.618 ± 0.379
TyrVal: 2.023 ± 0.408
TyrTrp: 0.728 ± 0.317
TyrTyr: 1.052 ± 0.314
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (12361 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
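One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is an illustration under stated assumptions, not the database's own procedure: the function names are hypothetical, the sequences are toy data in one-letter code, and taking the error to be the sample standard deviation across replicates is an assumption.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    pairs = 0
    hits = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            pairs += 1
            hits += (a + b == dipeptide)
    return 1000.0 * hits / pairs if pairs else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not single residues.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome; real input would be the 45 protein sequences.
proteins = ["MAAKL", "MKAAG", "MGLLA", "MAAAA"]
error = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects protein-to-protein variability.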

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski