Amino acid dipepetide frequency for Gordonia phage Gudmit

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.296AlaAla: 17.296 ± 1.481
0.668AlaCys: 0.668 ± 0.244
9.279AlaAsp: 9.279 ± 0.778
8.166AlaGlu: 8.166 ± 0.77
3.637AlaPhe: 3.637 ± 0.497
8.611AlaGly: 8.611 ± 0.768
2.524AlaHis: 2.524 ± 0.464
5.642AlaIle: 5.642 ± 0.637
4.974AlaLys: 4.974 ± 0.749
8.017AlaLeu: 8.017 ± 0.567
2.969AlaMet: 2.969 ± 0.424
3.341AlaAsn: 3.341 ± 0.456
5.568AlaPro: 5.568 ± 0.637
5.271AlaGln: 5.271 ± 0.632
8.166AlaArg: 8.166 ± 1.032
5.345AlaSer: 5.345 ± 0.644
6.829AlaThr: 6.829 ± 0.669
8.388AlaVal: 8.388 ± 0.844
2.079AlaTrp: 2.079 ± 0.263
2.45AlaTyr: 2.45 ± 0.309
0.0AlaXaa: 0.0 ± 0.0
Cys
0.817CysAla: 0.817 ± 0.272
0.074CysCys: 0.074 ± 0.072
0.742CysAsp: 0.742 ± 0.285
0.668CysGlu: 0.668 ± 0.243
0.074CysPhe: 0.074 ± 0.073
0.817CysGly: 0.817 ± 0.258
0.371CysHis: 0.371 ± 0.159
0.297CysIle: 0.297 ± 0.136
0.594CysLys: 0.594 ± 0.21
0.52CysLeu: 0.52 ± 0.248
0.074CysMet: 0.074 ± 0.09
0.297CysAsn: 0.297 ± 0.176
0.445CysPro: 0.445 ± 0.172
0.223CysGln: 0.223 ± 0.122
1.114CysArg: 1.114 ± 0.324
0.668CysSer: 0.668 ± 0.258
0.742CysThr: 0.742 ± 0.29
0.742CysVal: 0.742 ± 0.278
0.52CysTrp: 0.52 ± 0.241
0.297CysTyr: 0.297 ± 0.15
0.0CysXaa: 0.0 ± 0.0
Asp
6.829AspAla: 6.829 ± 0.601
0.52AspCys: 0.52 ± 0.191
5.642AspAsp: 5.642 ± 0.813
5.196AspGlu: 5.196 ± 0.621
1.633AspPhe: 1.633 ± 0.291
6.31AspGly: 6.31 ± 0.538
1.262AspHis: 1.262 ± 0.234
3.192AspIle: 3.192 ± 0.507
1.633AspLys: 1.633 ± 0.322
6.458AspLeu: 6.458 ± 0.682
1.559AspMet: 1.559 ± 0.353
2.227AspAsn: 2.227 ± 0.404
4.677AspPro: 4.677 ± 0.701
1.93AspGln: 1.93 ± 0.381
4.974AspArg: 4.974 ± 0.56
2.821AspSer: 2.821 ± 0.508
4.231AspThr: 4.231 ± 0.501
4.38AspVal: 4.38 ± 0.626
1.41AspTrp: 1.41 ± 0.322
2.004AspTyr: 2.004 ± 0.31
0.0AspXaa: 0.0 ± 0.0
Glu
7.275GluAla: 7.275 ± 0.629
0.52GluCys: 0.52 ± 0.197
3.118GluAsp: 3.118 ± 0.506
2.672GluGlu: 2.672 ± 0.408
1.782GluPhe: 1.782 ± 0.318
4.157GluGly: 4.157 ± 0.537
1.262GluHis: 1.262 ± 0.336
3.489GluIle: 3.489 ± 0.596
2.598GluLys: 2.598 ± 0.397
4.974GluLeu: 4.974 ± 0.649
1.559GluMet: 1.559 ± 0.342
1.707GluAsn: 1.707 ± 0.31
3.415GluPro: 3.415 ± 0.529
2.895GluGln: 2.895 ± 0.439
4.38GluArg: 4.38 ± 0.506
3.415GluSer: 3.415 ± 0.582
2.895GluThr: 2.895 ± 0.476
5.642GluVal: 5.642 ± 0.644
0.965GluTrp: 0.965 ± 0.322
1.559GluTyr: 1.559 ± 0.399
0.0GluXaa: 0.0 ± 0.0
Phe
3.637PheAla: 3.637 ± 0.566
0.297PheCys: 0.297 ± 0.136
2.598PheAsp: 2.598 ± 0.534
1.782PheGlu: 1.782 ± 0.412
1.039PhePhe: 1.039 ± 0.256
3.341PheGly: 3.341 ± 0.498
0.52PheHis: 0.52 ± 0.203
1.485PheIle: 1.485 ± 0.35
0.594PheLys: 0.594 ± 0.192
1.039PheLeu: 1.039 ± 0.24
0.371PheMet: 0.371 ± 0.145
0.594PheAsn: 0.594 ± 0.17
0.594PhePro: 0.594 ± 0.19
0.668PheGln: 0.668 ± 0.211
1.188PheArg: 1.188 ± 0.348
1.114PheSer: 1.114 ± 0.28
1.856PheThr: 1.856 ± 0.412
1.93PheVal: 1.93 ± 0.372
0.891PheTrp: 0.891 ± 0.324
0.817PheTyr: 0.817 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
9.873GlyAla: 9.873 ± 0.96
0.594GlyCys: 0.594 ± 0.244
5.79GlyAsp: 5.79 ± 0.84
4.974GlyGlu: 4.974 ± 0.577
1.707GlyPhe: 1.707 ± 0.373
8.166GlyGly: 8.166 ± 1.1
1.856GlyHis: 1.856 ± 0.389
5.345GlyIle: 5.345 ± 0.589
2.969GlyLys: 2.969 ± 0.513
6.755GlyLeu: 6.755 ± 0.789
2.004GlyMet: 2.004 ± 0.393
2.153GlyAsn: 2.153 ± 0.405
4.306GlyPro: 4.306 ± 0.652
2.375GlyGln: 2.375 ± 0.434
6.829GlyArg: 6.829 ± 0.74
4.38GlySer: 4.38 ± 0.555
4.974GlyThr: 4.974 ± 0.595
6.607GlyVal: 6.607 ± 0.596
1.262GlyTrp: 1.262 ± 0.261
2.672GlyTyr: 2.672 ± 0.554
0.0GlyXaa: 0.0 ± 0.0
His
2.153HisAla: 2.153 ± 0.341
0.668HisCys: 0.668 ± 0.198
1.633HisAsp: 1.633 ± 0.409
1.188HisGlu: 1.188 ± 0.337
0.52HisPhe: 0.52 ± 0.195
2.227HisGly: 2.227 ± 0.4
0.445HisHis: 0.445 ± 0.177
0.891HisIle: 0.891 ± 0.297
0.445HisLys: 0.445 ± 0.18
1.856HisLeu: 1.856 ± 0.429
0.297HisMet: 0.297 ± 0.165
0.594HisAsn: 0.594 ± 0.225
1.559HisPro: 1.559 ± 0.356
1.039HisGln: 1.039 ± 0.276
1.93HisArg: 1.93 ± 0.505
0.668HisSer: 0.668 ± 0.33
1.707HisThr: 1.707 ± 0.354
1.485HisVal: 1.485 ± 0.339
0.223HisTrp: 0.223 ± 0.118
0.297HisTyr: 0.297 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
6.681IleAla: 6.681 ± 0.73
0.297IleCys: 0.297 ± 0.14
4.231IleAsp: 4.231 ± 0.656
3.563IleGlu: 3.563 ± 0.406
1.188IlePhe: 1.188 ± 0.364
4.38IleGly: 4.38 ± 0.808
1.262IleHis: 1.262 ± 0.371
1.262IleIle: 1.262 ± 0.337
0.891IleLys: 0.891 ± 0.273
4.157IleLeu: 4.157 ± 0.679
0.148IleMet: 0.148 ± 0.113
1.93IleAsn: 1.93 ± 0.448
2.747IlePro: 2.747 ± 0.407
1.782IleGln: 1.782 ± 0.489
3.044IleArg: 3.044 ± 0.392
2.524IleSer: 2.524 ± 0.441
3.415IleThr: 3.415 ± 0.598
3.266IleVal: 3.266 ± 0.443
0.965IleTrp: 0.965 ± 0.266
1.039IleTyr: 1.039 ± 0.295
0.0IleXaa: 0.0 ± 0.0
Lys
5.196LysAla: 5.196 ± 0.651
0.371LysCys: 0.371 ± 0.16
2.153LysAsp: 2.153 ± 0.32
2.004LysGlu: 2.004 ± 0.452
0.594LysPhe: 0.594 ± 0.183
2.45LysGly: 2.45 ± 0.402
0.594LysHis: 0.594 ± 0.258
1.93LysIle: 1.93 ± 0.366
1.262LysLys: 1.262 ± 0.299
2.672LysLeu: 2.672 ± 0.397
0.742LysMet: 0.742 ± 0.319
1.039LysAsn: 1.039 ± 0.32
2.153LysPro: 2.153 ± 0.394
0.965LysGln: 0.965 ± 0.267
2.747LysArg: 2.747 ± 0.478
2.747LysSer: 2.747 ± 0.528
1.633LysThr: 1.633 ± 0.334
2.301LysVal: 2.301 ± 0.335
0.371LysTrp: 0.371 ± 0.197
0.965LysTyr: 0.965 ± 0.259
0.0LysXaa: 0.0 ± 0.0
Leu
8.314LeuAla: 8.314 ± 0.674
0.817LeuCys: 0.817 ± 0.254
5.122LeuAsp: 5.122 ± 0.487
4.974LeuGlu: 4.974 ± 0.704
2.079LeuPhe: 2.079 ± 0.495
7.275LeuGly: 7.275 ± 0.835
2.227LeuHis: 2.227 ± 0.489
3.192LeuIle: 3.192 ± 0.524
3.192LeuLys: 3.192 ± 0.598
5.048LeuLeu: 5.048 ± 0.642
1.262LeuMet: 1.262 ± 0.256
2.153LeuAsn: 2.153 ± 0.322
3.637LeuPro: 3.637 ± 0.536
2.079LeuGln: 2.079 ± 0.455
6.458LeuArg: 6.458 ± 0.853
4.306LeuSer: 4.306 ± 0.561
4.899LeuThr: 4.899 ± 0.627
4.38LeuVal: 4.38 ± 0.59
1.559LeuTrp: 1.559 ± 0.317
1.93LeuTyr: 1.93 ± 0.484
0.0LeuXaa: 0.0 ± 0.0
Met
2.969MetAla: 2.969 ± 0.701
0.148MetCys: 0.148 ± 0.113
0.52MetAsp: 0.52 ± 0.189
0.594MetGlu: 0.594 ± 0.216
0.297MetPhe: 0.297 ± 0.175
1.188MetGly: 1.188 ± 0.341
0.297MetHis: 0.297 ± 0.158
1.188MetIle: 1.188 ± 0.211
0.668MetLys: 0.668 ± 0.194
1.633MetLeu: 1.633 ± 0.425
0.668MetMet: 0.668 ± 0.226
0.817MetAsn: 0.817 ± 0.209
1.782MetPro: 1.782 ± 0.322
1.039MetGln: 1.039 ± 0.437
1.039MetArg: 1.039 ± 0.317
1.262MetSer: 1.262 ± 0.348
3.118MetThr: 3.118 ± 0.532
1.336MetVal: 1.336 ± 0.31
0.371MetTrp: 0.371 ± 0.135
0.297MetTyr: 0.297 ± 0.122
0.0MetXaa: 0.0 ± 0.0
Asn
2.747AsnAla: 2.747 ± 0.444
0.223AsnCys: 0.223 ± 0.135
1.782AsnAsp: 1.782 ± 0.35
0.965AsnGlu: 0.965 ± 0.199
0.817AsnPhe: 0.817 ± 0.197
3.563AsnGly: 3.563 ± 0.566
0.594AsnHis: 0.594 ± 0.178
1.114AsnIle: 1.114 ± 0.252
1.114AsnLys: 1.114 ± 0.311
2.524AsnLeu: 2.524 ± 0.369
0.668AsnMet: 0.668 ± 0.24
1.559AsnAsn: 1.559 ± 0.319
2.598AsnPro: 2.598 ± 0.458
1.188AsnGln: 1.188 ± 0.312
2.079AsnArg: 2.079 ± 0.341
1.336AsnSer: 1.336 ± 0.331
2.227AsnThr: 2.227 ± 0.488
1.633AsnVal: 1.633 ± 0.356
0.52AsnTrp: 0.52 ± 0.195
0.742AsnTyr: 0.742 ± 0.189
0.0AsnXaa: 0.0 ± 0.0
Pro
6.384ProAla: 6.384 ± 0.827
0.817ProCys: 0.817 ± 0.231
4.751ProAsp: 4.751 ± 0.627
4.009ProGlu: 4.009 ± 0.586
1.41ProPhe: 1.41 ± 0.262
5.345ProGly: 5.345 ± 0.575
1.262ProHis: 1.262 ± 0.384
2.747ProIle: 2.747 ± 0.458
2.227ProLys: 2.227 ± 0.344
2.969ProLeu: 2.969 ± 0.464
1.114ProMet: 1.114 ± 0.355
1.782ProAsn: 1.782 ± 0.431
3.489ProPro: 3.489 ± 0.599
1.782ProGln: 1.782 ± 0.395
2.747ProArg: 2.747 ± 0.524
3.118ProSer: 3.118 ± 0.433
4.083ProThr: 4.083 ± 0.528
3.192ProVal: 3.192 ± 0.389
1.633ProTrp: 1.633 ± 0.321
1.41ProTyr: 1.41 ± 0.281
0.0ProXaa: 0.0 ± 0.0
Gln
4.454GlnAla: 4.454 ± 0.779
0.371GlnCys: 0.371 ± 0.153
1.039GlnAsp: 1.039 ± 0.268
1.633GlnGlu: 1.633 ± 0.388
0.817GlnPhe: 0.817 ± 0.24
2.375GlnGly: 2.375 ± 0.484
0.965GlnHis: 0.965 ± 0.288
2.672GlnIle: 2.672 ± 0.391
1.188GlnLys: 1.188 ± 0.341
3.415GlnLeu: 3.415 ± 0.454
0.371GlnMet: 0.371 ± 0.169
1.114GlnAsn: 1.114 ± 0.242
1.856GlnPro: 1.856 ± 0.436
2.004GlnGln: 2.004 ± 0.493
3.118GlnArg: 3.118 ± 0.509
1.707GlnSer: 1.707 ± 0.306
2.153GlnThr: 2.153 ± 0.494
2.301GlnVal: 2.301 ± 0.437
0.891GlnTrp: 0.891 ± 0.234
1.336GlnTyr: 1.336 ± 0.34
0.0GlnXaa: 0.0 ± 0.0
Arg
8.166ArgAla: 8.166 ± 0.794
1.114ArgCys: 1.114 ± 0.352
5.419ArgAsp: 5.419 ± 0.553
4.677ArgGlu: 4.677 ± 0.567
1.707ArgPhe: 1.707 ± 0.367
4.825ArgGly: 4.825 ± 0.581
1.485ArgHis: 1.485 ± 0.341
3.266ArgIle: 3.266 ± 0.542
3.415ArgLys: 3.415 ± 0.384
6.161ArgLeu: 6.161 ± 0.763
2.598ArgMet: 2.598 ± 0.535
2.079ArgAsn: 2.079 ± 0.38
4.009ArgPro: 4.009 ± 0.634
2.747ArgGln: 2.747 ± 0.573
7.052ArgArg: 7.052 ± 0.871
2.821ArgSer: 2.821 ± 0.445
4.454ArgThr: 4.454 ± 0.639
4.825ArgVal: 4.825 ± 0.609
1.856ArgTrp: 1.856 ± 0.325
1.485ArgTyr: 1.485 ± 0.325
0.0ArgXaa: 0.0 ± 0.0
Ser
6.458SerAla: 6.458 ± 0.603
0.297SerCys: 0.297 ± 0.142
3.266SerAsp: 3.266 ± 0.454
2.301SerGlu: 2.301 ± 0.377
1.559SerPhe: 1.559 ± 0.245
5.642SerGly: 5.642 ± 0.784
1.188SerHis: 1.188 ± 0.329
3.118SerIle: 3.118 ± 0.652
1.633SerLys: 1.633 ± 0.34
4.677SerLeu: 4.677 ± 0.652
1.262SerMet: 1.262 ± 0.329
2.004SerAsn: 2.004 ± 0.37
2.672SerPro: 2.672 ± 0.447
1.707SerGln: 1.707 ± 0.344
3.266SerArg: 3.266 ± 0.514
2.821SerSer: 2.821 ± 0.603
2.375SerThr: 2.375 ± 0.392
3.489SerVal: 3.489 ± 0.539
1.188SerTrp: 1.188 ± 0.328
1.114SerTyr: 1.114 ± 0.271
0.0SerXaa: 0.0 ± 0.0
Thr
7.052ThrAla: 7.052 ± 0.947
0.817ThrCys: 0.817 ± 0.303
3.192ThrAsp: 3.192 ± 0.519
3.341ThrGlu: 3.341 ± 0.53
2.375ThrPhe: 2.375 ± 0.447
6.236ThrGly: 6.236 ± 0.585
1.114ThrHis: 1.114 ± 0.364
4.38ThrIle: 4.38 ± 0.549
1.856ThrLys: 1.856 ± 0.412
4.157ThrLeu: 4.157 ± 0.641
1.336ThrMet: 1.336 ± 0.234
1.559ThrAsn: 1.559 ± 0.335
4.899ThrPro: 4.899 ± 0.84
1.782ThrGln: 1.782 ± 0.389
3.637ThrArg: 3.637 ± 0.39
4.677ThrSer: 4.677 ± 0.556
4.677ThrThr: 4.677 ± 0.73
5.345ThrVal: 5.345 ± 0.711
1.188ThrTrp: 1.188 ± 0.296
1.188ThrTyr: 1.188 ± 0.349
0.0ThrXaa: 0.0 ± 0.0
Val
8.166ValAla: 8.166 ± 0.759
0.891ValCys: 0.891 ± 0.26
5.196ValAsp: 5.196 ± 0.676
4.677ValGlu: 4.677 ± 0.622
1.93ValPhe: 1.93 ± 0.544
5.048ValGly: 5.048 ± 0.607
1.262ValHis: 1.262 ± 0.323
2.524ValIle: 2.524 ± 0.399
2.375ValLys: 2.375 ± 0.538
4.528ValLeu: 4.528 ± 0.718
1.262ValMet: 1.262 ± 0.269
2.227ValAsn: 2.227 ± 0.367
3.415ValPro: 3.415 ± 0.5
2.524ValGln: 2.524 ± 0.54
6.607ValArg: 6.607 ± 0.702
3.86ValSer: 3.86 ± 0.659
5.79ValThr: 5.79 ± 0.789
4.825ValVal: 4.825 ± 0.642
1.336ValTrp: 1.336 ± 0.338
1.707ValTyr: 1.707 ± 0.406
0.0ValXaa: 0.0 ± 0.0
Trp
2.45TrpAla: 2.45 ± 0.387
0.223TrpCys: 0.223 ± 0.112
1.856TrpAsp: 1.856 ± 0.358
1.262TrpGlu: 1.262 ± 0.28
0.668TrpPhe: 0.668 ± 0.225
0.965TrpGly: 0.965 ± 0.228
0.817TrpHis: 0.817 ± 0.217
0.371TrpIle: 0.371 ± 0.169
0.742TrpLys: 0.742 ± 0.205
1.262TrpLeu: 1.262 ± 0.308
0.52TrpMet: 0.52 ± 0.18
0.445TrpAsn: 0.445 ± 0.161
1.188TrpPro: 1.188 ± 0.262
0.445TrpGln: 0.445 ± 0.155
2.079TrpArg: 2.079 ± 0.462
1.262TrpSer: 1.262 ± 0.313
1.114TrpThr: 1.114 ± 0.347
1.707TrpVal: 1.707 ± 0.41
0.668TrpTrp: 0.668 ± 0.215
0.445TrpTyr: 0.445 ± 0.181
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.375TyrAla: 2.375 ± 0.503
0.371TyrCys: 0.371 ± 0.132
1.559TyrAsp: 1.559 ± 0.383
1.633TyrGlu: 1.633 ± 0.291
0.52TyrPhe: 0.52 ± 0.194
2.672TyrGly: 2.672 ± 0.383
0.52TyrHis: 0.52 ± 0.256
0.668TyrIle: 0.668 ± 0.172
0.52TyrLys: 0.52 ± 0.205
2.079TyrLeu: 2.079 ± 0.395
0.371TyrMet: 0.371 ± 0.142
0.371TyrAsn: 0.371 ± 0.213
1.262TyrPro: 1.262 ± 0.317
1.336TyrGln: 1.336 ± 0.318
1.707TyrArg: 1.707 ± 0.359
1.188TyrSer: 1.188 ± 0.296
1.633TyrThr: 1.633 ± 0.328
2.375TyrVal: 2.375 ± 0.34
0.594TyrTrp: 0.594 ± 0.226
0.52TyrTyr: 0.52 ± 0.181
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (13472 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski