Amino acid dipepetide frequency for Gordonia phage GMA3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.565AlaAla: 11.565 ± 2.781
0.826AlaCys: 0.826 ± 0.214
5.782AlaAsp: 5.782 ± 0.431
5.204AlaGlu: 5.204 ± 0.56
2.974AlaPhe: 2.974 ± 0.322
6.939AlaGly: 6.939 ± 1.091
1.074AlaHis: 1.074 ± 0.243
4.833AlaIle: 4.833 ± 0.86
4.874AlaLys: 4.874 ± 0.686
6.774AlaLeu: 6.774 ± 0.667
2.561AlaMet: 2.561 ± 0.411
3.883AlaAsn: 3.883 ± 0.44
4.543AlaPro: 4.543 ± 0.73
3.717AlaGln: 3.717 ± 0.83
5.039AlaArg: 5.039 ± 0.53
5.7AlaSer: 5.7 ± 0.64
5.948AlaThr: 5.948 ± 0.943
5.617AlaVal: 5.617 ± 0.521
1.693AlaTrp: 1.693 ± 0.331
2.148AlaTyr: 2.148 ± 0.321
0.0AlaXaa: 0.0 ± 0.0
Cys
0.785CysAla: 0.785 ± 0.194
0.124CysCys: 0.124 ± 0.069
0.743CysAsp: 0.743 ± 0.193
0.62CysGlu: 0.62 ± 0.174
0.248CysPhe: 0.248 ± 0.12
0.743CysGly: 0.743 ± 0.234
0.248CysHis: 0.248 ± 0.106
0.62CysIle: 0.62 ± 0.298
0.62CysLys: 0.62 ± 0.212
0.867CysLeu: 0.867 ± 0.23
0.165CysMet: 0.165 ± 0.081
0.289CysAsn: 0.289 ± 0.148
0.62CysPro: 0.62 ± 0.196
0.248CysGln: 0.248 ± 0.099
0.702CysArg: 0.702 ± 0.207
0.578CysSer: 0.578 ± 0.145
0.496CysThr: 0.496 ± 0.161
0.661CysVal: 0.661 ± 0.176
0.165CysTrp: 0.165 ± 0.079
0.124CysTyr: 0.124 ± 0.071
0.0CysXaa: 0.0 ± 0.0
Asp
5.824AspAla: 5.824 ± 0.492
0.62AspCys: 0.62 ± 0.138
5.163AspAsp: 5.163 ± 0.676
4.254AspGlu: 4.254 ± 0.614
2.024AspPhe: 2.024 ± 0.295
4.585AspGly: 4.585 ± 0.411
1.115AspHis: 1.115 ± 0.276
4.709AspIle: 4.709 ± 0.401
3.759AspLys: 3.759 ± 0.393
4.172AspLeu: 4.172 ± 0.342
1.735AspMet: 1.735 ± 0.275
2.726AspAsn: 2.726 ± 0.405
3.841AspPro: 3.841 ± 0.506
2.189AspGln: 2.189 ± 0.316
3.015AspArg: 3.015 ± 0.471
5.246AspSer: 5.246 ± 0.544
3.304AspThr: 3.304 ± 0.496
3.676AspVal: 3.676 ± 0.374
1.28AspTrp: 1.28 ± 0.224
2.189AspTyr: 2.189 ± 0.338
0.0AspXaa: 0.0 ± 0.0
Glu
5.411GluAla: 5.411 ± 0.481
0.743GluCys: 0.743 ± 0.203
3.263GluAsp: 3.263 ± 0.329
3.552GluGlu: 3.552 ± 0.503
2.809GluPhe: 2.809 ± 0.458
2.809GluGly: 2.809 ± 0.316
0.991GluHis: 0.991 ± 0.223
4.667GluIle: 4.667 ± 0.451
3.759GluLys: 3.759 ± 0.441
5.122GluLeu: 5.122 ± 0.508
2.065GluMet: 2.065 ± 0.324
3.18GluAsn: 3.18 ± 0.329
2.106GluPro: 2.106 ± 0.313
2.396GluGln: 2.396 ± 0.388
3.759GluArg: 3.759 ± 0.525
3.841GluSer: 3.841 ± 0.42
3.222GluThr: 3.222 ± 0.32
4.213GluVal: 4.213 ± 0.37
1.074GluTrp: 1.074 ± 0.226
2.396GluTyr: 2.396 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
2.85PheAla: 2.85 ± 0.352
0.496PheCys: 0.496 ± 0.166
2.974PheAsp: 2.974 ± 0.382
1.983PheGlu: 1.983 ± 0.272
1.487PhePhe: 1.487 ± 0.256
3.8PheGly: 3.8 ± 0.344
0.496PheHis: 0.496 ± 0.15
2.23PheIle: 2.23 ± 0.287
1.859PheLys: 1.859 ± 0.31
1.652PheLeu: 1.652 ± 0.242
1.487PheMet: 1.487 ± 0.237
1.9PheAsn: 1.9 ± 0.258
1.487PhePro: 1.487 ± 0.279
1.611PheGln: 1.611 ± 0.273
2.148PheArg: 2.148 ± 0.285
2.602PheSer: 2.602 ± 0.362
2.354PheThr: 2.354 ± 0.24
2.065PheVal: 2.065 ± 0.262
0.578PheTrp: 0.578 ± 0.174
1.198PheTyr: 1.198 ± 0.192
0.0PheXaa: 0.0 ± 0.0
Gly
6.815GlyAla: 6.815 ± 1.204
0.578GlyCys: 0.578 ± 0.201
5.535GlyAsp: 5.535 ± 1.034
4.296GlyGlu: 4.296 ± 0.411
2.272GlyPhe: 2.272 ± 0.344
6.98GlyGly: 6.98 ± 1.19
1.693GlyHis: 1.693 ± 0.309
5.08GlyIle: 5.08 ± 0.497
5.411GlyLys: 5.411 ± 0.478
5.659GlyLeu: 5.659 ± 0.578
2.726GlyMet: 2.726 ± 0.395
3.593GlyAsn: 3.593 ± 0.444
3.222GlyPro: 3.222 ± 0.501
2.437GlyGln: 2.437 ± 0.338
3.676GlyArg: 3.676 ± 0.406
5.782GlySer: 5.782 ± 0.658
5.782GlyThr: 5.782 ± 0.611
6.03GlyVal: 6.03 ± 0.436
1.363GlyTrp: 1.363 ± 0.228
3.18GlyTyr: 3.18 ± 0.403
0.0GlyXaa: 0.0 ± 0.0
His
0.867HisAla: 0.867 ± 0.237
0.207HisCys: 0.207 ± 0.091
0.867HisAsp: 0.867 ± 0.171
1.156HisGlu: 1.156 ± 0.249
0.661HisPhe: 0.661 ± 0.186
0.867HisGly: 0.867 ± 0.19
0.372HisHis: 0.372 ± 0.144
0.743HisIle: 0.743 ± 0.199
0.991HisLys: 0.991 ± 0.209
1.239HisLeu: 1.239 ± 0.21
0.578HisMet: 0.578 ± 0.181
0.743HisAsn: 0.743 ± 0.134
0.95HisPro: 0.95 ± 0.249
0.743HisGln: 0.743 ± 0.146
0.785HisArg: 0.785 ± 0.204
1.156HisSer: 1.156 ± 0.23
0.826HisThr: 0.826 ± 0.16
1.28HisVal: 1.28 ± 0.269
0.33HisTrp: 0.33 ± 0.188
0.33HisTyr: 0.33 ± 0.133
0.0HisXaa: 0.0 ± 0.0
Ile
5.948IleAla: 5.948 ± 0.494
0.661IleCys: 0.661 ± 0.176
4.833IleAsp: 4.833 ± 0.491
4.213IleGlu: 4.213 ± 0.397
1.693IlePhe: 1.693 ± 0.269
4.419IleGly: 4.419 ± 0.569
0.867IleHis: 0.867 ± 0.218
3.056IleIle: 3.056 ± 0.39
3.222IleLys: 3.222 ± 0.337
4.254IleLeu: 4.254 ± 0.384
1.487IleMet: 1.487 ± 0.191
2.85IleAsn: 2.85 ± 0.369
3.552IlePro: 3.552 ± 0.433
2.23IleGln: 2.23 ± 0.231
2.891IleArg: 2.891 ± 0.253
3.759IleSer: 3.759 ± 0.436
3.924IleThr: 3.924 ± 0.38
4.337IleVal: 4.337 ± 0.404
1.198IleTrp: 1.198 ± 0.256
1.446IleTyr: 1.446 ± 0.25
0.0IleXaa: 0.0 ± 0.0
Lys
5.163LysAla: 5.163 ± 0.662
0.33LysCys: 0.33 ± 0.147
3.015LysAsp: 3.015 ± 0.43
2.478LysGlu: 2.478 ± 0.369
2.891LysPhe: 2.891 ± 0.321
4.213LysGly: 4.213 ± 0.755
1.198LysHis: 1.198 ± 0.271
3.428LysIle: 3.428 ± 0.341
4.296LysLys: 4.296 ± 0.529
4.13LysLeu: 4.13 ± 0.539
1.693LysMet: 1.693 ± 0.25
3.139LysAsn: 3.139 ± 0.276
3.139LysPro: 3.139 ± 0.429
1.983LysGln: 1.983 ± 0.324
3.139LysArg: 3.139 ± 0.385
3.841LysSer: 3.841 ± 0.458
3.015LysThr: 3.015 ± 0.318
3.552LysVal: 3.552 ± 0.39
1.033LysTrp: 1.033 ± 0.209
1.57LysTyr: 1.57 ± 0.253
0.0LysXaa: 0.0 ± 0.0
Leu
7.063LeuAla: 7.063 ± 0.751
0.537LeuCys: 0.537 ± 0.167
4.543LeuAsp: 4.543 ± 0.433
5.122LeuGlu: 5.122 ± 0.565
2.809LeuPhe: 2.809 ± 0.382
5.824LeuGly: 5.824 ± 0.61
0.661LeuHis: 0.661 ± 0.187
4.089LeuIle: 4.089 ± 0.401
3.263LeuLys: 3.263 ± 0.372
4.502LeuLeu: 4.502 ± 0.467
1.941LeuMet: 1.941 ± 0.235
3.717LeuAsn: 3.717 ± 0.412
3.469LeuPro: 3.469 ± 0.425
1.57LeuGln: 1.57 ± 0.26
3.18LeuArg: 3.18 ± 0.311
4.461LeuSer: 4.461 ± 0.428
5.08LeuThr: 5.08 ± 0.438
4.998LeuVal: 4.998 ± 0.374
0.826LeuTrp: 0.826 ± 0.149
2.065LeuTyr: 2.065 ± 0.366
0.0LeuXaa: 0.0 ± 0.0
Met
2.396MetAla: 2.396 ± 0.285
0.33MetCys: 0.33 ± 0.109
1.487MetAsp: 1.487 ± 0.24
1.983MetGlu: 1.983 ± 0.267
0.867MetPhe: 0.867 ± 0.169
2.478MetGly: 2.478 ± 0.41
0.372MetHis: 0.372 ± 0.159
1.611MetIle: 1.611 ± 0.278
1.57MetLys: 1.57 ± 0.319
2.024MetLeu: 2.024 ± 0.279
0.867MetMet: 0.867 ± 0.18
1.735MetAsn: 1.735 ± 0.217
0.826MetPro: 0.826 ± 0.229
1.033MetGln: 1.033 ± 0.221
1.652MetArg: 1.652 ± 0.24
1.941MetSer: 1.941 ± 0.325
2.23MetThr: 2.23 ± 0.332
1.487MetVal: 1.487 ± 0.286
0.413MetTrp: 0.413 ± 0.125
0.661MetTyr: 0.661 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
4.172AsnAla: 4.172 ± 0.673
0.207AsnCys: 0.207 ± 0.106
2.85AsnAsp: 2.85 ± 0.37
2.313AsnGlu: 2.313 ± 0.315
2.065AsnPhe: 2.065 ± 0.288
4.502AsnGly: 4.502 ± 0.485
1.115AsnHis: 1.115 ± 0.24
2.685AsnIle: 2.685 ± 0.321
2.354AsnLys: 2.354 ± 0.293
3.676AsnLeu: 3.676 ± 0.377
0.785AsnMet: 0.785 ± 0.16
2.396AsnAsn: 2.396 ± 0.29
3.015AsnPro: 3.015 ± 0.33
1.735AsnGln: 1.735 ± 0.283
3.098AsnArg: 3.098 ± 0.328
3.056AsnSer: 3.056 ± 0.341
2.809AsnThr: 2.809 ± 0.262
3.387AsnVal: 3.387 ± 0.396
0.661AsnTrp: 0.661 ± 0.144
1.156AsnTyr: 1.156 ± 0.185
0.0AsnXaa: 0.0 ± 0.0
Pro
4.998ProAla: 4.998 ± 0.735
0.33ProCys: 0.33 ± 0.132
2.726ProAsp: 2.726 ± 0.319
3.387ProGlu: 3.387 ± 0.415
1.859ProPhe: 1.859 ± 0.251
4.172ProGly: 4.172 ± 0.505
0.867ProHis: 0.867 ± 0.212
2.272ProIle: 2.272 ± 0.325
2.478ProLys: 2.478 ± 0.294
3.098ProLeu: 3.098 ± 0.45
0.991ProMet: 0.991 ± 0.165
2.024ProAsn: 2.024 ± 0.372
2.106ProPro: 2.106 ± 0.375
1.652ProGln: 1.652 ± 0.29
1.693ProArg: 1.693 ± 0.335
3.346ProSer: 3.346 ± 0.32
2.891ProThr: 2.891 ± 0.358
3.552ProVal: 3.552 ± 0.479
0.578ProTrp: 0.578 ± 0.184
1.57ProTyr: 1.57 ± 0.276
0.0ProXaa: 0.0 ± 0.0
Gln
2.478GlnAla: 2.478 ± 0.418
0.248GlnCys: 0.248 ± 0.083
1.983GlnAsp: 1.983 ± 0.272
1.776GlnGlu: 1.776 ± 0.295
1.487GlnPhe: 1.487 ± 0.295
2.85GlnGly: 2.85 ± 0.399
0.165GlnHis: 0.165 ± 0.075
2.643GlnIle: 2.643 ± 0.325
2.478GlnLys: 2.478 ± 0.492
2.643GlnLeu: 2.643 ± 0.35
1.363GlnMet: 1.363 ± 0.244
1.941GlnAsn: 1.941 ± 0.26
1.239GlnPro: 1.239 ± 0.252
1.487GlnGln: 1.487 ± 0.475
2.106GlnArg: 2.106 ± 0.224
2.313GlnSer: 2.313 ± 0.259
1.693GlnThr: 1.693 ± 0.244
2.23GlnVal: 2.23 ± 0.326
0.62GlnTrp: 0.62 ± 0.159
1.239GlnTyr: 1.239 ± 0.235
0.0GlnXaa: 0.0 ± 0.0
Arg
4.998ArgAla: 4.998 ± 0.568
0.702ArgCys: 0.702 ± 0.2
3.304ArgAsp: 3.304 ± 0.431
3.511ArgGlu: 3.511 ± 0.487
1.528ArgPhe: 1.528 ± 0.242
3.676ArgGly: 3.676 ± 0.444
0.95ArgHis: 0.95 ± 0.292
4.006ArgIle: 4.006 ± 0.373
3.883ArgLys: 3.883 ± 0.445
3.511ArgLeu: 3.511 ± 0.377
1.652ArgMet: 1.652 ± 0.314
2.85ArgAsn: 2.85 ± 0.417
2.726ArgPro: 2.726 ± 0.446
2.065ArgGln: 2.065 ± 0.339
2.85ArgArg: 2.85 ± 0.526
2.602ArgSer: 2.602 ± 0.331
2.809ArgThr: 2.809 ± 0.305
2.933ArgVal: 2.933 ± 0.423
0.62ArgTrp: 0.62 ± 0.156
1.859ArgTyr: 1.859 ± 0.338
0.0ArgXaa: 0.0 ± 0.0
Ser
5.824SerAla: 5.824 ± 0.556
0.496SerCys: 0.496 ± 0.162
4.13SerAsp: 4.13 ± 0.542
4.089SerGlu: 4.089 ± 0.516
2.643SerPhe: 2.643 ± 0.342
6.691SerGly: 6.691 ± 0.711
0.702SerHis: 0.702 ± 0.133
4.13SerIle: 4.13 ± 0.459
3.304SerLys: 3.304 ± 0.426
4.709SerLeu: 4.709 ± 0.521
1.859SerMet: 1.859 ± 0.275
2.685SerAsn: 2.685 ± 0.283
2.602SerPro: 2.602 ± 0.392
2.437SerGln: 2.437 ± 0.282
2.974SerArg: 2.974 ± 0.344
4.709SerSer: 4.709 ± 0.466
4.006SerThr: 4.006 ± 0.337
4.419SerVal: 4.419 ± 0.421
0.826SerTrp: 0.826 ± 0.198
1.983SerTyr: 1.983 ± 0.275
0.0SerXaa: 0.0 ± 0.0
Thr
6.278ThrAla: 6.278 ± 0.805
0.743ThrCys: 0.743 ± 0.252
3.635ThrAsp: 3.635 ± 0.453
4.006ThrGlu: 4.006 ± 0.43
2.065ThrPhe: 2.065 ± 0.352
7.146ThrGly: 7.146 ± 0.807
0.867ThrHis: 0.867 ± 0.234
3.139ThrIle: 3.139 ± 0.312
2.643ThrLys: 2.643 ± 0.269
4.502ThrLeu: 4.502 ± 0.423
1.322ThrMet: 1.322 ± 0.191
2.148ThrAsn: 2.148 ± 0.272
2.643ThrPro: 2.643 ± 0.412
2.024ThrGln: 2.024 ± 0.258
3.139ThrArg: 3.139 ± 0.343
3.428ThrSer: 3.428 ± 0.374
2.767ThrThr: 2.767 ± 0.488
4.956ThrVal: 4.956 ± 0.437
0.991ThrTrp: 0.991 ± 0.213
1.693ThrTyr: 1.693 ± 0.24
0.0ThrXaa: 0.0 ± 0.0
Val
4.791ValAla: 4.791 ± 0.518
0.743ValCys: 0.743 ± 0.183
5.163ValAsp: 5.163 ± 0.552
4.502ValGlu: 4.502 ± 0.521
2.767ValPhe: 2.767 ± 0.256
4.998ValGly: 4.998 ± 0.456
1.198ValHis: 1.198 ± 0.201
4.254ValIle: 4.254 ± 0.413
3.593ValLys: 3.593 ± 0.38
3.759ValLeu: 3.759 ± 0.417
1.28ValMet: 1.28 ± 0.195
3.759ValAsn: 3.759 ± 0.485
3.511ValPro: 3.511 ± 0.428
1.983ValGln: 1.983 ± 0.284
3.841ValArg: 3.841 ± 0.367
4.213ValSer: 4.213 ± 0.4
4.667ValThr: 4.667 ± 0.67
3.924ValVal: 3.924 ± 0.406
0.991ValTrp: 0.991 ± 0.203
2.313ValTyr: 2.313 ± 0.432
0.0ValXaa: 0.0 ± 0.0
Trp
0.991TrpAla: 0.991 ± 0.255
0.165TrpCys: 0.165 ± 0.073
0.991TrpAsp: 0.991 ± 0.241
1.156TrpGlu: 1.156 ± 0.19
0.702TrpPhe: 0.702 ± 0.175
0.909TrpGly: 0.909 ± 0.191
0.289TrpHis: 0.289 ± 0.117
0.909TrpIle: 0.909 ± 0.191
0.785TrpLys: 0.785 ± 0.153
1.115TrpLeu: 1.115 ± 0.202
0.537TrpMet: 0.537 ± 0.128
1.322TrpAsn: 1.322 ± 0.222
0.289TrpPro: 0.289 ± 0.126
0.702TrpGln: 0.702 ± 0.177
1.115TrpArg: 1.115 ± 0.44
1.115TrpSer: 1.115 ± 0.193
0.785TrpThr: 0.785 ± 0.192
1.074TrpVal: 1.074 ± 0.247
0.165TrpTrp: 0.165 ± 0.067
0.702TrpTyr: 0.702 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.478TyrAla: 2.478 ± 0.362
0.578TyrCys: 0.578 ± 0.174
2.437TyrAsp: 2.437 ± 0.314
1.776TyrGlu: 1.776 ± 0.28
1.404TyrPhe: 1.404 ± 0.268
3.511TyrGly: 3.511 ± 0.448
0.454TyrHis: 0.454 ± 0.149
1.735TyrIle: 1.735 ± 0.281
2.148TyrLys: 2.148 ± 0.277
2.313TyrLeu: 2.313 ± 0.312
0.826TyrMet: 0.826 ± 0.314
1.156TyrAsn: 1.156 ± 0.212
0.702TyrPro: 0.702 ± 0.168
0.743TyrGln: 0.743 ± 0.17
2.23TyrArg: 2.23 ± 0.479
1.528TyrSer: 1.528 ± 0.336
1.528TyrThr: 1.528 ± 0.254
1.859TyrVal: 1.859 ± 0.271
0.413TyrTrp: 0.413 ± 0.114
1.115TyrTyr: 1.115 ± 0.212
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 104 proteins (24212 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski