Amino acid dipepetide frequency for Gordonia phage Frokostdame

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.234AlaAla: 16.234 ± 1.234
0.765AlaCys: 0.765 ± 0.238
7.47AlaAsp: 7.47 ± 0.732
7.117AlaGlu: 7.117 ± 0.85
3.294AlaPhe: 3.294 ± 0.469
8.294AlaGly: 8.294 ± 0.909
1.706AlaHis: 1.706 ± 0.282
5.529AlaIle: 5.529 ± 0.541
4.235AlaLys: 4.235 ± 0.703
10.235AlaLeu: 10.235 ± 0.901
2.882AlaMet: 2.882 ± 0.519
3.294AlaAsn: 3.294 ± 0.447
5.176AlaPro: 5.176 ± 0.516
4.47AlaGln: 4.47 ± 0.539
8.117AlaArg: 8.117 ± 0.794
6.0AlaSer: 6.0 ± 0.564
6.764AlaThr: 6.764 ± 0.858
7.647AlaVal: 7.647 ± 1.012
1.765AlaTrp: 1.765 ± 0.295
1.647AlaTyr: 1.647 ± 0.286
0.0AlaXaa: 0.0 ± 0.0
Cys
0.823CysAla: 0.823 ± 0.221
0.118CysCys: 0.118 ± 0.109
1.471CysAsp: 1.471 ± 0.376
0.294CysGlu: 0.294 ± 0.133
0.0CysPhe: 0.0 ± 0.0
1.059CysGly: 1.059 ± 0.341
0.588CysHis: 0.588 ± 0.232
0.235CysIle: 0.235 ± 0.154
0.235CysLys: 0.235 ± 0.139
0.471CysLeu: 0.471 ± 0.174
0.412CysMet: 0.412 ± 0.161
0.471CysAsn: 0.471 ± 0.172
0.765CysPro: 0.765 ± 0.217
0.471CysGln: 0.471 ± 0.169
0.882CysArg: 0.882 ± 0.252
0.529CysSer: 0.529 ± 0.203
0.588CysThr: 0.588 ± 0.184
0.529CysVal: 0.529 ± 0.186
0.353CysTrp: 0.353 ± 0.172
0.059CysTyr: 0.059 ± 0.065
0.0CysXaa: 0.0 ± 0.0
Asp
7.941AspAla: 7.941 ± 0.719
0.471AspCys: 0.471 ± 0.147
5.706AspAsp: 5.706 ± 0.752
5.059AspGlu: 5.059 ± 0.718
1.882AspPhe: 1.882 ± 0.352
6.411AspGly: 6.411 ± 0.686
1.529AspHis: 1.529 ± 0.385
2.823AspIle: 2.823 ± 0.345
1.529AspLys: 1.529 ± 0.317
6.294AspLeu: 6.294 ± 0.673
1.235AspMet: 1.235 ± 0.261
2.176AspAsn: 2.176 ± 0.329
4.412AspPro: 4.412 ± 0.463
2.294AspGln: 2.294 ± 0.394
5.764AspArg: 5.764 ± 0.848
3.529AspSer: 3.529 ± 0.389
4.294AspThr: 4.294 ± 0.565
5.588AspVal: 5.588 ± 0.637
0.941AspTrp: 0.941 ± 0.219
1.706AspTyr: 1.706 ± 0.347
0.0AspXaa: 0.0 ± 0.0
Glu
6.353GluAla: 6.353 ± 0.845
0.529GluCys: 0.529 ± 0.191
2.647GluAsp: 2.647 ± 0.407
2.823GluGlu: 2.823 ± 0.601
2.118GluPhe: 2.118 ± 0.372
3.588GluGly: 3.588 ± 0.433
1.765GluHis: 1.765 ± 0.334
2.706GluIle: 2.706 ± 0.405
2.294GluLys: 2.294 ± 0.362
5.529GluLeu: 5.529 ± 0.892
1.059GluMet: 1.059 ± 0.187
1.118GluAsn: 1.118 ± 0.298
3.588GluPro: 3.588 ± 0.857
2.647GluGln: 2.647 ± 0.418
5.059GluArg: 5.059 ± 0.617
3.117GluSer: 3.117 ± 0.508
3.117GluThr: 3.117 ± 0.404
5.176GluVal: 5.176 ± 0.609
1.118GluTrp: 1.118 ± 0.263
1.882GluTyr: 1.882 ± 0.348
0.0GluXaa: 0.0 ± 0.0
Phe
2.823PheAla: 2.823 ± 0.427
0.294PheCys: 0.294 ± 0.112
2.0PheAsp: 2.0 ± 0.292
1.588PheGlu: 1.588 ± 0.265
0.823PhePhe: 0.823 ± 0.221
2.882PheGly: 2.882 ± 0.394
0.588PheHis: 0.588 ± 0.169
0.882PheIle: 0.882 ± 0.253
0.941PheLys: 0.941 ± 0.273
1.647PheLeu: 1.647 ± 0.408
0.823PheMet: 0.823 ± 0.218
0.941PheAsn: 0.941 ± 0.233
1.471PhePro: 1.471 ± 0.247
0.588PheGln: 0.588 ± 0.174
2.0PheArg: 2.0 ± 0.345
1.412PheSer: 1.412 ± 0.295
2.0PheThr: 2.0 ± 0.354
2.235PheVal: 2.235 ± 0.345
0.235PheTrp: 0.235 ± 0.097
0.529PheTyr: 0.529 ± 0.151
0.0PheXaa: 0.0 ± 0.0
Gly
8.647GlyAla: 8.647 ± 1.178
0.647GlyCys: 0.647 ± 0.233
6.647GlyAsp: 6.647 ± 0.622
4.941GlyGlu: 4.941 ± 0.673
3.059GlyPhe: 3.059 ± 0.451
7.47GlyGly: 7.47 ± 0.774
1.412GlyHis: 1.412 ± 0.292
3.588GlyIle: 3.588 ± 0.534
3.059GlyLys: 3.059 ± 0.477
6.705GlyLeu: 6.705 ± 0.977
1.353GlyMet: 1.353 ± 0.271
3.0GlyAsn: 3.0 ± 0.465
3.764GlyPro: 3.764 ± 0.508
2.706GlyGln: 2.706 ± 0.384
7.117GlyArg: 7.117 ± 0.57
4.235GlySer: 4.235 ± 0.512
4.941GlyThr: 4.941 ± 0.735
6.353GlyVal: 6.353 ± 0.625
1.765GlyTrp: 1.765 ± 0.317
2.47GlyTyr: 2.47 ± 0.305
0.0GlyXaa: 0.0 ± 0.0
His
2.235HisAla: 2.235 ± 0.355
0.353HisCys: 0.353 ± 0.163
1.471HisAsp: 1.471 ± 0.279
1.0HisGlu: 1.0 ± 0.202
0.765HisPhe: 0.765 ± 0.207
1.941HisGly: 1.941 ± 0.294
0.471HisHis: 0.471 ± 0.208
0.765HisIle: 0.765 ± 0.19
0.294HisLys: 0.294 ± 0.132
1.882HisLeu: 1.882 ± 0.293
0.176HisMet: 0.176 ± 0.093
0.235HisAsn: 0.235 ± 0.126
1.823HisPro: 1.823 ± 0.374
0.529HisGln: 0.529 ± 0.242
1.647HisArg: 1.647 ± 0.41
1.176HisSer: 1.176 ± 0.267
1.529HisThr: 1.529 ± 0.306
1.588HisVal: 1.588 ± 0.366
0.412HisTrp: 0.412 ± 0.155
0.529HisTyr: 0.529 ± 0.165
0.0HisXaa: 0.0 ± 0.0
Ile
5.353IleAla: 5.353 ± 0.636
0.353IleCys: 0.353 ± 0.165
4.412IleAsp: 4.412 ± 0.609
2.706IleGlu: 2.706 ± 0.478
0.823IlePhe: 0.823 ± 0.228
4.412IleGly: 4.412 ± 0.601
0.647IleHis: 0.647 ± 0.203
1.706IleIle: 1.706 ± 0.318
1.706IleLys: 1.706 ± 0.389
2.118IleLeu: 2.118 ± 0.349
0.529IleMet: 0.529 ± 0.157
1.235IleAsn: 1.235 ± 0.263
3.059IlePro: 3.059 ± 0.448
0.941IleGln: 0.941 ± 0.2
4.0IleArg: 4.0 ± 0.584
1.529IleSer: 1.529 ± 0.303
3.588IleThr: 3.588 ± 0.46
3.823IleVal: 3.823 ± 0.449
0.471IleTrp: 0.471 ± 0.149
0.941IleTyr: 0.941 ± 0.238
0.0IleXaa: 0.0 ± 0.0
Lys
3.47LysAla: 3.47 ± 0.413
0.118LysCys: 0.118 ± 0.088
2.059LysAsp: 2.059 ± 0.385
1.412LysGlu: 1.412 ± 0.364
1.118LysPhe: 1.118 ± 0.249
2.882LysGly: 2.882 ± 0.515
0.471LysHis: 0.471 ± 0.183
1.765LysIle: 1.765 ± 0.297
1.706LysLys: 1.706 ± 0.301
2.941LysLeu: 2.941 ± 0.396
0.412LysMet: 0.412 ± 0.152
1.059LysAsn: 1.059 ± 0.231
2.529LysPro: 2.529 ± 0.401
0.706LysGln: 0.706 ± 0.228
2.176LysArg: 2.176 ± 0.4
1.412LysSer: 1.412 ± 0.321
2.47LysThr: 2.47 ± 0.323
2.823LysVal: 2.823 ± 0.55
0.588LysTrp: 0.588 ± 0.151
0.647LysTyr: 0.647 ± 0.261
0.0LysXaa: 0.0 ± 0.0
Leu
10.176LeuAla: 10.176 ± 0.978
1.0LeuCys: 1.0 ± 0.27
4.706LeuAsp: 4.706 ± 0.604
4.059LeuGlu: 4.059 ± 0.565
2.0LeuPhe: 2.0 ± 0.304
6.529LeuGly: 6.529 ± 0.8
1.0LeuHis: 1.0 ± 0.273
3.47LeuIle: 3.47 ± 0.547
1.765LeuLys: 1.765 ± 0.263
4.0LeuLeu: 4.0 ± 0.55
2.118LeuMet: 2.118 ± 0.414
2.118LeuAsn: 2.118 ± 0.379
4.117LeuPro: 4.117 ± 0.53
1.823LeuGln: 1.823 ± 0.314
5.47LeuArg: 5.47 ± 0.587
4.176LeuSer: 4.176 ± 0.494
6.0LeuThr: 6.0 ± 0.554
6.882LeuVal: 6.882 ± 0.61
1.823LeuTrp: 1.823 ± 0.35
1.588LeuTyr: 1.588 ± 0.269
0.0LeuXaa: 0.0 ± 0.0
Met
3.235MetAla: 3.235 ± 0.55
0.294MetCys: 0.294 ± 0.122
0.765MetAsp: 0.765 ± 0.231
0.823MetGlu: 0.823 ± 0.174
0.588MetPhe: 0.588 ± 0.187
1.412MetGly: 1.412 ± 0.271
0.471MetHis: 0.471 ± 0.165
1.059MetIle: 1.059 ± 0.242
0.588MetLys: 0.588 ± 0.167
1.529MetLeu: 1.529 ± 0.322
0.588MetMet: 0.588 ± 0.193
0.529MetAsn: 0.529 ± 0.158
2.0MetPro: 2.0 ± 0.381
0.647MetGln: 0.647 ± 0.172
1.706MetArg: 1.706 ± 0.524
1.882MetSer: 1.882 ± 0.362
2.529MetThr: 2.529 ± 0.339
0.706MetVal: 0.706 ± 0.268
0.529MetTrp: 0.529 ± 0.231
0.412MetTyr: 0.412 ± 0.126
0.0MetXaa: 0.0 ± 0.0
Asn
2.765AsnAla: 2.765 ± 0.412
0.235AsnCys: 0.235 ± 0.118
2.235AsnAsp: 2.235 ± 0.32
0.941AsnGlu: 0.941 ± 0.279
0.412AsnPhe: 0.412 ± 0.149
2.823AsnGly: 2.823 ± 0.459
1.059AsnHis: 1.059 ± 0.375
1.059AsnIle: 1.059 ± 0.271
0.706AsnLys: 0.706 ± 0.201
2.353AsnLeu: 2.353 ± 0.529
0.471AsnMet: 0.471 ± 0.161
1.235AsnAsn: 1.235 ± 0.259
3.059AsnPro: 3.059 ± 0.453
0.706AsnGln: 0.706 ± 0.178
1.765AsnArg: 1.765 ± 0.398
2.412AsnSer: 2.412 ± 0.427
2.47AsnThr: 2.47 ± 0.498
1.882AsnVal: 1.882 ± 0.341
0.765AsnTrp: 0.765 ± 0.222
0.823AsnTyr: 0.823 ± 0.218
0.0AsnXaa: 0.0 ± 0.0
Pro
5.47ProAla: 5.47 ± 0.676
0.706ProCys: 0.706 ± 0.232
4.706ProAsp: 4.706 ± 0.561
3.823ProGlu: 3.823 ± 0.53
1.529ProPhe: 1.529 ± 0.313
4.882ProGly: 4.882 ± 0.548
1.529ProHis: 1.529 ± 0.379
2.706ProIle: 2.706 ± 0.382
2.353ProLys: 2.353 ± 0.385
4.117ProLeu: 4.117 ± 0.463
1.471ProMet: 1.471 ± 0.388
1.588ProAsn: 1.588 ± 0.309
3.235ProPro: 3.235 ± 0.619
2.0ProGln: 2.0 ± 0.319
4.176ProArg: 4.176 ± 0.587
3.176ProSer: 3.176 ± 0.506
4.647ProThr: 4.647 ± 0.635
3.706ProVal: 3.706 ± 0.455
1.235ProTrp: 1.235 ± 0.275
1.176ProTyr: 1.176 ± 0.302
0.0ProXaa: 0.0 ± 0.0
Gln
3.647GlnAla: 3.647 ± 0.45
0.412GlnCys: 0.412 ± 0.151
1.471GlnAsp: 1.471 ± 0.289
1.353GlnGlu: 1.353 ± 0.298
0.882GlnPhe: 0.882 ± 0.236
2.176GlnGly: 2.176 ± 0.351
0.529GlnHis: 0.529 ± 0.177
1.471GlnIle: 1.471 ± 0.275
0.882GlnLys: 0.882 ± 0.199
2.941GlnLeu: 2.941 ± 0.389
1.118GlnMet: 1.118 ± 0.235
1.059GlnAsn: 1.059 ± 0.222
2.235GlnPro: 2.235 ± 0.409
2.47GlnGln: 2.47 ± 0.386
2.765GlnArg: 2.765 ± 0.433
2.176GlnSer: 2.176 ± 0.354
1.471GlnThr: 1.471 ± 0.239
2.47GlnVal: 2.47 ± 0.344
0.765GlnTrp: 0.765 ± 0.203
1.0GlnTyr: 1.0 ± 0.239
0.0GlnXaa: 0.0 ± 0.0
Arg
7.823ArgAla: 7.823 ± 0.784
1.294ArgCys: 1.294 ± 0.313
6.882ArgAsp: 6.882 ± 0.637
4.647ArgGlu: 4.647 ± 0.578
1.882ArgPhe: 1.882 ± 0.336
6.117ArgGly: 6.117 ± 0.618
2.235ArgHis: 2.235 ± 0.331
4.0ArgIle: 4.0 ± 0.52
3.0ArgLys: 3.0 ± 0.424
5.411ArgLeu: 5.411 ± 0.513
2.235ArgMet: 2.235 ± 0.362
2.588ArgAsn: 2.588 ± 0.295
3.706ArgPro: 3.706 ± 0.487
2.588ArgGln: 2.588 ± 0.454
8.0ArgArg: 8.0 ± 1.084
3.353ArgSer: 3.353 ± 0.449
4.588ArgThr: 4.588 ± 0.523
5.588ArgVal: 5.588 ± 0.594
1.353ArgTrp: 1.353 ± 0.368
2.235ArgTyr: 2.235 ± 0.398
0.0ArgXaa: 0.0 ± 0.0
Ser
5.47SerAla: 5.47 ± 0.583
0.294SerCys: 0.294 ± 0.131
3.294SerAsp: 3.294 ± 0.513
3.588SerGlu: 3.588 ± 0.466
1.294SerPhe: 1.294 ± 0.274
5.353SerGly: 5.353 ± 0.759
1.0SerHis: 1.0 ± 0.26
2.823SerIle: 2.823 ± 0.386
1.823SerLys: 1.823 ± 0.341
2.765SerLeu: 2.765 ± 0.466
1.176SerMet: 1.176 ± 0.248
1.823SerAsn: 1.823 ± 0.309
3.0SerPro: 3.0 ± 0.415
1.294SerGln: 1.294 ± 0.29
4.235SerArg: 4.235 ± 0.49
3.059SerSer: 3.059 ± 0.486
3.823SerThr: 3.823 ± 0.549
3.882SerVal: 3.882 ± 0.609
1.353SerTrp: 1.353 ± 0.286
0.882SerTyr: 0.882 ± 0.218
0.0SerXaa: 0.0 ± 0.0
Thr
7.823ThrAla: 7.823 ± 0.883
0.882ThrCys: 0.882 ± 0.233
4.941ThrAsp: 4.941 ± 0.519
4.529ThrGlu: 4.529 ± 0.528
1.471ThrPhe: 1.471 ± 0.333
5.294ThrGly: 5.294 ± 0.793
1.529ThrHis: 1.529 ± 0.325
3.353ThrIle: 3.353 ± 0.472
2.176ThrLys: 2.176 ± 0.312
4.764ThrLeu: 4.764 ± 0.637
1.176ThrMet: 1.176 ± 0.266
2.353ThrAsn: 2.353 ± 0.427
4.0ThrPro: 4.0 ± 0.359
2.706ThrGln: 2.706 ± 0.419
4.647ThrArg: 4.647 ± 0.493
3.235ThrSer: 3.235 ± 0.484
4.941ThrThr: 4.941 ± 0.505
6.176ThrVal: 6.176 ± 0.586
1.118ThrTrp: 1.118 ± 0.256
1.529ThrTyr: 1.529 ± 0.359
0.0ThrXaa: 0.0 ± 0.0
Val
8.411ValAla: 8.411 ± 0.822
1.0ValCys: 1.0 ± 0.265
6.294ValAsp: 6.294 ± 0.631
5.47ValGlu: 5.47 ± 0.615
1.647ValPhe: 1.647 ± 0.357
6.588ValGly: 6.588 ± 0.677
1.294ValHis: 1.294 ± 0.238
3.117ValIle: 3.117 ± 0.454
2.176ValLys: 2.176 ± 0.38
5.411ValLeu: 5.411 ± 0.57
2.118ValMet: 2.118 ± 0.412
1.941ValAsn: 1.941 ± 0.36
4.0ValPro: 4.0 ± 0.525
2.353ValGln: 2.353 ± 0.417
6.294ValArg: 6.294 ± 0.655
2.882ValSer: 2.882 ± 0.483
6.529ValThr: 6.529 ± 0.619
6.823ValVal: 6.823 ± 0.639
1.294ValTrp: 1.294 ± 0.296
1.882ValTyr: 1.882 ± 0.296
0.0ValXaa: 0.0 ± 0.0
Trp
1.588TrpAla: 1.588 ± 0.346
0.235TrpCys: 0.235 ± 0.134
1.176TrpAsp: 1.176 ± 0.365
0.706TrpGlu: 0.706 ± 0.251
0.529TrpPhe: 0.529 ± 0.22
1.353TrpGly: 1.353 ± 0.295
0.294TrpHis: 0.294 ± 0.17
0.647TrpIle: 0.647 ± 0.163
0.471TrpLys: 0.471 ± 0.146
2.059TrpLeu: 2.059 ± 0.336
0.529TrpMet: 0.529 ± 0.143
0.765TrpAsn: 0.765 ± 0.306
1.353TrpPro: 1.353 ± 0.305
0.706TrpGln: 0.706 ± 0.193
1.765TrpArg: 1.765 ± 0.269
1.471TrpSer: 1.471 ± 0.303
0.882TrpThr: 0.882 ± 0.196
1.353TrpVal: 1.353 ± 0.286
0.353TrpTrp: 0.353 ± 0.131
0.471TrpTyr: 0.471 ± 0.184
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.353TyrAla: 2.353 ± 0.441
0.353TyrCys: 0.353 ± 0.167
1.235TyrAsp: 1.235 ± 0.24
1.353TyrGlu: 1.353 ± 0.361
0.588TyrPhe: 0.588 ± 0.203
2.412TyrGly: 2.412 ± 0.444
0.706TyrHis: 0.706 ± 0.24
0.647TyrIle: 0.647 ± 0.197
0.823TyrLys: 0.823 ± 0.211
1.647TyrLeu: 1.647 ± 0.353
0.353TyrMet: 0.353 ± 0.117
0.647TyrAsn: 0.647 ± 0.198
1.0TyrPro: 1.0 ± 0.261
0.647TyrGln: 0.647 ± 0.253
1.941TyrArg: 1.941 ± 0.441
1.471TyrSer: 1.471 ± 0.264
1.471TyrThr: 1.471 ± 0.31
2.294TyrVal: 2.294 ± 0.372
0.471TyrTrp: 0.471 ± 0.165
0.823TyrTyr: 0.823 ± 0.21
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (17002 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski