Amino acid dipeptide frequency for Gordonia phage MintFen

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (1/400, or 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
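As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the one-letter codes, the mapping of every non-standard letter to `X` (Xaa), and the toy sequences are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25 %).
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2

toy = ["MAAAL", "MKU"]  # hypothetical sequences, not taken from MintFen
freqs = dipeptide_permille(toy)
for dipeptide, value in sorted(freqs.items()):
    print(f"{dipeptide}: {value:.3f} per mille")
```

With real data, `proteins` would be the list of protein sequences of the proteome, and each resulting value can be compared with `UNIFORM_PERMILLE`.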

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
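The unit conversion and the comparison with the uniform expectation can be sketched as follows. The three values are copied from the table; the helper names and the simple greater-than/less-than rule are assumptions, since the page does not state the exact criterion used for the red and blue marking.

```python
EXPECTED_PERMILLE = 1000 / 400  # uniform expectation: 2.5 per mille (0.25 %)


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def label(value, expected=EXPECTED_PERMILLE):
    """Assumed rule: compare directly with the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table (per mille).
table_values = {"AlaAla": 17.297, "CysCys": 0.126, "AspGln": 2.579}

for name, value in table_values.items():
    fraction = permille_to_fraction(value)
    print(f"{name}: {value} per mille = {fraction:.6f} = {fraction * 100:.4f} % ({label(value)})")
```

This rule ignores the reported error; AspGln, for example, lies above 2.5‰ by less than its ± value.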

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
17.297AlaAla: 17.297 ± 1.383
0.629AlaCys: 0.629 ± 0.227
8.051AlaAsp: 8.051 ± 0.727
6.478AlaGlu: 6.478 ± 0.664
3.271AlaPhe: 3.271 ± 0.614
9.309AlaGly: 9.309 ± 0.916
1.887AlaHis: 1.887 ± 0.389
5.912AlaIle: 5.912 ± 0.583
3.9AlaLys: 3.9 ± 0.619
10.755AlaLeu: 10.755 ± 0.791
2.893AlaMet: 2.893 ± 0.562
3.334AlaAsn: 3.334 ± 0.565
5.22AlaPro: 5.22 ± 0.597
3.963AlaGln: 3.963 ± 0.592
8.365AlaArg: 8.365 ± 0.879
6.101AlaSer: 6.101 ± 0.81
7.17AlaThr: 7.17 ± 0.654
8.302AlaVal: 8.302 ± 1.031
2.264AlaTrp: 2.264 ± 0.294
2.013AlaTyr: 2.013 ± 0.397
0.0AlaXaa: 0.0 ± 0.0
Cys
0.503CysAla: 0.503 ± 0.157
0.126CysCys: 0.126 ± 0.113
1.069CysAsp: 1.069 ± 0.334
0.252CysGlu: 0.252 ± 0.139
0.0CysPhe: 0.0 ± 0.0
0.818CysGly: 0.818 ± 0.223
0.503CysHis: 0.503 ± 0.18
0.126CysIle: 0.126 ± 0.082
0.126CysLys: 0.126 ± 0.082
0.755CysLeu: 0.755 ± 0.192
0.252CysMet: 0.252 ± 0.132
0.252CysAsn: 0.252 ± 0.131
0.881CysPro: 0.881 ± 0.253
0.314CysGln: 0.314 ± 0.139
0.503CysArg: 0.503 ± 0.183
0.377CysSer: 0.377 ± 0.143
0.755CysThr: 0.755 ± 0.206
0.314CysVal: 0.314 ± 0.135
0.252CysTrp: 0.252 ± 0.166
0.063CysTyr: 0.063 ± 0.063
0.0CysXaa: 0.0 ± 0.0
Asp
7.107AspAla: 7.107 ± 0.65
0.314AspCys: 0.314 ± 0.148
4.78AspAsp: 4.78 ± 0.67
4.591AspGlu: 4.591 ± 0.707
1.95AspPhe: 1.95 ± 0.413
7.044AspGly: 7.044 ± 0.855
1.95AspHis: 1.95 ± 0.372
2.893AspIle: 2.893 ± 0.461
1.51AspLys: 1.51 ± 0.352
5.787AspLeu: 5.787 ± 0.655
1.258AspMet: 1.258 ± 0.292
1.95AspAsn: 1.95 ± 0.483
4.214AspPro: 4.214 ± 0.543
2.579AspGln: 2.579 ± 0.363
4.591AspArg: 4.591 ± 0.674
3.9AspSer: 3.9 ± 0.429
4.214AspThr: 4.214 ± 0.529
5.535AspVal: 5.535 ± 0.723
1.006AspTrp: 1.006 ± 0.243
1.635AspTyr: 1.635 ± 0.353
0.0AspXaa: 0.0 ± 0.0
Glu
5.409GluAla: 5.409 ± 0.582
0.44GluCys: 0.44 ± 0.139
3.208GluAsp: 3.208 ± 0.406
2.013GluGlu: 2.013 ± 0.462
1.824GluPhe: 1.824 ± 0.332
3.963GluGly: 3.963 ± 0.479
1.258GluHis: 1.258 ± 0.262
2.516GluIle: 2.516 ± 0.364
2.076GluLys: 2.076 ± 0.337
4.843GluLeu: 4.843 ± 0.669
1.572GluMet: 1.572 ± 0.309
1.447GluAsn: 1.447 ± 0.262
3.711GluPro: 3.711 ± 0.669
3.271GluGln: 3.271 ± 0.545
4.088GluArg: 4.088 ± 0.525
2.705GluSer: 2.705 ± 0.444
2.327GluThr: 2.327 ± 0.279
5.095GluVal: 5.095 ± 0.532
0.943GluTrp: 0.943 ± 0.308
1.447GluTyr: 1.447 ± 0.351
0.0GluXaa: 0.0 ± 0.0
Phe
2.893PheAla: 2.893 ± 0.399
0.189PheCys: 0.189 ± 0.135
2.39PheAsp: 2.39 ± 0.324
1.761PheGlu: 1.761 ± 0.292
1.006PhePhe: 1.006 ± 0.268
2.327PheGly: 2.327 ± 0.377
0.377PheHis: 0.377 ± 0.187
1.006PheIle: 1.006 ± 0.344
1.069PheLys: 1.069 ± 0.301
1.761PheLeu: 1.761 ± 0.375
0.503PheMet: 0.503 ± 0.154
0.566PheAsn: 0.566 ± 0.205
1.824PhePro: 1.824 ± 0.313
0.755PheGln: 0.755 ± 0.173
1.95PheArg: 1.95 ± 0.305
1.824PheSer: 1.824 ± 0.515
2.201PheThr: 2.201 ± 0.364
2.893PheVal: 2.893 ± 0.386
0.189PheTrp: 0.189 ± 0.106
0.629PheTyr: 0.629 ± 0.162
0.0PheXaa: 0.0 ± 0.0
Gly
8.743GlyAla: 8.743 ± 1.022
0.44GlyCys: 0.44 ± 0.15
5.661GlyAsp: 5.661 ± 0.563
4.906GlyGlu: 4.906 ± 0.634
2.956GlyPhe: 2.956 ± 0.447
6.856GlyGly: 6.856 ± 0.78
1.761GlyHis: 1.761 ± 0.349
3.774GlyIle: 3.774 ± 0.587
3.459GlyLys: 3.459 ± 0.539
8.24GlyLeu: 8.24 ± 1.112
1.51GlyMet: 1.51 ± 0.295
2.327GlyAsn: 2.327 ± 0.347
3.837GlyPro: 3.837 ± 0.488
3.019GlyGln: 3.019 ± 0.362
6.478GlyArg: 6.478 ± 0.624
5.095GlySer: 5.095 ± 0.525
4.529GlyThr: 4.529 ± 0.534
6.227GlyVal: 6.227 ± 0.759
2.579GlyTrp: 2.579 ± 0.351
2.516GlyTyr: 2.516 ± 0.374
0.0GlyXaa: 0.0 ± 0.0
His
2.327HisAla: 2.327 ± 0.362
0.377HisCys: 0.377 ± 0.173
1.51HisAsp: 1.51 ± 0.289
1.069HisGlu: 1.069 ± 0.222
0.377HisPhe: 0.377 ± 0.196
1.572HisGly: 1.572 ± 0.394
0.629HisHis: 0.629 ± 0.234
0.881HisIle: 0.881 ± 0.261
0.566HisLys: 0.566 ± 0.189
2.138HisLeu: 2.138 ± 0.433
0.252HisMet: 0.252 ± 0.104
0.377HisAsn: 0.377 ± 0.148
1.698HisPro: 1.698 ± 0.322
0.755HisGln: 0.755 ± 0.268
1.698HisArg: 1.698 ± 0.277
1.006HisSer: 1.006 ± 0.306
1.069HisThr: 1.069 ± 0.308
1.887HisVal: 1.887 ± 0.401
0.44HisTrp: 0.44 ± 0.18
0.503HisTyr: 0.503 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
5.787IleAla: 5.787 ± 0.52
0.252IleCys: 0.252 ± 0.119
4.277IleAsp: 4.277 ± 0.573
2.201IleGlu: 2.201 ± 0.387
0.943IlePhe: 0.943 ± 0.279
4.717IleGly: 4.717 ± 0.704
0.755IleHis: 0.755 ± 0.228
1.572IleIle: 1.572 ± 0.373
1.761IleLys: 1.761 ± 0.41
2.767IleLeu: 2.767 ± 0.444
0.252IleMet: 0.252 ± 0.117
1.069IleAsn: 1.069 ± 0.271
3.208IlePro: 3.208 ± 0.452
0.818IleGln: 0.818 ± 0.233
4.151IleArg: 4.151 ± 0.602
2.39IleSer: 2.39 ± 0.484
3.522IleThr: 3.522 ± 0.433
4.151IleVal: 4.151 ± 0.432
0.44IleTrp: 0.44 ± 0.144
1.006IleTyr: 1.006 ± 0.224
0.0IleXaa: 0.0 ± 0.0
Lys
3.648LysAla: 3.648 ± 0.517
0.189LysCys: 0.189 ± 0.112
1.887LysAsp: 1.887 ± 0.406
1.258LysGlu: 1.258 ± 0.263
1.132LysPhe: 1.132 ± 0.313
2.453LysGly: 2.453 ± 0.472
0.44LysHis: 0.44 ± 0.211
1.51LysIle: 1.51 ± 0.333
2.076LysLys: 2.076 ± 0.362
3.145LysLeu: 3.145 ± 0.505
0.818LysMet: 0.818 ± 0.254
0.818LysAsn: 0.818 ± 0.282
2.893LysPro: 2.893 ± 0.331
1.195LysGln: 1.195 ± 0.302
2.201LysArg: 2.201 ± 0.319
1.95LysSer: 1.95 ± 0.324
2.39LysThr: 2.39 ± 0.422
2.705LysVal: 2.705 ± 0.402
0.692LysTrp: 0.692 ± 0.15
0.818LysTyr: 0.818 ± 0.277
0.0LysXaa: 0.0 ± 0.0
Leu
11.384LeuAla: 11.384 ± 0.905
0.818LeuCys: 0.818 ± 0.214
5.912LeuAsp: 5.912 ± 0.819
3.271LeuGlu: 3.271 ± 0.555
2.138LeuPhe: 2.138 ± 0.429
7.17LeuGly: 7.17 ± 0.763
1.069LeuHis: 1.069 ± 0.246
3.396LeuIle: 3.396 ± 0.459
1.887LeuLys: 1.887 ± 0.388
4.843LeuLeu: 4.843 ± 0.704
2.013LeuMet: 2.013 ± 0.371
2.138LeuAsn: 2.138 ± 0.361
4.403LeuPro: 4.403 ± 0.496
1.95LeuGln: 1.95 ± 0.381
5.095LeuArg: 5.095 ± 0.498
5.095LeuSer: 5.095 ± 0.668
5.409LeuThr: 5.409 ± 0.629
6.038LeuVal: 6.038 ± 0.626
2.138LeuTrp: 2.138 ± 0.385
1.384LeuTyr: 1.384 ± 0.223
0.0LeuXaa: 0.0 ± 0.0
Met
3.459MetAla: 3.459 ± 0.579
0.252MetCys: 0.252 ± 0.129
0.692MetAsp: 0.692 ± 0.202
0.755MetGlu: 0.755 ± 0.192
0.503MetPhe: 0.503 ± 0.165
1.51MetGly: 1.51 ± 0.331
0.503MetHis: 0.503 ± 0.194
1.006MetIle: 1.006 ± 0.251
0.566MetLys: 0.566 ± 0.194
1.824MetLeu: 1.824 ± 0.38
0.189MetMet: 0.189 ± 0.161
0.377MetAsn: 0.377 ± 0.141
2.076MetPro: 2.076 ± 0.364
0.692MetGln: 0.692 ± 0.268
2.138MetArg: 2.138 ± 0.5
1.95MetSer: 1.95 ± 0.405
2.327MetThr: 2.327 ± 0.466
0.692MetVal: 0.692 ± 0.226
0.629MetTrp: 0.629 ± 0.22
0.314MetTyr: 0.314 ± 0.15
0.0MetXaa: 0.0 ± 0.0
Asn
2.201AsnAla: 2.201 ± 0.385
0.314AsnCys: 0.314 ± 0.146
2.076AsnAsp: 2.076 ± 0.307
1.195AsnGlu: 1.195 ± 0.278
0.377AsnPhe: 0.377 ± 0.152
3.459AsnGly: 3.459 ± 0.488
0.692AsnHis: 0.692 ± 0.178
1.069AsnIle: 1.069 ± 0.324
0.881AsnLys: 0.881 ± 0.252
2.138AsnLeu: 2.138 ± 0.363
0.503AsnMet: 0.503 ± 0.178
0.629AsnAsn: 0.629 ± 0.193
3.082AsnPro: 3.082 ± 0.436
0.943AsnGln: 0.943 ± 0.201
1.95AsnArg: 1.95 ± 0.431
1.635AsnSer: 1.635 ± 0.367
2.201AsnThr: 2.201 ± 0.397
1.95AsnVal: 1.95 ± 0.399
0.44AsnTrp: 0.44 ± 0.141
0.818AsnTyr: 0.818 ± 0.23
0.0AsnXaa: 0.0 ± 0.0
Pro
7.044ProAla: 7.044 ± 0.837
0.881ProCys: 0.881 ± 0.312
3.9ProAsp: 3.9 ± 0.411
4.088ProGlu: 4.088 ± 0.6
1.761ProPhe: 1.761 ± 0.312
5.095ProGly: 5.095 ± 0.546
1.384ProHis: 1.384 ± 0.315
3.334ProIle: 3.334 ± 0.479
2.83ProLys: 2.83 ± 0.383
3.019ProLeu: 3.019 ± 0.44
1.635ProMet: 1.635 ± 0.506
2.138ProAsn: 2.138 ± 0.333
3.208ProPro: 3.208 ± 0.576
1.572ProGln: 1.572 ± 0.264
3.522ProArg: 3.522 ± 0.45
3.459ProSer: 3.459 ± 0.339
4.025ProThr: 4.025 ± 0.584
4.088ProVal: 4.088 ± 0.554
1.069ProTrp: 1.069 ± 0.28
1.195ProTyr: 1.195 ± 0.303
0.0ProXaa: 0.0 ± 0.0
Gln
3.396GlnAla: 3.396 ± 0.475
0.189GlnCys: 0.189 ± 0.103
1.069GlnAsp: 1.069 ± 0.274
1.51GlnGlu: 1.51 ± 0.346
0.818GlnPhe: 0.818 ± 0.238
2.264GlnGly: 2.264 ± 0.427
1.069GlnHis: 1.069 ± 0.316
1.258GlnIle: 1.258 ± 0.219
0.818GlnLys: 0.818 ± 0.196
3.208GlnLeu: 3.208 ± 0.46
0.881GlnMet: 0.881 ± 0.219
0.943GlnAsn: 0.943 ± 0.197
2.516GlnPro: 2.516 ± 0.436
1.51GlnGln: 1.51 ± 0.396
3.082GlnArg: 3.082 ± 0.382
1.761GlnSer: 1.761 ± 0.367
2.201GlnThr: 2.201 ± 0.402
2.39GlnVal: 2.39 ± 0.486
0.818GlnTrp: 0.818 ± 0.297
0.943GlnTyr: 0.943 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
8.114ArgAla: 8.114 ± 0.883
0.818ArgCys: 0.818 ± 0.229
5.661ArgAsp: 5.661 ± 0.616
4.088ArgGlu: 4.088 ± 0.528
2.327ArgPhe: 2.327 ± 0.386
6.227ArgGly: 6.227 ± 0.761
2.076ArgHis: 2.076 ± 0.462
4.151ArgIle: 4.151 ± 0.614
2.264ArgLys: 2.264 ± 0.334
5.346ArgLeu: 5.346 ± 0.481
2.39ArgMet: 2.39 ± 0.367
2.327ArgAsn: 2.327 ± 0.303
3.459ArgPro: 3.459 ± 0.484
2.453ArgGln: 2.453 ± 0.479
6.667ArgArg: 6.667 ± 0.876
3.648ArgSer: 3.648 ± 0.493
4.403ArgThr: 4.403 ± 0.524
5.283ArgVal: 5.283 ± 0.609
1.384ArgTrp: 1.384 ± 0.289
1.321ArgTyr: 1.321 ± 0.247
0.0ArgXaa: 0.0 ± 0.0
Ser
6.856SerAla: 6.856 ± 0.854
0.189SerCys: 0.189 ± 0.131
3.145SerAsp: 3.145 ± 0.409
3.648SerGlu: 3.648 ± 0.681
1.635SerPhe: 1.635 ± 0.323
5.912SerGly: 5.912 ± 0.763
0.629SerHis: 0.629 ± 0.17
2.956SerIle: 2.956 ± 0.388
1.572SerLys: 1.572 ± 0.285
3.396SerLeu: 3.396 ± 0.513
1.51SerMet: 1.51 ± 0.294
2.138SerAsn: 2.138 ± 0.472
2.83SerPro: 2.83 ± 0.484
1.069SerGln: 1.069 ± 0.305
4.214SerArg: 4.214 ± 0.585
2.076SerSer: 2.076 ± 0.451
4.088SerThr: 4.088 ± 0.54
4.654SerVal: 4.654 ± 0.573
1.258SerTrp: 1.258 ± 0.331
1.006SerTyr: 1.006 ± 0.226
0.0SerXaa: 0.0 ± 0.0
Thr
7.296ThrAla: 7.296 ± 0.744
0.377ThrCys: 0.377 ± 0.137
4.969ThrAsp: 4.969 ± 0.598
4.151ThrGlu: 4.151 ± 0.561
2.453ThrPhe: 2.453 ± 0.556
5.283ThrGly: 5.283 ± 0.709
1.51ThrHis: 1.51 ± 0.329
2.767ThrIle: 2.767 ± 0.445
2.579ThrLys: 2.579 ± 0.433
5.535ThrLeu: 5.535 ± 0.705
1.258ThrMet: 1.258 ± 0.235
2.076ThrAsn: 2.076 ± 0.435
3.9ThrPro: 3.9 ± 0.53
2.138ThrGln: 2.138 ± 0.44
3.9ThrArg: 3.9 ± 0.485
3.585ThrSer: 3.585 ± 0.591
4.088ThrThr: 4.088 ± 0.629
5.912ThrVal: 5.912 ± 0.582
1.195ThrTrp: 1.195 ± 0.272
1.258ThrTyr: 1.258 ± 0.307
0.0ThrXaa: 0.0 ± 0.0
Val
9.497ValAla: 9.497 ± 0.945
1.132ValCys: 1.132 ± 0.326
6.415ValAsp: 6.415 ± 0.721
4.529ValGlu: 4.529 ± 0.551
1.824ValPhe: 1.824 ± 0.439
5.724ValGly: 5.724 ± 0.671
1.321ValHis: 1.321 ± 0.291
4.466ValIle: 4.466 ± 0.544
2.327ValLys: 2.327 ± 0.392
4.78ValLeu: 4.78 ± 0.449
1.698ValMet: 1.698 ± 0.383
1.887ValAsn: 1.887 ± 0.311
3.9ValPro: 3.9 ± 0.501
2.516ValGln: 2.516 ± 0.439
5.849ValArg: 5.849 ± 0.676
3.522ValSer: 3.522 ± 0.606
6.164ValThr: 6.164 ± 0.78
7.485ValVal: 7.485 ± 0.731
1.51ValTrp: 1.51 ± 0.373
2.327ValTyr: 2.327 ± 0.37
0.0ValXaa: 0.0 ± 0.0
Trp
1.824TrpAla: 1.824 ± 0.361
0.252TrpCys: 0.252 ± 0.14
1.132TrpAsp: 1.132 ± 0.365
0.755TrpGlu: 0.755 ± 0.186
0.503TrpPhe: 0.503 ± 0.217
0.818TrpGly: 0.818 ± 0.249
0.566TrpHis: 0.566 ± 0.198
0.629TrpIle: 0.629 ± 0.184
1.006TrpLys: 1.006 ± 0.231
1.887TrpLeu: 1.887 ± 0.402
0.44TrpMet: 0.44 ± 0.175
1.006TrpAsn: 1.006 ± 0.36
1.384TrpPro: 1.384 ± 0.285
0.566TrpGln: 0.566 ± 0.187
1.698TrpArg: 1.698 ± 0.27
1.384TrpSer: 1.384 ± 0.348
1.447TrpThr: 1.447 ± 0.262
1.761TrpVal: 1.761 ± 0.287
0.566TrpTrp: 0.566 ± 0.186
0.566TrpTyr: 0.566 ± 0.203
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.327TyrAla: 2.327 ± 0.34
0.063TyrCys: 0.063 ± 0.069
1.006TyrAsp: 1.006 ± 0.216
1.635TyrGlu: 1.635 ± 0.379
0.314TyrPhe: 0.314 ± 0.148
2.39TyrGly: 2.39 ± 0.399
0.755TyrHis: 0.755 ± 0.229
0.818TyrIle: 0.818 ± 0.312
0.943TyrLys: 0.943 ± 0.253
1.384TyrLeu: 1.384 ± 0.309
0.566TyrMet: 0.566 ± 0.209
0.755TyrAsn: 0.755 ± 0.189
1.006TyrPro: 1.006 ± 0.238
0.566TyrGln: 0.566 ± 0.271
2.327TyrArg: 2.327 ± 0.448
1.321TyrSer: 1.321 ± 0.28
1.698TyrThr: 1.698 ± 0.408
1.51TyrVal: 1.51 ± 0.309
0.377TyrTrp: 0.377 ± 0.139
0.566TyrTyr: 0.566 ± 0.227
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (15900 amino acids)
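Each entry in the table above ends with a label of the form `AlaAla: 17.297 ± 1.383`. A minimal sketch for reading such entries into numbers is shown below; the function name `parse_entry` and the regular expression are assumptions for illustration, and the CSV file mentioned further down is likely the more convenient source for bulk processing.

```python
import re

# First residue, second residue, value, error (both numbers in per mille).
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_entry(line):
    """Return (first, second, value, error), or None for non-entry lines."""
    match = ENTRY.search(line)
    if match is None:
        return None  # e.g. a row label such as "Ala"
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


print(parse_entry("AlaAla: 17.297 ± 1.383"))
print(parse_entry("Ala"))
```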

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
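Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic on each resample. The sketch below illustrates the idea for a single dipeptide. It is not the Proteome-pI code: the function names, the fixed seed, the toy sequences, and the use of the standard deviation across resamples as the reported ± value are all assumptions.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over all adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    # Assumption: the error is the standard deviation across resamples.
    return statistics.stdev(estimates)


toy = ["MAAAL", "MKAAV", "MGLSD", "MAAGA"]  # hypothetical sequences
point = dipeptide_permille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {point:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, so the estimate reflects variation between proteins.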

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski