Amino acid dipeptide frequency for Streptomyces phage Kromp

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
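The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the one-letter residue codes, and the toy sequences are assumptions made for the example, and the comparison against the uniform expectation of 2.5‰ ignores the error estimate.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard residues, one-letter codes
EXPECTED_PERMILLE = 1000.0 / 400        # uniform expectation: 0.25 % = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is treated as X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def representation(permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if permille > expected:
        return "over"
    if permille < expected:
        return "under"
    return "as expected"


# Made-up toy sequences, only to show the call pattern.
toy = ["MAAAGL", "MKAAL"]
freqs = dipeptide_permille(toy)
label_aa = representation(freqs["AA"])
```

With real data, `proteins` would be the list of protein sequences of the proteome, and dipeptides absent from the result simply have a frequency of zero.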

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
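As a small worked example of the unit conversion, the sketch below turns a per mille value and its error into a plain fraction and a percentage. The helper names are hypothetical; the two numbers are the AlaAla entry from the table (21.903 ± 1.236).

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla entry from the table: 21.903 ± 1.236 per mille.
ala_ala, ala_ala_err = 21.903, 1.236

fraction = permille_to_fraction(ala_ala)          # about 0.0219
fraction_err = permille_to_fraction(ala_ala_err)  # about 0.0012
percent = permille_to_percent(ala_ala)            # about 2.19 %
```

The error scales by the same factor as the value, so the relative uncertainty is unchanged by the conversion.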

For more information see sequence space article on Wikipedia.

| 1st↓ / 2nd→ | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 21.903 ± 1.236 | 1.098 ± 0.231 | 9.112 ± 1.084 | 7.246 ± 0.841 | 2.69 ± 0.58 | 11.308 ± 1.123 | 2.415 ± 0.425 | 4.172 ± 0.503 | 2.964 ± 0.544 | 12.516 ± 0.968 | 2.635 ± 0.376 | 2.251 ± 0.381 | 7.026 ± 0.659 | 4.886 ± 0.769 | 10.704 ± 0.844 | 5.764 ± 0.556 | 7.136 ± 0.73 | 9.716 ± 0.735 | 2.251 ± 0.372 | 2.525 ± 0.421 | 0.0 ± 0.0 |
| Cys | 1.317 ± 0.266 | 0.165 ± 0.091 | 0.878 ± 0.294 | 0.769 ± 0.175 | 0.11 ± 0.068 | 1.372 ± 0.329 | 0.439 ± 0.151 | 0.329 ± 0.148 | 0.165 ± 0.09 | 0.494 ± 0.169 | 0.11 ± 0.07 | 0.165 ± 0.084 | 0.714 ± 0.209 | 0.439 ± 0.184 | 1.098 ± 0.287 | 0.439 ± 0.145 | 1.098 ± 0.247 | 0.274 ± 0.112 | 0.11 ± 0.082 | 0.11 ± 0.063 | 0.0 ± 0.0 |
| Asp | 6.477 ± 0.874 | 0.714 ± 0.201 | 4.062 ± 0.512 | 4.062 ± 0.48 | 0.714 ± 0.231 | 6.477 ± 0.643 | 1.702 ± 0.321 | 1.757 ± 0.312 | 1.647 ± 0.348 | 5.654 ± 0.5 | 0.823 ± 0.181 | 1.098 ± 0.2 | 5.544 ± 0.988 | 2.964 ± 0.396 | 4.172 ± 0.53 | 3.239 ± 0.505 | 4.721 ± 0.711 | 3.897 ± 0.557 | 1.372 ± 0.296 | 1.208 ± 0.307 | 0.0 ± 0.0 |
| Glu | 7.136 ± 0.699 | 0.549 ± 0.193 | 3.129 ± 0.401 | 3.458 ± 0.528 | 1.317 ± 0.275 | 3.568 ± 0.402 | 1.427 ± 0.346 | 2.251 ± 0.355 | 1.153 ± 0.253 | 6.368 ± 0.588 | 1.702 ± 0.281 | 0.988 ± 0.249 | 3.952 ± 0.677 | 2.415 ± 0.381 | 4.666 ± 0.557 | 1.811 ± 0.316 | 3.788 ± 0.485 | 4.007 ± 0.523 | 1.317 ± 0.263 | 1.043 ± 0.236 | 0.0 ± 0.0 |
| Phe | 2.141 ± 0.362 | 0.11 ± 0.075 | 1.208 ± 0.26 | 1.482 ± 0.322 | 0.439 ± 0.148 | 2.36 ± 0.449 | 0.549 ± 0.173 | 0.22 ± 0.111 | 0.439 ± 0.192 | 1.482 ± 0.376 | 0.165 ± 0.083 | 0.329 ± 0.123 | 0.769 ± 0.171 | 0.714 ± 0.211 | 1.592 ± 0.253 | 0.659 ± 0.177 | 2.47 ± 0.376 | 1.757 ± 0.328 | 0.329 ± 0.13 | 0.11 ± 0.06 | 0.0 ± 0.0 |
| Gly | 9.442 ± 0.952 | 0.878 ± 0.243 | 4.282 ± 0.474 | 5.27 ± 0.678 | 1.427 ± 0.26 | 8.179 ± 1.173 | 1.866 ± 0.333 | 3.678 ± 0.594 | 2.415 ± 0.354 | 6.807 ± 0.748 | 1.811 ± 0.261 | 1.702 ± 0.346 | 4.392 ± 0.817 | 4.172 ± 0.374 | 6.807 ± 0.611 | 4.611 ± 0.469 | 6.258 ± 0.652 | 6.093 ± 0.708 | 2.635 ± 0.486 | 2.58 ± 0.248 | 0.0 ± 0.0 |
| His | 2.69 ± 0.434 | 0.11 ± 0.082 | 1.427 ± 0.291 | 1.098 ± 0.306 | 0.549 ± 0.17 | 2.306 ± 0.381 | 0.659 ± 0.192 | 0.878 ± 0.194 | 0.494 ± 0.161 | 2.251 ± 0.364 | 0.165 ± 0.092 | 0.165 ± 0.082 | 1.372 ± 0.257 | 0.988 ± 0.245 | 1.921 ± 0.38 | 1.153 ± 0.242 | 1.592 ± 0.318 | 1.208 ± 0.258 | 0.329 ± 0.149 | 0.329 ± 0.139 | 0.0 ± 0.0 |
| Ile | 4.556 ± 0.732 | 0.549 ± 0.175 | 2.196 ± 0.313 | 2.086 ± 0.313 | 0.384 ± 0.145 | 3.513 ± 0.446 | 0.933 ± 0.261 | 0.878 ± 0.233 | 0.714 ± 0.214 | 2.141 ± 0.362 | 0.329 ± 0.143 | 0.933 ± 0.206 | 2.415 ± 0.362 | 0.933 ± 0.233 | 3.074 ± 0.425 | 1.866 ± 0.365 | 3.568 ± 0.422 | 2.69 ± 0.436 | 0.878 ± 0.217 | 0.769 ± 0.205 | 0.0 ± 0.0 |
| Lys | 3.239 ± 0.55 | 0.22 ± 0.105 | 1.317 ± 0.245 | 1.263 ± 0.265 | 0.329 ± 0.096 | 1.921 ± 0.31 | 0.329 ± 0.138 | 1.208 ± 0.237 | 0.933 ± 0.292 | 1.976 ± 0.389 | 0.22 ± 0.11 | 0.549 ± 0.156 | 1.702 ± 0.355 | 1.153 ± 0.25 | 1.921 ± 0.39 | 1.317 ± 0.332 | 1.482 ± 0.372 | 1.647 ± 0.357 | 0.549 ± 0.142 | 0.604 ± 0.156 | 0.0 ± 0.0 |
| Leu | 12.79 ± 1.052 | 0.769 ± 0.234 | 5.764 ± 0.572 | 5.105 ± 0.467 | 1.976 ± 0.307 | 7.356 ± 0.576 | 1.702 ± 0.329 | 3.184 ± 0.43 | 2.251 ± 0.546 | 7.685 ± 0.593 | 1.263 ± 0.277 | 1.537 ± 0.273 | 5.38 ± 0.51 | 3.294 ± 0.398 | 7.466 ± 0.77 | 3.843 ± 0.647 | 6.917 ± 0.554 | 6.587 ± 0.603 | 1.427 ± 0.291 | 1.921 ± 0.322 | 0.0 ± 0.0 |
| Met | 2.58 ± 0.333 | 0.11 ± 0.071 | 0.769 ± 0.192 | 0.878 ± 0.202 | 0.22 ± 0.108 | 0.878 ± 0.204 | 0.329 ± 0.146 | 0.604 ± 0.185 | 0.769 ± 0.207 | 0.659 ± 0.195 | 0.11 ± 0.075 | 0.274 ± 0.115 | 1.757 ± 0.304 | 0.604 ± 0.156 | 1.866 ± 0.411 | 1.647 ± 0.292 | 1.976 ± 0.378 | 0.823 ± 0.2 | 0.274 ± 0.113 | 0.494 ± 0.188 | 0.0 ± 0.0 |
| Asn | 2.141 ± 0.377 | 0.274 ± 0.146 | 1.043 ± 0.3 | 0.714 ± 0.197 | 0.329 ± 0.111 | 1.976 ± 0.352 | 0.494 ± 0.187 | 0.329 ± 0.124 | 0.329 ± 0.129 | 1.757 ± 0.243 | 0.384 ± 0.153 | 0.439 ± 0.171 | 1.482 ± 0.291 | 0.439 ± 0.129 | 1.647 ± 0.251 | 1.427 ± 0.284 | 1.208 ± 0.262 | 0.988 ± 0.22 | 0.165 ± 0.084 | 0.274 ± 0.09 | 0.0 ± 0.0 |
| Pro | 9.606 ± 1.082 | 0.823 ± 0.241 | 5.105 ± 0.596 | 4.172 ± 0.594 | 0.714 ± 0.176 | 6.258 ± 0.615 | 1.427 ± 0.274 | 1.757 ± 0.337 | 1.317 ± 0.317 | 4.721 ± 0.639 | 1.372 ± 0.253 | 1.317 ± 0.246 | 6.203 ± 0.91 | 2.36 ± 0.369 | 5.434 ± 0.744 | 3.733 ± 0.489 | 4.831 ± 0.605 | 5.544 ± 0.624 | 0.823 ± 0.268 | 0.823 ± 0.202 | 0.0 ± 0.0 |
| Gln | 5.215 ± 0.833 | 0.22 ± 0.104 | 1.702 ± 0.295 | 2.031 ± 0.292 | 0.604 ± 0.177 | 2.251 ± 0.335 | 0.933 ± 0.261 | 1.866 ± 0.355 | 0.823 ± 0.23 | 4.721 ± 0.652 | 1.043 ± 0.25 | 0.714 ± 0.24 | 3.349 ± 0.472 | 2.086 ± 0.387 | 3.294 ± 0.456 | 1.372 ± 0.234 | 1.866 ± 0.336 | 3.733 ± 0.477 | 0.549 ± 0.146 | 0.769 ± 0.185 | 0.0 ± 0.0 |
| Arg | 11.089 ± 0.986 | 1.427 ± 0.295 | 4.831 ± 0.444 | 3.623 ± 0.496 | 2.251 ± 0.425 | 5.544 ± 0.465 | 2.306 ± 0.404 | 3.184 ± 0.444 | 2.415 ± 0.373 | 8.069 ± 0.961 | 1.921 ± 0.362 | 1.153 ± 0.256 | 5.983 ± 0.836 | 3.019 ± 0.468 | 9.442 ± 1.193 | 4.172 ± 0.48 | 6.258 ± 0.635 | 4.995 ± 0.493 | 1.647 ± 0.29 | 1.647 ± 0.367 | 0.0 ± 0.0 |
| Ser | 6.642 ± 0.534 | 0.604 ± 0.163 | 2.745 ± 0.368 | 2.196 ± 0.31 | 1.263 ± 0.302 | 4.886 ± 0.518 | 0.439 ± 0.15 | 1.317 ± 0.265 | 1.153 ± 0.291 | 4.666 ± 0.427 | 0.933 ± 0.189 | 1.043 ± 0.193 | 2.854 ± 0.443 | 2.251 ± 0.361 | 3.349 ± 0.424 | 2.964 ± 0.348 | 4.392 ± 0.501 | 3.733 ± 0.445 | 1.592 ± 0.285 | 0.823 ± 0.217 | 0.0 ± 0.0 |
| Thr | 9.771 ± 0.678 | 0.604 ± 0.198 | 5.38 ± 0.659 | 3.184 ± 0.456 | 1.757 ± 0.354 | 7.356 ± 0.753 | 1.098 ± 0.22 | 3.184 ± 0.421 | 1.372 ± 0.271 | 5.16 ± 0.537 | 0.988 ± 0.224 | 0.988 ± 0.19 | 6.093 ± 0.582 | 2.141 ± 0.335 | 5.819 ± 0.593 | 2.8 ± 0.379 | 6.313 ± 0.625 | 6.917 ± 0.578 | 1.757 ± 0.293 | 1.757 ± 0.324 | 0.0 ± 0.0 |
| Val | 7.356 ± 0.682 | 0.878 ± 0.247 | 4.995 ± 0.495 | 4.721 ± 0.589 | 1.427 ± 0.252 | 4.337 ± 0.655 | 1.921 ± 0.38 | 3.349 ± 0.583 | 1.647 ± 0.314 | 7.466 ± 0.727 | 1.098 ± 0.272 | 0.933 ± 0.27 | 4.666 ± 0.578 | 3.239 ± 0.559 | 6.697 ± 0.668 | 4.392 ± 0.572 | 5.654 ± 0.523 | 5.929 ± 0.656 | 1.372 ± 0.311 | 1.702 ± 0.293 | 0.0 ± 0.0 |
| Trp | 1.811 ± 0.366 | 0.22 ± 0.105 | 0.988 ± 0.231 | 1.263 ± 0.236 | 0.549 ± 0.166 | 1.153 ± 0.234 | 0.439 ± 0.155 | 0.714 ± 0.215 | 0.604 ± 0.234 | 2.251 ± 0.381 | 0.329 ± 0.128 | 0.769 ± 0.234 | 1.537 ± 0.306 | 0.659 ± 0.212 | 2.415 ± 0.416 | 0.878 ± 0.184 | 1.427 ± 0.284 | 1.647 ± 0.277 | 0.878 ± 0.245 | 0.274 ± 0.114 | 0.0 ± 0.0 |
| Tyr | 2.415 ± 0.359 | 0.274 ± 0.128 | 1.372 ± 0.283 | 1.427 ± 0.256 | 0.274 ± 0.117 | 2.306 ± 0.393 | 0.274 ± 0.116 | 0.549 ± 0.226 | 0.274 ± 0.124 | 1.372 ± 0.331 | 0.165 ± 0.084 | 0.494 ± 0.144 | 1.153 ± 0.216 | 0.384 ± 0.175 | 1.592 ± 0.298 | 1.811 ± 0.329 | 1.482 ± 0.289 | 1.592 ± 0.29 | 0.549 ± 0.213 | 0.384 ± 0.133 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 95 proteins (18218 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
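Bootstrapping at the protein level can be illustrated roughly as follows. The page does not specify the exact procedure, so this sketch assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the per mille frequency across those replicates. All function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Made-up toy sequences, only to show the call pattern.
toy = ["MAAAGL", "MKAAL", "MGLLAA", "MAKL"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the residue pairs within each sequence together, which is why the estimate reflects variation between proteins.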

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski