Amino acid dipepetide frequency for Stenotrophomonas phage vB_SmaS-DLP_6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.843AlaAla: 7.843 ± 0.5
1.046AlaCys: 1.046 ± 0.153
4.887AlaAsp: 4.887 ± 0.375
5.309AlaGlu: 5.309 ± 0.366
3.439AlaPhe: 3.439 ± 0.247
6.475AlaGly: 6.475 ± 0.39
1.347AlaHis: 1.347 ± 0.189
4.444AlaIle: 4.444 ± 0.312
5.168AlaLys: 5.168 ± 0.374
6.395AlaLeu: 6.395 ± 0.401
2.554AlaMet: 2.554 ± 0.228
4.364AlaAsn: 4.364 ± 0.561
3.66AlaPro: 3.66 ± 0.391
2.976AlaGln: 2.976 ± 0.262
4.505AlaArg: 4.505 ± 0.339
4.605AlaSer: 4.605 ± 0.301
5.631AlaThr: 5.631 ± 0.659
5.53AlaVal: 5.53 ± 0.47
0.824AlaTrp: 0.824 ± 0.136
2.996AlaTyr: 2.996 ± 0.232
0.0AlaXaa: 0.0 ± 0.0
Cys
0.724CysAla: 0.724 ± 0.138
0.201CysCys: 0.201 ± 0.065
1.066CysAsp: 1.066 ± 0.167
0.583CysGlu: 0.583 ± 0.123
0.302CysPhe: 0.302 ± 0.069
0.845CysGly: 0.845 ± 0.136
0.282CysHis: 0.282 ± 0.076
0.804CysIle: 0.804 ± 0.133
0.543CysLys: 0.543 ± 0.118
0.523CysLeu: 0.523 ± 0.114
0.201CysMet: 0.201 ± 0.062
0.282CysAsn: 0.282 ± 0.081
0.422CysPro: 0.422 ± 0.107
0.261CysGln: 0.261 ± 0.061
0.422CysArg: 0.422 ± 0.082
0.644CysSer: 0.644 ± 0.117
0.302CysThr: 0.302 ± 0.075
0.704CysVal: 0.704 ± 0.114
0.08CysTrp: 0.08 ± 0.036
0.241CysTyr: 0.241 ± 0.064
0.0CysXaa: 0.0 ± 0.0
Asp
5.369AspAla: 5.369 ± 0.367
0.563AspCys: 0.563 ± 0.115
4.143AspAsp: 4.143 ± 0.357
4.525AspGlu: 4.525 ± 0.355
3.218AspPhe: 3.218 ± 0.28
4.685AspGly: 4.685 ± 0.31
1.146AspHis: 1.146 ± 0.179
3.439AspIle: 3.439 ± 0.245
4.062AspLys: 4.062 ± 0.319
5.269AspLeu: 5.269 ± 0.358
1.729AspMet: 1.729 ± 0.223
2.433AspAsn: 2.433 ± 0.228
3.037AspPro: 3.037 ± 0.229
1.971AspGln: 1.971 ± 0.222
2.876AspArg: 2.876 ± 0.292
3.559AspSer: 3.559 ± 0.345
3.74AspThr: 3.74 ± 0.551
4.444AspVal: 4.444 ± 0.336
1.126AspTrp: 1.126 ± 0.153
2.433AspTyr: 2.433 ± 0.212
0.0AspXaa: 0.0 ± 0.0
Glu
5.349GluAla: 5.349 ± 0.363
0.623GluCys: 0.623 ± 0.113
4.203GluAsp: 4.203 ± 0.353
4.545GluGlu: 4.545 ± 0.411
3.218GluPhe: 3.218 ± 0.23
4.042GluGly: 4.042 ± 0.317
1.669GluHis: 1.669 ± 0.206
3.378GluIle: 3.378 ± 0.285
4.183GluLys: 4.183 ± 0.35
6.053GluLeu: 6.053 ± 0.388
1.971GluMet: 1.971 ± 0.202
2.675GluAsn: 2.675 ± 0.227
1.669GluPro: 1.669 ± 0.216
2.916GluGln: 2.916 ± 0.226
3.499GluArg: 3.499 ± 0.3
3.218GluSer: 3.218 ± 0.259
2.916GluThr: 2.916 ± 0.236
4.324GluVal: 4.324 ± 0.367
0.945GluTrp: 0.945 ± 0.137
2.433GluTyr: 2.433 ± 0.229
0.0GluXaa: 0.0 ± 0.0
Phe
3.459PheAla: 3.459 ± 0.279
0.523PheCys: 0.523 ± 0.111
3.559PheAsp: 3.559 ± 0.244
2.534PheGlu: 2.534 ± 0.202
1.931PhePhe: 1.931 ± 0.184
2.936PheGly: 2.936 ± 0.286
0.865PheHis: 0.865 ± 0.141
1.951PheIle: 1.951 ± 0.183
2.976PheLys: 2.976 ± 0.259
3.378PheLeu: 3.378 ± 0.272
1.086PheMet: 1.086 ± 0.152
2.333PheAsn: 2.333 ± 0.231
1.448PhePro: 1.448 ± 0.171
1.307PheGln: 1.307 ± 0.156
1.77PheArg: 1.77 ± 0.256
2.755PheSer: 2.755 ± 0.231
2.755PheThr: 2.755 ± 0.274
3.378PheVal: 3.378 ± 0.279
0.442PheTrp: 0.442 ± 0.091
1.89PheTyr: 1.89 ± 0.207
0.0PheXaa: 0.0 ± 0.0
Gly
6.234GlyAla: 6.234 ± 0.653
0.664GlyCys: 0.664 ± 0.118
4.223GlyAsp: 4.223 ± 0.291
4.122GlyGlu: 4.122 ± 0.318
3.137GlyPhe: 3.137 ± 0.249
5.812GlyGly: 5.812 ± 0.721
1.186GlyHis: 1.186 ± 0.197
3.801GlyIle: 3.801 ± 0.319
4.786GlyLys: 4.786 ± 0.344
4.303GlyLeu: 4.303 ± 0.384
1.91GlyMet: 1.91 ± 0.215
3.539GlyAsn: 3.539 ± 0.513
1.649GlyPro: 1.649 ± 0.29
1.951GlyGln: 1.951 ± 0.263
3.117GlyArg: 3.117 ± 0.262
4.484GlySer: 4.484 ± 0.467
4.826GlyThr: 4.826 ± 0.527
4.907GlyVal: 4.907 ± 0.274
1.126GlyTrp: 1.126 ± 0.141
3.077GlyTyr: 3.077 ± 0.219
0.0GlyXaa: 0.0 ± 0.0
His
1.347HisAla: 1.347 ± 0.17
0.161HisCys: 0.161 ± 0.061
1.548HisAsp: 1.548 ± 0.188
1.247HisGlu: 1.247 ± 0.209
1.026HisPhe: 1.026 ± 0.145
1.247HisGly: 1.247 ± 0.147
0.704HisHis: 0.704 ± 0.196
1.207HisIle: 1.207 ± 0.175
1.408HisLys: 1.408 ± 0.18
1.709HisLeu: 1.709 ± 0.217
0.623HisMet: 0.623 ± 0.109
0.885HisAsn: 0.885 ± 0.124
1.106HisPro: 1.106 ± 0.178
0.764HisGln: 0.764 ± 0.132
1.046HisArg: 1.046 ± 0.153
1.046HisSer: 1.046 ± 0.159
0.925HisThr: 0.925 ± 0.159
1.569HisVal: 1.569 ± 0.195
0.261HisTrp: 0.261 ± 0.071
1.086HisTyr: 1.086 ± 0.168
0.0HisXaa: 0.0 ± 0.0
Ile
5.208IleAla: 5.208 ± 0.327
0.563IleCys: 0.563 ± 0.103
4.223IleAsp: 4.223 ± 0.282
3.559IleGlu: 3.559 ± 0.282
2.031IlePhe: 2.031 ± 0.239
3.66IleGly: 3.66 ± 0.312
1.247IleHis: 1.247 ± 0.196
3.077IleIle: 3.077 ± 0.279
4.062IleLys: 4.062 ± 0.278
4.324IleLeu: 4.324 ± 0.345
1.247IleMet: 1.247 ± 0.183
2.976IleAsn: 2.976 ± 0.211
2.353IlePro: 2.353 ± 0.209
2.192IleGln: 2.192 ± 0.204
2.614IleArg: 2.614 ± 0.215
3.137IleSer: 3.137 ± 0.261
3.781IleThr: 3.781 ± 0.303
4.484IleVal: 4.484 ± 0.333
0.342IleTrp: 0.342 ± 0.091
1.75IleTyr: 1.75 ± 0.173
0.0IleXaa: 0.0 ± 0.0
Lys
4.866LysAla: 4.866 ± 0.363
0.282LysCys: 0.282 ± 0.087
4.102LysAsp: 4.102 ± 0.365
4.565LysGlu: 4.565 ± 0.368
2.835LysPhe: 2.835 ± 0.287
3.801LysGly: 3.801 ± 0.339
1.89LysHis: 1.89 ± 0.242
3.901LysIle: 3.901 ± 0.297
5.57LysLys: 5.57 ± 0.498
5.53LysLeu: 5.53 ± 0.37
2.453LysMet: 2.453 ± 0.257
3.218LysAsn: 3.218 ± 0.275
3.117LysPro: 3.117 ± 0.309
2.192LysGln: 2.192 ± 0.237
3.218LysArg: 3.218 ± 0.305
3.459LysSer: 3.459 ± 0.366
3.398LysThr: 3.398 ± 0.251
4.303LysVal: 4.303 ± 0.275
0.985LysTrp: 0.985 ± 0.145
2.534LysTyr: 2.534 ± 0.236
0.0LysXaa: 0.0 ± 0.0
Leu
5.952LeuAla: 5.952 ± 0.403
0.905LeuCys: 0.905 ± 0.162
5.047LeuAsp: 5.047 ± 0.371
4.866LeuGlu: 4.866 ± 0.4
2.876LeuPhe: 2.876 ± 0.203
4.786LeuGly: 4.786 ± 0.386
1.689LeuHis: 1.689 ± 0.206
4.404LeuIle: 4.404 ± 0.284
5.55LeuLys: 5.55 ± 0.428
5.671LeuLeu: 5.671 ± 0.432
2.393LeuMet: 2.393 ± 0.218
3.841LeuAsn: 3.841 ± 0.273
3.74LeuPro: 3.74 ± 0.316
3.077LeuGln: 3.077 ± 0.201
4.645LeuArg: 4.645 ± 0.295
5.068LeuSer: 5.068 ± 0.364
5.047LeuThr: 5.047 ± 0.337
5.108LeuVal: 5.108 ± 0.359
0.945LeuTrp: 0.945 ± 0.146
2.514LeuTyr: 2.514 ± 0.243
0.0LeuXaa: 0.0 ± 0.0
Met
2.091MetAla: 2.091 ± 0.195
0.322MetCys: 0.322 ± 0.085
1.146MetAsp: 1.146 ± 0.169
1.468MetGlu: 1.468 ± 0.148
0.945MetPhe: 0.945 ± 0.128
1.287MetGly: 1.287 ± 0.161
0.523MetHis: 0.523 ± 0.104
1.569MetIle: 1.569 ± 0.196
1.729MetLys: 1.729 ± 0.18
2.152MetLeu: 2.152 ± 0.251
0.784MetMet: 0.784 ± 0.131
1.428MetAsn: 1.428 ± 0.145
1.146MetPro: 1.146 ± 0.16
1.086MetGln: 1.086 ± 0.197
1.146MetArg: 1.146 ± 0.168
2.614MetSer: 2.614 ± 0.212
2.333MetThr: 2.333 ± 0.226
1.609MetVal: 1.609 ± 0.198
0.422MetTrp: 0.422 ± 0.087
1.186MetTyr: 1.186 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
3.66AsnAla: 3.66 ± 0.313
0.463AsnCys: 0.463 ± 0.112
2.393AsnAsp: 2.393 ± 0.198
2.996AsnGlu: 2.996 ± 0.249
1.991AsnPhe: 1.991 ± 0.221
3.6AsnGly: 3.6 ± 0.347
1.046AsnHis: 1.046 ± 0.158
2.735AsnIle: 2.735 ± 0.247
3.016AsnLys: 3.016 ± 0.257
3.579AsnLeu: 3.579 ± 0.335
0.804AsnMet: 0.804 ± 0.136
2.353AsnAsn: 2.353 ± 0.281
2.433AsnPro: 2.433 ± 0.206
1.729AsnGln: 1.729 ± 0.215
3.057AsnArg: 3.057 ± 0.371
2.936AsnSer: 2.936 ± 0.32
3.921AsnThr: 3.921 ± 0.668
3.197AsnVal: 3.197 ± 0.293
0.704AsnTrp: 0.704 ± 0.126
1.709AsnTyr: 1.709 ± 0.2
0.0AsnXaa: 0.0 ± 0.0
Pro
3.579ProAla: 3.579 ± 0.304
0.302ProCys: 0.302 ± 0.082
2.494ProAsp: 2.494 ± 0.211
3.137ProGlu: 3.137 ± 0.286
1.729ProPhe: 1.729 ± 0.187
2.735ProGly: 2.735 ± 0.288
0.764ProHis: 0.764 ± 0.127
2.292ProIle: 2.292 ± 0.229
2.554ProLys: 2.554 ± 0.24
2.272ProLeu: 2.272 ± 0.185
0.905ProMet: 0.905 ± 0.121
1.488ProAsn: 1.488 ± 0.162
1.729ProPro: 1.729 ± 0.483
1.468ProGln: 1.468 ± 0.146
1.971ProArg: 1.971 ± 0.226
2.413ProSer: 2.413 ± 0.28
2.815ProThr: 2.815 ± 0.231
2.795ProVal: 2.795 ± 0.239
0.603ProTrp: 0.603 ± 0.104
1.77ProTyr: 1.77 ± 0.209
0.0ProXaa: 0.0 ± 0.0
Gln
2.976GlnAla: 2.976 ± 0.201
0.362GlnCys: 0.362 ± 0.084
1.448GlnAsp: 1.448 ± 0.179
2.333GlnGlu: 2.333 ± 0.242
1.689GlnPhe: 1.689 ± 0.175
2.051GlnGly: 2.051 ± 0.218
0.724GlnHis: 0.724 ± 0.137
2.091GlnIle: 2.091 ± 0.239
2.373GlnLys: 2.373 ± 0.248
4.122GlnLeu: 4.122 ± 0.308
0.804GlnMet: 0.804 ± 0.142
1.589GlnAsn: 1.589 ± 0.163
1.186GlnPro: 1.186 ± 0.142
1.488GlnGln: 1.488 ± 0.184
2.132GlnArg: 2.132 ± 0.19
1.89GlnSer: 1.89 ± 0.222
1.79GlnThr: 1.79 ± 0.191
2.313GlnVal: 2.313 ± 0.223
0.503GlnTrp: 0.503 ± 0.12
1.629GlnTyr: 1.629 ± 0.165
0.0GlnXaa: 0.0 ± 0.0
Arg
4.585ArgAla: 4.585 ± 0.317
0.623ArgCys: 0.623 ± 0.113
3.559ArgAsp: 3.559 ± 0.255
3.459ArgGlu: 3.459 ± 0.296
2.252ArgPhe: 2.252 ± 0.23
3.72ArgGly: 3.72 ± 0.223
0.985ArgHis: 0.985 ± 0.152
3.077ArgIle: 3.077 ± 0.262
3.238ArgLys: 3.238 ± 0.279
4.404ArgLeu: 4.404 ± 0.324
1.428ArgMet: 1.428 ± 0.183
2.333ArgAsn: 2.333 ± 0.26
1.388ArgPro: 1.388 ± 0.173
1.669ArgGln: 1.669 ± 0.217
2.916ArgArg: 2.916 ± 0.266
2.514ArgSer: 2.514 ± 0.256
2.594ArgThr: 2.594 ± 0.242
3.66ArgVal: 3.66 ± 0.283
0.865ArgTrp: 0.865 ± 0.143
2.111ArgTyr: 2.111 ± 0.192
0.0ArgXaa: 0.0 ± 0.0
Ser
4.927SerAla: 4.927 ± 0.448
0.442SerCys: 0.442 ± 0.102
3.338SerAsp: 3.338 ± 0.224
3.439SerGlu: 3.439 ± 0.247
2.956SerPhe: 2.956 ± 0.29
4.303SerGly: 4.303 ± 0.421
1.186SerHis: 1.186 ± 0.163
3.519SerIle: 3.519 ± 0.284
3.74SerLys: 3.74 ± 0.318
4.746SerLeu: 4.746 ± 0.4
1.569SerMet: 1.569 ± 0.157
3.218SerAsn: 3.218 ± 0.335
2.252SerPro: 2.252 ± 0.211
1.971SerGln: 1.971 ± 0.179
2.634SerArg: 2.634 ± 0.333
3.861SerSer: 3.861 ± 0.45
3.74SerThr: 3.74 ± 0.379
4.002SerVal: 4.002 ± 0.299
0.905SerTrp: 0.905 ± 0.165
2.413SerTyr: 2.413 ± 0.245
0.0SerXaa: 0.0 ± 0.0
Thr
5.57ThrAla: 5.57 ± 0.547
0.402ThrCys: 0.402 ± 0.095
3.7ThrAsp: 3.7 ± 0.325
3.519ThrGlu: 3.519 ± 0.282
3.016ThrPhe: 3.016 ± 0.411
5.168ThrGly: 5.168 ± 0.751
1.166ThrHis: 1.166 ± 0.163
3.861ThrIle: 3.861 ± 0.365
3.499ThrLys: 3.499 ± 0.326
4.505ThrLeu: 4.505 ± 0.328
1.488ThrMet: 1.488 ± 0.166
3.338ThrAsn: 3.338 ± 0.459
3.137ThrPro: 3.137 ± 0.317
1.991ThrGln: 1.991 ± 0.204
2.695ThrArg: 2.695 ± 0.223
3.479ThrSer: 3.479 ± 0.538
4.806ThrThr: 4.806 ± 1.043
4.042ThrVal: 4.042 ± 0.43
0.623ThrTrp: 0.623 ± 0.12
2.473ThrTyr: 2.473 ± 0.261
0.0ThrXaa: 0.0 ± 0.0
Val
6.314ValAla: 6.314 ± 0.407
0.442ValCys: 0.442 ± 0.096
4.404ValAsp: 4.404 ± 0.355
4.102ValGlu: 4.102 ± 0.311
2.594ValPhe: 2.594 ± 0.263
4.283ValGly: 4.283 ± 0.277
1.247ValHis: 1.247 ± 0.161
4.082ValIle: 4.082 ± 0.263
4.324ValLys: 4.324 ± 0.333
5.249ValLeu: 5.249 ± 0.28
1.649ValMet: 1.649 ± 0.176
3.238ValAsn: 3.238 ± 0.273
2.856ValPro: 2.856 ± 0.264
2.473ValGln: 2.473 ± 0.201
4.223ValArg: 4.223 ± 0.287
4.283ValSer: 4.283 ± 0.407
4.364ValThr: 4.364 ± 0.399
5.389ValVal: 5.389 ± 0.432
0.885ValTrp: 0.885 ± 0.117
2.594ValTyr: 2.594 ± 0.195
0.0ValXaa: 0.0 ± 0.0
Trp
0.865TrpAla: 0.865 ± 0.129
0.181TrpCys: 0.181 ± 0.061
0.985TrpAsp: 0.985 ± 0.141
0.764TrpGlu: 0.764 ± 0.123
0.583TrpPhe: 0.583 ± 0.11
0.684TrpGly: 0.684 ± 0.103
0.463TrpHis: 0.463 ± 0.106
0.764TrpIle: 0.764 ± 0.127
1.106TrpLys: 1.106 ± 0.159
1.166TrpLeu: 1.166 ± 0.151
0.523TrpMet: 0.523 ± 0.112
0.623TrpAsn: 0.623 ± 0.102
0.0TrpPro: 0.0 ± 0.0
0.583TrpGln: 0.583 ± 0.098
0.724TrpArg: 0.724 ± 0.106
1.066TrpSer: 1.066 ± 0.146
0.704TrpThr: 0.704 ± 0.116
0.824TrpVal: 0.824 ± 0.13
0.362TrpTrp: 0.362 ± 0.086
0.442TrpTyr: 0.442 ± 0.088
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.077TyrAla: 3.077 ± 0.293
0.362TyrCys: 0.362 ± 0.074
3.298TyrAsp: 3.298 ± 0.221
2.755TyrGlu: 2.755 ± 0.249
1.508TyrPhe: 1.508 ± 0.157
2.735TyrGly: 2.735 ± 0.213
0.784TyrHis: 0.784 ± 0.136
2.433TyrIle: 2.433 ± 0.222
2.353TyrLys: 2.353 ± 0.234
2.775TyrLeu: 2.775 ± 0.248
0.845TyrMet: 0.845 ± 0.153
2.172TyrAsn: 2.172 ± 0.204
1.488TyrPro: 1.488 ± 0.185
1.428TyrGln: 1.428 ± 0.189
2.172TyrArg: 2.172 ± 0.192
2.132TyrSer: 2.132 ± 0.198
2.091TyrThr: 2.091 ± 0.247
2.413TyrVal: 2.413 ± 0.223
0.463TyrTrp: 0.463 ± 0.09
1.428TyrTyr: 1.428 ± 0.184
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 241 proteins (49729 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski