Amino acid dipeptide frequency for Stenotrophomonas phage YB07

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
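As a minimal sketch of how such frequencies can be computed, the Python example below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter alphabet, and the choice to normalize by the total number of counted dipeptides are assumptions made for illustration; the page does not describe the exact procedure used by the database.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Pairs are counted within each protein only, never across protein boundaries.
    Any non-standard letter is mapped to 'X' (Xaa).
    Assumption: frequencies are normalized by the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

# Toy sequences for illustration only, not phage data.
freqs = dipeptide_per_mille(["MAAK", "MKAA"])
```

In the toy input, "AA" occurs in two of the six counted pairs, so it accounts for one third of all dipeptides, far above the uniform expectation.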

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
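The unit conversion can be illustrated with a short Python sketch. The helper names are hypothetical, and comparing against 2.5‰ assumes the uniform expectation over the 400 standard dipeptides is the reference used for over- and underrepresentation.

```python
UNIFORM_PER_MILLE = 2.5  # 0.25% expressed in per mille (1/400 of all dipeptides)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value > expected:
        return "overrepresented"
    if value < expected:
        return "underrepresented"
    return "as expected"


ala_ala = 6.696  # AlaAla value from the table, in per mille
cys_cys = 0.148  # CysCys value from the table, in per mille

fraction = per_mille_to_fraction(ala_ala)  # roughly 0.0067
percent = per_mille_to_percent(ala_ala)    # roughly 0.67 percent
labels = (classify(ala_ala), classify(cys_cys))
```

Under this assumption AlaAla is classified as overrepresented and CysCys as underrepresented.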

For more information, see the sequence space article on Wikipedia.

Each entry is listed as FirstSecond: frequency ± error (‰), grouped by the first residue. Second residues follow the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.
Ala
AlaAla: 6.696 ± 0.488
AlaCys: 0.614 ± 0.107
AlaAsp: 5.043 ± 0.32
AlaGlu: 5.043 ± 0.428
AlaPhe: 2.924 ± 0.249
AlaGly: 5.552 ± 0.395
AlaHis: 1.187 ± 0.17
AlaIle: 4.958 ± 0.325
AlaLys: 3.75 ± 0.316
AlaLeu: 6.717 ± 0.37
AlaMet: 2.182 ± 0.208
AlaAsn: 3.411 ± 0.324
AlaPro: 2.966 ± 0.316
AlaGln: 2.797 ± 0.237
AlaArg: 4.09 ± 0.289
AlaSer: 4.958 ± 0.383
AlaThr: 4.471 ± 0.45
AlaVal: 4.725 ± 0.317
AlaTrp: 1.356 ± 0.173
AlaTyr: 2.818 ± 0.288
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.614 ± 0.098
CysCys: 0.148 ± 0.051
CysAsp: 0.636 ± 0.113
CysGlu: 0.487 ± 0.095
CysPhe: 0.36 ± 0.094
CysGly: 0.614 ± 0.135
CysHis: 0.339 ± 0.093
CysIle: 0.509 ± 0.109
CysLys: 0.509 ± 0.1
CysLeu: 0.699 ± 0.141
CysMet: 0.593 ± 0.108
CysAsn: 0.424 ± 0.112
CysPro: 0.657 ± 0.112
CysGln: 0.445 ± 0.113
CysArg: 0.445 ± 0.109
CysSer: 0.678 ± 0.115
CysThr: 0.509 ± 0.1
CysVal: 0.699 ± 0.13
CysTrp: 0.212 ± 0.063
CysTyr: 0.36 ± 0.091
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.721 ± 0.42
AspCys: 0.509 ± 0.113
AspAsp: 4.174 ± 0.316
AspGlu: 4.64 ± 0.368
AspPhe: 3.221 ± 0.2
AspGly: 5.001 ± 0.329
AspHis: 1.017 ± 0.13
AspIle: 3.369 ± 0.295
AspLys: 3.666 ± 0.319
AspLeu: 6.145 ± 0.405
AspMet: 1.928 ± 0.188
AspAsn: 2.246 ± 0.222
AspPro: 3.666 ± 0.293
AspGln: 2.627 ± 0.266
AspArg: 3.263 ± 0.272
AspSer: 3.899 ± 0.317
AspThr: 3.475 ± 0.315
AspVal: 4.45 ± 0.297
AspTrp: 1.208 ± 0.143
AspTyr: 2.564 ± 0.254
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.403 ± 0.428
GluCys: 0.678 ± 0.129
GluAsp: 4.005 ± 0.384
GluGlu: 4.916 ± 0.445
GluPhe: 2.945 ± 0.266
GluGly: 4.238 ± 0.353
GluHis: 1.356 ± 0.192
GluIle: 4.513 ± 0.374
GluLys: 3.899 ± 0.361
GluLeu: 6.526 ± 0.487
GluMet: 1.992 ± 0.238
GluAsn: 3.178 ± 0.265
GluPro: 2.225 ± 0.237
GluGln: 2.5 ± 0.226
GluArg: 3.39 ± 0.307
GluSer: 3.645 ± 0.32
GluThr: 3.454 ± 0.247
GluVal: 4.386 ± 0.312
GluTrp: 1.547 ± 0.194
GluTyr: 2.797 ± 0.241
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.691 ± 0.289
PheCys: 0.572 ± 0.109
PheAsp: 3.645 ± 0.345
PheGlu: 2.352 ± 0.21
PhePhe: 1.632 ± 0.204
PheGly: 3.178 ± 0.279
PheHis: 0.932 ± 0.13
PheIle: 2.627 ± 0.243
PheLys: 3.115 ± 0.283
PheLeu: 2.776 ± 0.234
PheMet: 1.271 ± 0.158
PheAsn: 2.331 ± 0.229
PhePro: 1.208 ± 0.17
PheGln: 1.907 ± 0.209
PheArg: 2.394 ± 0.263
PheSer: 2.352 ± 0.208
PheThr: 2.564 ± 0.238
PheVal: 2.797 ± 0.253
PheTrp: 0.657 ± 0.141
PheTyr: 1.547 ± 0.201
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.958 ± 0.346
GlyCys: 0.742 ± 0.101
GlyAsp: 3.941 ± 0.249
GlyGlu: 4.429 ± 0.377
GlyPhe: 2.755 ± 0.26
GlyGly: 4.746 ± 0.31
GlyHis: 1.102 ± 0.18
GlyIle: 3.496 ± 0.266
GlyLys: 4.09 ± 0.359
GlyLeu: 5.255 ± 0.377
GlyMet: 1.907 ± 0.22
GlyAsn: 3.009 ± 0.298
GlyPro: 1.78 ± 0.205
GlyGln: 2.543 ± 0.227
GlyArg: 2.903 ± 0.244
GlySer: 4.047 ± 0.391
GlyThr: 4.662 ± 0.468
GlyVal: 4.895 ± 0.372
GlyTrp: 1.632 ± 0.175
GlyTyr: 3.242 ± 0.242
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.208 ± 0.183
HisCys: 0.233 ± 0.082
HisAsp: 1.187 ± 0.198
HisGlu: 1.081 ± 0.152
HisPhe: 0.932 ± 0.167
HisGly: 1.377 ± 0.202
HisHis: 0.487 ± 0.13
HisIle: 0.932 ± 0.145
HisLys: 1.187 ± 0.157
HisLeu: 1.462 ± 0.224
HisMet: 0.551 ± 0.131
HisAsn: 0.763 ± 0.142
HisPro: 0.784 ± 0.119
HisGln: 0.466 ± 0.086
HisArg: 1.25 ± 0.154
HisSer: 0.784 ± 0.141
HisThr: 0.954 ± 0.137
HisVal: 1.293 ± 0.163
HisTrp: 0.212 ± 0.061
HisTyr: 0.551 ± 0.109
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.556 ± 0.312
IleCys: 0.509 ± 0.104
IleAsp: 4.026 ± 0.312
IleGlu: 4.471 ± 0.326
IlePhe: 1.822 ± 0.193
IleGly: 3.581 ± 0.316
IleHis: 1.059 ± 0.195
IleIle: 2.988 ± 0.288
IleLys: 4.64 ± 0.297
IleLeu: 4.619 ± 0.367
IleMet: 1.356 ± 0.177
IleAsn: 3.094 ± 0.246
IlePro: 3.242 ± 0.287
IleGln: 2.479 ± 0.2
IleArg: 3.645 ± 0.268
IleSer: 3.136 ± 0.251
IleThr: 3.878 ± 0.248
IleVal: 3.941 ± 0.276
IleTrp: 0.678 ± 0.11
IleTyr: 2.034 ± 0.211
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.916 ± 0.364
LysCys: 0.487 ± 0.096
LysAsp: 3.581 ± 0.282
LysGlu: 3.92 ± 0.39
LysPhe: 2.437 ± 0.235
LysGly: 3.284 ± 0.278
LysHis: 1.017 ± 0.153
LysIle: 4.45 ± 0.344
LysLys: 4.471 ± 0.43
LysLeu: 5.255 ± 0.338
LysMet: 2.331 ± 0.208
LysAsn: 2.585 ± 0.246
LysPro: 2.733 ± 0.267
LysGln: 2.394 ± 0.235
LysArg: 3.581 ± 0.305
LysSer: 3.178 ± 0.302
LysThr: 3.306 ± 0.274
LysVal: 3.962 ± 0.321
LysTrp: 1.144 ± 0.188
LysTyr: 2.416 ± 0.248
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.721 ± 0.357
LeuCys: 0.72 ± 0.116
LeuAsp: 6.357 ± 0.358
LeuGlu: 6.759 ± 0.434
LeuPhe: 2.861 ± 0.278
LeuGly: 4.64 ± 0.308
LeuHis: 1.738 ± 0.22
LeuIle: 4.746 ± 0.305
LeuLys: 5.403 ± 0.406
LeuLeu: 5.467 ± 0.332
LeuMet: 2.246 ± 0.246
LeuAsn: 4.492 ± 0.353
LeuPro: 3.094 ± 0.25
LeuGln: 3.072 ± 0.308
LeuArg: 4.259 ± 0.318
LeuSer: 5.149 ± 0.385
LeuThr: 4.958 ± 0.347
LeuVal: 5.149 ± 0.345
LeuTrp: 1.059 ± 0.137
LeuTyr: 2.479 ± 0.249
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.31 ± 0.215
MetCys: 0.233 ± 0.073
MetAsp: 1.822 ± 0.233
MetGlu: 1.526 ± 0.191
MetPhe: 1.229 ± 0.169
MetGly: 1.356 ± 0.186
MetHis: 0.36 ± 0.086
MetIle: 1.589 ± 0.195
MetLys: 1.865 ± 0.205
MetLeu: 1.949 ± 0.235
MetMet: 0.89 ± 0.167
MetAsn: 1.907 ± 0.214
MetPro: 1.059 ± 0.138
MetGln: 1.081 ± 0.147
MetArg: 1.271 ± 0.181
MetSer: 2.31 ± 0.217
MetThr: 2.14 ± 0.197
MetVal: 1.632 ± 0.158
MetTrp: 0.339 ± 0.08
MetTyr: 1.25 ± 0.165
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.2 ± 0.329
AsnCys: 0.466 ± 0.098
AsnAsp: 2.733 ± 0.259
AsnGlu: 2.755 ± 0.244
AsnPhe: 2.331 ± 0.262
AsnGly: 4.344 ± 0.422
AsnHis: 0.572 ± 0.111
AsnIle: 2.839 ± 0.266
AsnLys: 3.072 ± 0.29
AsnLeu: 3.729 ± 0.243
AsnMet: 1.271 ± 0.154
AsnAsn: 2.161 ± 0.289
AsnPro: 2.627 ± 0.238
AsnGln: 1.674 ± 0.191
AsnArg: 2.691 ± 0.225
AsnSer: 2.882 ± 0.259
AsnThr: 2.606 ± 0.231
AsnVal: 3.115 ± 0.266
AsnTrp: 0.89 ± 0.136
AsnTyr: 1.547 ± 0.161
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.411 ± 0.341
ProCys: 0.297 ± 0.075
ProAsp: 3.178 ± 0.242
ProGlu: 2.839 ± 0.285
ProPhe: 1.992 ± 0.155
ProGly: 2.945 ± 0.326
ProHis: 0.403 ± 0.117
ProIle: 2.373 ± 0.227
ProLys: 2.373 ± 0.253
ProLeu: 2.564 ± 0.273
ProMet: 0.805 ± 0.144
ProAsn: 2.034 ± 0.219
ProPro: 1.42 ± 0.193
ProGln: 1.462 ± 0.179
ProArg: 1.653 ± 0.189
ProSer: 2.225 ± 0.208
ProThr: 2.903 ± 0.316
ProVal: 3.348 ± 0.257
ProTrp: 0.509 ± 0.109
ProTyr: 1.992 ± 0.231
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.094 ± 0.251
GlnCys: 0.487 ± 0.098
GlnAsp: 2.352 ± 0.217
GlnGlu: 2.31 ± 0.238
GlnPhe: 2.098 ± 0.201
GlnGly: 1.886 ± 0.199
GlnHis: 0.593 ± 0.109
GlnIle: 3.115 ± 0.286
GlnLys: 2.119 ± 0.205
GlnLeu: 3.284 ± 0.249
GlnMet: 1.293 ± 0.204
GlnAsn: 1.865 ± 0.192
GlnPro: 1.208 ± 0.132
GlnGln: 1.547 ± 0.233
GlnArg: 2.098 ± 0.188
GlnSer: 2.119 ± 0.238
GlnThr: 2.31 ± 0.249
GlnVal: 2.861 ± 0.311
GlnTrp: 0.784 ± 0.13
GlnTyr: 1.483 ± 0.168
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.878 ± 0.236
ArgCys: 0.593 ± 0.125
ArgAsp: 3.941 ± 0.312
ArgGlu: 3.581 ± 0.341
ArgPhe: 2.522 ± 0.24
ArgGly: 3.263 ± 0.286
ArgHis: 0.784 ± 0.142
ArgIle: 3.348 ± 0.262
ArgLys: 2.966 ± 0.283
ArgLeu: 4.407 ± 0.303
ArgMet: 1.229 ± 0.173
ArgAsn: 2.204 ± 0.254
ArgPro: 1.674 ± 0.174
ArgGln: 2.14 ± 0.21
ArgArg: 3.157 ± 0.326
ArgSer: 3.051 ± 0.201
ArgThr: 2.649 ± 0.233
ArgVal: 3.475 ± 0.236
ArgTrp: 0.848 ± 0.144
ArgTyr: 2.288 ± 0.221
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.259 ± 0.304
SerCys: 0.699 ± 0.132
SerAsp: 3.666 ± 0.274
SerGlu: 4.068 ± 0.322
SerPhe: 2.733 ± 0.264
SerGly: 4.683 ± 0.323
SerHis: 1.017 ± 0.15
SerIle: 3.454 ± 0.273
SerLys: 3.327 ± 0.292
SerLeu: 4.874 ± 0.349
SerMet: 1.462 ± 0.196
SerAsn: 2.776 ± 0.217
SerPro: 2.416 ± 0.266
SerGln: 2.437 ± 0.236
SerArg: 2.691 ± 0.199
SerSer: 3.856 ± 0.332
SerThr: 3.878 ± 0.328
SerVal: 4.026 ± 0.307
SerTrp: 1.187 ± 0.167
SerTyr: 2.225 ± 0.221
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.937 ± 0.458
ThrCys: 0.593 ± 0.112
ThrAsp: 3.623 ± 0.261
ThrGlu: 3.687 ± 0.267
ThrPhe: 3.051 ± 0.285
ThrGly: 4.45 ± 0.507
ThrHis: 0.954 ± 0.148
ThrIle: 3.327 ± 0.322
ThrLys: 3.263 ± 0.269
ThrLeu: 5.17 ± 0.362
ThrMet: 1.653 ± 0.22
ThrAsn: 2.627 ± 0.274
ThrPro: 3.348 ± 0.339
ThrGln: 2.034 ± 0.204
ThrArg: 2.691 ± 0.221
ThrSer: 3.793 ± 0.399
ThrThr: 3.221 ± 0.422
ThrVal: 4.45 ± 0.286
ThrTrp: 1.208 ± 0.16
ThrTyr: 2.543 ± 0.352
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.513 ± 0.316
ValCys: 0.805 ± 0.15
ValAsp: 5.064 ± 0.371
ValGlu: 4.874 ± 0.309
ValPhe: 2.437 ± 0.213
ValGly: 4.09 ± 0.324
ValHis: 1.356 ± 0.16
ValIle: 4.132 ± 0.271
ValLys: 4.513 ± 0.292
ValLeu: 4.874 ± 0.337
ValMet: 1.674 ± 0.191
ValAsn: 3.2 ± 0.283
ValPro: 2.861 ± 0.272
ValGln: 2.839 ± 0.229
ValArg: 3.39 ± 0.245
ValSer: 4.195 ± 0.308
ValThr: 4.683 ± 0.447
ValVal: 4.683 ± 0.272
ValTrp: 0.869 ± 0.135
ValTyr: 2.416 ± 0.198
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.271 ± 0.174
TrpCys: 0.148 ± 0.054
TrpAsp: 0.911 ± 0.144
TrpGlu: 1.208 ± 0.188
TrpPhe: 0.826 ± 0.128
TrpGly: 0.911 ± 0.139
TrpHis: 0.424 ± 0.098
TrpIle: 0.911 ± 0.129
TrpLys: 0.932 ± 0.143
TrpLeu: 1.441 ± 0.152
TrpMet: 0.445 ± 0.101
TrpAsn: 1.123 ± 0.147
TrpPro: 0.254 ± 0.068
TrpGln: 0.593 ± 0.124
TrpArg: 1.123 ± 0.157
TrpSer: 1.165 ± 0.159
TrpThr: 1.271 ± 0.182
TrpVal: 1.038 ± 0.145
TrpTrp: 0.339 ± 0.088
TrpTyr: 0.996 ± 0.169
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.924 ± 0.235
TyrCys: 0.403 ± 0.099
TyrAsp: 2.903 ± 0.246
TyrGlu: 2.649 ± 0.288
TyrPhe: 1.441 ± 0.16
TyrGly: 1.992 ± 0.238
TyrHis: 0.996 ± 0.164
TyrIle: 2.077 ± 0.187
TyrLys: 2.288 ± 0.274
TyrLeu: 3.157 ± 0.228
TyrMet: 1.017 ± 0.151
TyrAsn: 2.077 ± 0.216
TyrPro: 1.504 ± 0.171
TyrGln: 1.865 ± 0.241
TyrArg: 2.013 ± 0.226
TyrSer: 2.31 ± 0.221
TyrThr: 2.733 ± 0.274
TyrVal: 2.543 ± 0.261
TyrTrp: 0.657 ± 0.117
TyrTyr: 1.526 ± 0.178
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 257 proteins (47195 amino acids)
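To get a feel for what a per mille value means in absolute terms, a frequency can be scaled back to an approximate occurrence count. The sketch below is hedged: the page does not state which denominator was used, so normalizing by the total number of amino acids (47195) is an assumption, and `approx_count` is a hypothetical helper.

```python
TOTAL_AMINO_ACIDS = 47195  # taken from the statistics line above
TOTAL_PROTEINS = 257       # taken from the statistics line above

# If pairs are counted within proteins only, each protein of length n
# contributes n - 1 dipeptides, giving this alternative denominator.
TOTAL_DIPEPTIDES = TOTAL_AMINO_ACIDS - TOTAL_PROTEINS


def approx_count(per_mille, denominator=TOTAL_AMINO_ACIDS):
    """Estimate how many occurrences a per mille frequency corresponds to.

    The default denominator is an assumption, not documented by the source.
    """
    return round(per_mille * 1e-3 * denominator)


cys_cys_count = approx_count(0.148)  # CysCys, one of the rarest dipeptides
ala_ala_count = approx_count(6.696)  # AlaAla, one of the most frequent
```

Under this assumption, CysCys corresponds to only a handful of occurrences in the whole proteome, which is why its relative error is large.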

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
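Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a minimal illustration under stated assumptions, not the database's actual code: the function names, the use of the sample standard deviation, and the fixed random seed are all choices made here.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not phage data.
toy_proteins = ["MAAKL", "MKAAL", "MLLKA", "MAKAA"]
error = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.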

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski