Amino acid dipepetide frequency for Aeromonas phage 4_L372D

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.816AlaAla: 3.816 ± 0.546
0.659AlaCys: 0.659 ± 0.134
2.444AlaAsp: 2.444 ± 0.276
3.322AlaGlu: 3.322 ± 0.337
1.812AlaPhe: 1.812 ± 0.235
3.844AlaGly: 3.844 ± 0.402
0.824AlaHis: 0.824 ± 0.154
3.02AlaIle: 3.02 ± 0.35
4.064AlaLys: 4.064 ± 0.344
4.53AlaLeu: 4.53 ± 0.458
1.181AlaMet: 1.181 ± 0.189
1.647AlaAsn: 1.647 ± 0.241
1.538AlaPro: 1.538 ± 0.265
1.867AlaGln: 1.867 ± 0.212
2.361AlaArg: 2.361 ± 0.276
3.267AlaSer: 3.267 ± 0.414
2.883AlaThr: 2.883 ± 0.367
2.636AlaVal: 2.636 ± 0.27
0.851AlaTrp: 0.851 ± 0.164
2.389AlaTyr: 2.389 ± 0.226
0.0AlaXaa: 0.0 ± 0.0
Cys
0.988CysAla: 0.988 ± 0.156
0.302CysCys: 0.302 ± 0.083
0.988CysAsp: 0.988 ± 0.166
1.208CysGlu: 1.208 ± 0.233
0.824CysPhe: 0.824 ± 0.138
1.592CysGly: 1.592 ± 0.25
0.357CysHis: 0.357 ± 0.075
1.016CysIle: 1.016 ± 0.194
1.867CysLys: 1.867 ± 0.262
1.126CysLeu: 1.126 ± 0.177
0.329CysMet: 0.329 ± 0.101
0.467CysAsn: 0.467 ± 0.117
0.796CysPro: 0.796 ± 0.161
0.577CysGln: 0.577 ± 0.122
0.604CysArg: 0.604 ± 0.141
1.483CysSer: 1.483 ± 0.233
0.741CysThr: 0.741 ± 0.141
1.071CysVal: 1.071 ± 0.179
0.522CysTrp: 0.522 ± 0.115
0.796CysTyr: 0.796 ± 0.134
0.0CysXaa: 0.0 ± 0.0
Asp
2.526AspAla: 2.526 ± 0.234
0.851AspCys: 0.851 ± 0.168
3.212AspAsp: 3.212 ± 0.399
3.844AspGlu: 3.844 ± 0.359
3.762AspPhe: 3.762 ± 0.351
4.201AspGly: 4.201 ± 0.381
0.934AspHis: 0.934 ± 0.172
4.256AspIle: 4.256 ± 0.409
6.59AspLys: 6.59 ± 0.496
4.777AspLeu: 4.777 ± 0.313
1.455AspMet: 1.455 ± 0.197
3.871AspAsn: 3.871 ± 0.306
2.224AspPro: 2.224 ± 0.218
1.29AspGln: 1.29 ± 0.172
1.867AspArg: 1.867 ± 0.232
3.597AspSer: 3.597 ± 0.337
3.35AspThr: 3.35 ± 0.336
3.871AspVal: 3.871 ± 0.359
1.345AspTrp: 1.345 ± 0.193
4.173AspTyr: 4.173 ± 0.363
0.0AspXaa: 0.0 ± 0.0
Glu
4.064GluAla: 4.064 ± 0.38
1.4GluCys: 1.4 ± 0.219
5.079GluAsp: 5.079 ± 0.348
5.766GluGlu: 5.766 ± 0.484
3.35GluPhe: 3.35 ± 0.335
4.009GluGly: 4.009 ± 0.317
1.29GluHis: 1.29 ± 0.216
5.546GluIle: 5.546 ± 0.417
5.134GluLys: 5.134 ± 0.401
6.809GluLeu: 6.809 ± 0.344
2.746GluMet: 2.746 ± 0.234
4.064GluAsn: 4.064 ± 0.348
1.73GluPro: 1.73 ± 0.248
3.954GluGln: 3.954 ± 0.402
3.185GluArg: 3.185 ± 0.271
4.283GluSer: 4.283 ± 0.342
3.322GluThr: 3.322 ± 0.305
5.574GluVal: 5.574 ± 0.383
1.318GluTrp: 1.318 ± 0.207
3.789GluTyr: 3.789 ± 0.329
0.0GluXaa: 0.0 ± 0.0
Phe
2.114PheAla: 2.114 ± 0.211
0.824PheCys: 0.824 ± 0.142
3.624PheAsp: 3.624 ± 0.313
3.432PheGlu: 3.432 ± 0.317
0.906PhePhe: 0.906 ± 0.162
2.691PheGly: 2.691 ± 0.343
0.851PheHis: 0.851 ± 0.151
4.613PheIle: 4.613 ± 0.399
3.789PheLys: 3.789 ± 0.279
2.663PheLeu: 2.663 ± 0.257
1.263PheMet: 1.263 ± 0.159
2.746PheAsn: 2.746 ± 0.283
1.043PhePro: 1.043 ± 0.17
1.373PheGln: 1.373 ± 0.172
1.592PheArg: 1.592 ± 0.252
3.212PheSer: 3.212 ± 0.349
2.636PheThr: 2.636 ± 0.286
3.789PheVal: 3.789 ± 0.286
0.467PheTrp: 0.467 ± 0.121
1.949PheTyr: 1.949 ± 0.211
0.0PheXaa: 0.0 ± 0.0
Gly
2.746GlyAla: 2.746 ± 0.398
1.318GlyCys: 1.318 ± 0.236
2.581GlyAsp: 2.581 ± 0.283
3.377GlyGlu: 3.377 ± 0.344
3.212GlyPhe: 3.212 ± 0.304
3.871GlyGly: 3.871 ± 0.506
0.796GlyHis: 0.796 ± 0.172
3.926GlyIle: 3.926 ± 0.331
5.244GlyLys: 5.244 ± 0.405
5.272GlyLeu: 5.272 ± 0.367
0.934GlyMet: 0.934 ± 0.189
3.899GlyAsn: 3.899 ± 0.431
0.577GlyPro: 0.577 ± 0.107
2.306GlyGln: 2.306 ± 0.273
2.718GlyArg: 2.718 ± 0.244
3.844GlySer: 3.844 ± 0.458
3.048GlyThr: 3.048 ± 0.422
4.448GlyVal: 4.448 ± 0.358
1.016GlyTrp: 1.016 ± 0.164
3.762GlyTyr: 3.762 ± 0.37
0.0GlyXaa: 0.0 ± 0.0
His
0.604HisAla: 0.604 ± 0.129
0.357HisCys: 0.357 ± 0.105
0.851HisAsp: 0.851 ± 0.157
1.016HisGlu: 1.016 ± 0.173
0.988HisPhe: 0.988 ± 0.159
1.675HisGly: 1.675 ± 0.261
0.357HisHis: 0.357 ± 0.115
1.098HisIle: 1.098 ± 0.161
2.032HisLys: 2.032 ± 0.212
1.4HisLeu: 1.4 ± 0.204
0.384HisMet: 0.384 ± 0.122
1.51HisAsn: 1.51 ± 0.212
0.577HisPro: 0.577 ± 0.111
0.796HisGln: 0.796 ± 0.161
0.632HisArg: 0.632 ± 0.139
1.29HisSer: 1.29 ± 0.168
0.934HisThr: 0.934 ± 0.148
1.098HisVal: 1.098 ± 0.175
0.22HisTrp: 0.22 ± 0.072
1.126HisTyr: 1.126 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
3.24IleAla: 3.24 ± 0.357
1.51IleCys: 1.51 ± 0.212
4.668IleAsp: 4.668 ± 0.378
5.272IleGlu: 5.272 ± 0.402
2.306IlePhe: 2.306 ± 0.249
3.762IleGly: 3.762 ± 0.274
1.51IleHis: 1.51 ± 0.222
4.448IleIle: 4.448 ± 0.315
6.068IleLys: 6.068 ± 0.479
4.915IleLeu: 4.915 ± 0.396
1.675IleMet: 1.675 ± 0.187
4.119IleAsn: 4.119 ± 0.39
2.224IlePro: 2.224 ± 0.228
2.608IleGln: 2.608 ± 0.291
2.801IleArg: 2.801 ± 0.275
3.844IleSer: 3.844 ± 0.286
3.405IleThr: 3.405 ± 0.416
4.832IleVal: 4.832 ± 0.377
0.741IleTrp: 0.741 ± 0.133
3.158IleTyr: 3.158 ± 0.299
0.0IleXaa: 0.0 ± 0.0
Lys
3.844LysAla: 3.844 ± 0.402
1.428LysCys: 1.428 ± 0.202
6.864LysAsp: 6.864 ± 0.543
8.072LysGlu: 8.072 ± 0.567
3.267LysPhe: 3.267 ± 0.318
4.338LysGly: 4.338 ± 0.392
1.895LysHis: 1.895 ± 0.231
5.766LysIle: 5.766 ± 0.344
6.013LysLys: 6.013 ± 0.535
7.633LysLeu: 7.633 ± 0.444
3.048LysMet: 3.048 ± 0.309
4.311LysAsn: 4.311 ± 0.286
2.444LysPro: 2.444 ± 0.329
5.107LysGln: 5.107 ± 0.403
2.773LysArg: 2.773 ± 0.291
4.97LysSer: 4.97 ± 0.408
4.915LysThr: 4.915 ± 0.366
5.684LysVal: 5.684 ± 0.436
1.236LysTrp: 1.236 ± 0.203
3.816LysTyr: 3.816 ± 0.306
0.0LysXaa: 0.0 ± 0.0
Leu
4.146LeuAla: 4.146 ± 0.383
0.934LeuCys: 0.934 ± 0.157
6.15LeuAsp: 6.15 ± 0.345
6.178LeuGlu: 6.178 ± 0.403
3.981LeuPhe: 3.981 ± 0.297
4.53LeuGly: 4.53 ± 0.38
1.73LeuHis: 1.73 ± 0.2
4.723LeuIle: 4.723 ± 0.433
7.386LeuLys: 7.386 ± 0.503
6.205LeuLeu: 6.205 ± 0.467
2.142LeuMet: 2.142 ± 0.242
5.629LeuAsn: 5.629 ± 0.375
2.746LeuPro: 2.746 ± 0.291
3.24LeuGln: 3.24 ± 0.308
3.267LeuArg: 3.267 ± 0.325
5.272LeuSer: 5.272 ± 0.372
4.393LeuThr: 4.393 ± 0.379
5.052LeuVal: 5.052 ± 0.403
0.906LeuTrp: 0.906 ± 0.178
3.103LeuTyr: 3.103 ± 0.273
0.0LeuXaa: 0.0 ± 0.0
Met
1.455MetAla: 1.455 ± 0.211
0.467MetCys: 0.467 ± 0.113
0.934MetAsp: 0.934 ± 0.159
1.84MetGlu: 1.84 ± 0.248
1.318MetPhe: 1.318 ± 0.198
0.879MetGly: 0.879 ± 0.152
0.439MetHis: 0.439 ± 0.146
1.675MetIle: 1.675 ± 0.245
2.773MetLys: 2.773 ± 0.276
1.895MetLeu: 1.895 ± 0.22
0.632MetMet: 0.632 ± 0.138
1.647MetAsn: 1.647 ± 0.221
0.522MetPro: 0.522 ± 0.108
1.208MetGln: 1.208 ± 0.195
0.988MetArg: 0.988 ± 0.174
2.251MetSer: 2.251 ± 0.259
1.51MetThr: 1.51 ± 0.201
1.318MetVal: 1.318 ± 0.179
0.302MetTrp: 0.302 ± 0.098
1.263MetTyr: 1.263 ± 0.189
0.0MetXaa: 0.0 ± 0.0
Asn
2.938AsnAla: 2.938 ± 0.339
1.236AsnCys: 1.236 ± 0.169
2.718AsnAsp: 2.718 ± 0.284
3.212AsnGlu: 3.212 ± 0.26
2.746AsnPhe: 2.746 ± 0.322
3.569AsnGly: 3.569 ± 0.317
1.208AsnHis: 1.208 ± 0.18
3.981AsnIle: 3.981 ± 0.29
6.397AsnLys: 6.397 ± 0.394
5.162AsnLeu: 5.162 ± 0.364
1.236AsnMet: 1.236 ± 0.189
3.816AsnAsn: 3.816 ± 0.367
1.702AsnPro: 1.702 ± 0.263
1.977AsnGln: 1.977 ± 0.248
2.416AsnArg: 2.416 ± 0.248
3.487AsnSer: 3.487 ± 0.379
3.185AsnThr: 3.185 ± 0.295
3.24AsnVal: 3.24 ± 0.313
0.714AsnTrp: 0.714 ± 0.129
2.471AsnTyr: 2.471 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
1.153ProAla: 1.153 ± 0.198
0.659ProCys: 0.659 ± 0.127
2.361ProAsp: 2.361 ± 0.266
3.487ProGlu: 3.487 ± 0.368
1.345ProPhe: 1.345 ± 0.201
0.577ProGly: 0.577 ± 0.139
0.494ProHis: 0.494 ± 0.121
1.757ProIle: 1.757 ± 0.201
2.251ProLys: 2.251 ± 0.239
2.224ProLeu: 2.224 ± 0.27
0.522ProMet: 0.522 ± 0.124
1.565ProAsn: 1.565 ± 0.231
0.549ProPro: 0.549 ± 0.12
1.263ProGln: 1.263 ± 0.198
0.714ProArg: 0.714 ± 0.145
2.224ProSer: 2.224 ± 0.271
1.62ProThr: 1.62 ± 0.254
2.279ProVal: 2.279 ± 0.259
0.275ProTrp: 0.275 ± 0.103
1.263ProTyr: 1.263 ± 0.149
0.0ProXaa: 0.0 ± 0.0
Gln
2.032GlnAla: 2.032 ± 0.246
0.522GlnCys: 0.522 ± 0.11
2.142GlnAsp: 2.142 ± 0.277
4.173GlnGlu: 4.173 ± 0.331
1.895GlnPhe: 1.895 ± 0.236
1.949GlnGly: 1.949 ± 0.218
0.741GlnHis: 0.741 ± 0.157
2.691GlnIle: 2.691 ± 0.299
2.883GlnLys: 2.883 ± 0.319
4.338GlnLeu: 4.338 ± 0.39
1.126GlnMet: 1.126 ± 0.19
2.169GlnAsn: 2.169 ± 0.234
1.153GlnPro: 1.153 ± 0.18
2.224GlnGln: 2.224 ± 0.294
1.62GlnArg: 1.62 ± 0.218
1.977GlnSer: 1.977 ± 0.21
1.757GlnThr: 1.757 ± 0.246
2.334GlnVal: 2.334 ± 0.274
0.439GlnTrp: 0.439 ± 0.097
2.416GlnTyr: 2.416 ± 0.272
0.0GlnXaa: 0.0 ± 0.0
Arg
1.84ArgAla: 1.84 ± 0.212
0.632ArgCys: 0.632 ± 0.129
1.977ArgAsp: 1.977 ± 0.223
3.267ArgGlu: 3.267 ± 0.333
1.977ArgPhe: 1.977 ± 0.249
2.251ArgGly: 2.251 ± 0.262
0.769ArgHis: 0.769 ± 0.125
2.197ArgIle: 2.197 ± 0.245
3.514ArgLys: 3.514 ± 0.291
3.212ArgLeu: 3.212 ± 0.291
0.879ArgMet: 0.879 ± 0.155
2.499ArgAsn: 2.499 ± 0.238
0.851ArgPro: 0.851 ± 0.163
1.538ArgGln: 1.538 ± 0.204
1.647ArgArg: 1.647 ± 0.254
3.02ArgSer: 3.02 ± 0.286
1.84ArgThr: 1.84 ± 0.202
1.977ArgVal: 1.977 ± 0.228
0.632ArgTrp: 0.632 ± 0.125
1.675ArgTyr: 1.675 ± 0.276
0.0ArgXaa: 0.0 ± 0.0
Ser
3.103SerAla: 3.103 ± 0.393
1.098SerCys: 1.098 ± 0.14
3.981SerAsp: 3.981 ± 0.322
4.723SerGlu: 4.723 ± 0.363
3.487SerPhe: 3.487 ± 0.326
4.475SerGly: 4.475 ± 0.398
1.071SerHis: 1.071 ± 0.197
4.75SerIle: 4.75 ± 0.325
5.025SerLys: 5.025 ± 0.428
4.887SerLeu: 4.887 ± 0.396
1.538SerMet: 1.538 ± 0.189
3.158SerAsn: 3.158 ± 0.292
1.949SerPro: 1.949 ± 0.247
2.059SerGln: 2.059 ± 0.235
2.444SerArg: 2.444 ± 0.266
3.899SerSer: 3.899 ± 0.391
3.487SerThr: 3.487 ± 0.315
4.256SerVal: 4.256 ± 0.325
1.236SerTrp: 1.236 ± 0.159
2.444SerTyr: 2.444 ± 0.222
0.0SerXaa: 0.0 ± 0.0
Thr
2.608ThrAla: 2.608 ± 0.358
0.906ThrCys: 0.906 ± 0.135
2.993ThrAsp: 2.993 ± 0.285
4.338ThrGlu: 4.338 ± 0.354
2.361ThrPhe: 2.361 ± 0.277
3.46ThrGly: 3.46 ± 0.4
1.4ThrHis: 1.4 ± 0.219
4.036ThrIle: 4.036 ± 0.4
4.695ThrLys: 4.695 ± 0.382
4.613ThrLeu: 4.613 ± 0.367
0.988ThrMet: 0.988 ± 0.165
2.636ThrAsn: 2.636 ± 0.262
1.977ThrPro: 1.977 ± 0.311
2.169ThrGln: 2.169 ± 0.228
1.785ThrArg: 1.785 ± 0.212
2.499ThrSer: 2.499 ± 0.296
3.103ThrThr: 3.103 ± 0.316
3.542ThrVal: 3.542 ± 0.376
0.604ThrTrp: 0.604 ± 0.129
2.361ThrTyr: 2.361 ± 0.258
0.0ThrXaa: 0.0 ± 0.0
Val
3.322ValAla: 3.322 ± 0.329
1.236ValCys: 1.236 ± 0.196
4.64ValAsp: 4.64 ± 0.394
5.162ValGlu: 5.162 ± 0.349
2.746ValPhe: 2.746 ± 0.258
4.338ValGly: 4.338 ± 0.378
1.071ValHis: 1.071 ± 0.184
4.311ValIle: 4.311 ± 0.294
5.409ValLys: 5.409 ± 0.446
4.915ValLeu: 4.915 ± 0.436
1.73ValMet: 1.73 ± 0.21
3.185ValAsn: 3.185 ± 0.303
1.922ValPro: 1.922 ± 0.25
2.306ValGln: 2.306 ± 0.256
2.334ValArg: 2.334 ± 0.244
4.421ValSer: 4.421 ± 0.346
4.009ValThr: 4.009 ± 0.369
5.162ValVal: 5.162 ± 0.455
0.714ValTrp: 0.714 ± 0.13
3.24ValTyr: 3.24 ± 0.302
0.0ValXaa: 0.0 ± 0.0
Trp
0.604TrpAla: 0.604 ± 0.132
0.247TrpCys: 0.247 ± 0.092
1.043TrpAsp: 1.043 ± 0.187
1.043TrpGlu: 1.043 ± 0.143
0.961TrpPhe: 0.961 ± 0.169
0.632TrpGly: 0.632 ± 0.144
0.302TrpHis: 0.302 ± 0.093
1.016TrpIle: 1.016 ± 0.143
1.236TrpLys: 1.236 ± 0.199
1.318TrpLeu: 1.318 ± 0.198
0.467TrpMet: 0.467 ± 0.121
0.632TrpAsn: 0.632 ± 0.138
0.247TrpPro: 0.247 ± 0.077
0.659TrpGln: 0.659 ± 0.165
0.494TrpArg: 0.494 ± 0.139
1.071TrpSer: 1.071 ± 0.15
0.659TrpThr: 0.659 ± 0.139
0.796TrpVal: 0.796 ± 0.139
0.192TrpTrp: 0.192 ± 0.079
0.741TrpTyr: 0.741 ± 0.165
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.84TyrAla: 1.84 ± 0.212
1.071TyrCys: 1.071 ± 0.181
2.746TyrAsp: 2.746 ± 0.302
3.679TyrGlu: 3.679 ± 0.317
2.334TyrPhe: 2.334 ± 0.224
2.251TyrGly: 2.251 ± 0.23
0.906TyrHis: 0.906 ± 0.166
2.279TyrIle: 2.279 ± 0.233
5.052TyrLys: 5.052 ± 0.439
3.816TyrLeu: 3.816 ± 0.328
1.016TyrMet: 1.016 ± 0.16
3.844TyrAsn: 3.844 ± 0.384
1.812TyrPro: 1.812 ± 0.185
2.059TyrGln: 2.059 ± 0.245
1.949TyrArg: 1.949 ± 0.252
3.158TyrSer: 3.158 ± 0.256
2.279TyrThr: 2.279 ± 0.283
3.212TyrVal: 3.212 ± 0.344
0.632TyrTrp: 0.632 ± 0.126
2.636TyrTyr: 2.636 ± 0.277
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 223 proteins (36422 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski