Amino acid dipepetide frequency for Vibrio phage Brizo

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.257AlaAla: 6.257 ± 1.044
0.87AlaCys: 0.87 ± 0.159
4.742AlaAsp: 4.742 ± 0.415
5.556AlaGlu: 5.556 ± 0.597
2.946AlaPhe: 2.946 ± 0.309
4.63AlaGly: 4.63 ± 0.361
1.094AlaHis: 1.094 ± 0.198
4.461AlaIle: 4.461 ± 0.375
6.846AlaLys: 6.846 ± 0.635
6.538AlaLeu: 6.538 ± 0.416
2.132AlaMet: 2.132 ± 0.265
4.125AlaAsn: 4.125 ± 0.417
2.637AlaPro: 2.637 ± 0.316
2.974AlaGln: 2.974 ± 0.337
3.563AlaArg: 3.563 ± 0.322
5.079AlaSer: 5.079 ± 0.728
4.77AlaThr: 4.77 ± 0.424
4.405AlaVal: 4.405 ± 0.356
0.954AlaTrp: 0.954 ± 0.186
2.637AlaTyr: 2.637 ± 0.31
0.0AlaXaa: 0.0 ± 0.0
Cys
0.842CysAla: 0.842 ± 0.167
0.14CysCys: 0.14 ± 0.062
0.617CysAsp: 0.617 ± 0.148
1.15CysGlu: 1.15 ± 0.186
0.421CysPhe: 0.421 ± 0.124
0.842CysGly: 0.842 ± 0.175
0.309CysHis: 0.309 ± 0.099
0.673CysIle: 0.673 ± 0.131
1.01CysLys: 1.01 ± 0.174
0.73CysLeu: 0.73 ± 0.182
0.337CysMet: 0.337 ± 0.111
0.701CysAsn: 0.701 ± 0.144
0.449CysPro: 0.449 ± 0.107
0.477CysGln: 0.477 ± 0.1
0.645CysArg: 0.645 ± 0.121
0.673CysSer: 0.673 ± 0.182
0.842CysThr: 0.842 ± 0.154
0.814CysVal: 0.814 ± 0.168
0.112CysTrp: 0.112 ± 0.051
0.505CysTyr: 0.505 ± 0.129
0.0CysXaa: 0.0 ± 0.0
Asp
4.882AspAla: 4.882 ± 0.384
0.701AspCys: 0.701 ± 0.169
3.648AspAsp: 3.648 ± 0.366
4.545AspGlu: 4.545 ± 0.402
3.255AspPhe: 3.255 ± 0.269
4.433AspGly: 4.433 ± 0.447
0.786AspHis: 0.786 ± 0.149
4.517AspIle: 4.517 ± 0.314
4.405AspLys: 4.405 ± 0.387
4.77AspLeu: 4.77 ± 0.378
1.852AspMet: 1.852 ± 0.253
3.451AspAsn: 3.451 ± 0.343
2.245AspPro: 2.245 ± 0.285
1.459AspGln: 1.459 ± 0.213
2.75AspArg: 2.75 ± 0.259
3.984AspSer: 3.984 ± 0.385
3.984AspThr: 3.984 ± 0.396
4.265AspVal: 4.265 ± 0.283
1.263AspTrp: 1.263 ± 0.178
2.553AspTyr: 2.553 ± 0.293
0.0AspXaa: 0.0 ± 0.0
Glu
5.836GluAla: 5.836 ± 0.556
0.982GluCys: 0.982 ± 0.162
3.928GluAsp: 3.928 ± 0.325
4.574GluGlu: 4.574 ± 0.428
2.75GluPhe: 2.75 ± 0.284
4.153GluGly: 4.153 ± 0.324
1.515GluHis: 1.515 ± 0.191
4.826GluIle: 4.826 ± 0.35
4.966GluLys: 4.966 ± 0.427
6.902GluLeu: 6.902 ± 0.579
2.132GluMet: 2.132 ± 0.241
3.62GluAsn: 3.62 ± 0.318
1.852GluPro: 1.852 ± 0.225
3.143GluGln: 3.143 ± 0.371
2.75GluArg: 2.75 ± 0.381
2.89GluSer: 2.89 ± 0.366
3.704GluThr: 3.704 ± 0.394
5.275GluVal: 5.275 ± 0.383
1.15GluTrp: 1.15 ± 0.161
3.114GluTyr: 3.114 ± 0.326
0.0GluXaa: 0.0 ± 0.0
Phe
2.694PheAla: 2.694 ± 0.308
0.645PheCys: 0.645 ± 0.147
3.283PheAsp: 3.283 ± 0.259
2.89PheGlu: 2.89 ± 0.284
1.627PhePhe: 1.627 ± 0.228
2.637PheGly: 2.637 ± 0.273
1.263PheHis: 1.263 ± 0.198
2.722PheIle: 2.722 ± 0.243
3.143PheLys: 3.143 ± 0.298
2.385PheLeu: 2.385 ± 0.296
1.15PheMet: 1.15 ± 0.18
2.357PheAsn: 2.357 ± 0.25
1.459PhePro: 1.459 ± 0.203
0.926PheGln: 0.926 ± 0.147
1.599PheArg: 1.599 ± 0.18
2.637PheSer: 2.637 ± 0.315
2.637PheThr: 2.637 ± 0.365
2.217PheVal: 2.217 ± 0.284
0.617PheTrp: 0.617 ± 0.137
1.263PheTyr: 1.263 ± 0.215
0.0PheXaa: 0.0 ± 0.0
Gly
4.405GlyAla: 4.405 ± 0.525
1.01GlyCys: 1.01 ± 0.224
4.742GlyAsp: 4.742 ± 0.433
3.956GlyGlu: 3.956 ± 0.33
3.03GlyPhe: 3.03 ± 0.233
3.591GlyGly: 3.591 ± 0.417
1.207GlyHis: 1.207 ± 0.18
4.405GlyIle: 4.405 ± 0.346
5.191GlyLys: 5.191 ± 0.438
5.275GlyLeu: 5.275 ± 0.382
1.88GlyMet: 1.88 ± 0.212
3.114GlyAsn: 3.114 ± 0.352
0.505GlyPro: 0.505 ± 0.105
1.908GlyGln: 1.908 ± 0.244
3.086GlyArg: 3.086 ± 0.303
4.153GlySer: 4.153 ± 0.449
4.798GlyThr: 4.798 ± 0.416
4.321GlyVal: 4.321 ± 0.318
1.375GlyTrp: 1.375 ± 0.201
2.946GlyTyr: 2.946 ± 0.369
0.0GlyXaa: 0.0 ± 0.0
His
1.01HisAla: 1.01 ± 0.172
0.309HisCys: 0.309 ± 0.095
1.571HisAsp: 1.571 ± 0.189
1.403HisGlu: 1.403 ± 0.199
0.842HisPhe: 0.842 ± 0.155
1.655HisGly: 1.655 ± 0.202
0.589HisHis: 0.589 ± 0.142
1.599HisIle: 1.599 ± 0.241
1.291HisLys: 1.291 ± 0.169
2.217HisLeu: 2.217 ± 0.238
0.673HisMet: 0.673 ± 0.148
0.898HisAsn: 0.898 ± 0.171
0.954HisPro: 0.954 ± 0.158
0.561HisGln: 0.561 ± 0.119
0.954HisArg: 0.954 ± 0.157
0.982HisSer: 0.982 ± 0.214
1.235HisThr: 1.235 ± 0.199
0.982HisVal: 0.982 ± 0.184
0.337HisTrp: 0.337 ± 0.097
0.926HisTyr: 0.926 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
4.461IleAla: 4.461 ± 0.361
0.645IleCys: 0.645 ± 0.142
3.984IleAsp: 3.984 ± 0.335
5.499IleGlu: 5.499 ± 0.461
1.852IlePhe: 1.852 ± 0.218
3.62IleGly: 3.62 ± 0.279
1.263IleHis: 1.263 ± 0.181
3.507IleIle: 3.507 ± 0.324
4.658IleLys: 4.658 ± 0.392
4.181IleLeu: 4.181 ± 0.356
1.768IleMet: 1.768 ± 0.193
3.704IleAsn: 3.704 ± 0.387
2.329IlePro: 2.329 ± 0.237
2.469IleGln: 2.469 ± 0.226
3.143IleArg: 3.143 ± 0.318
4.265IleSer: 4.265 ± 0.407
4.012IleThr: 4.012 ± 0.409
3.676IleVal: 3.676 ± 0.364
0.561IleTrp: 0.561 ± 0.104
2.048IleTyr: 2.048 ± 0.218
0.0IleXaa: 0.0 ± 0.0
Lys
6.818LysAla: 6.818 ± 0.662
0.73LysCys: 0.73 ± 0.148
4.097LysAsp: 4.097 ± 0.403
5.275LysGlu: 5.275 ± 0.398
3.114LysPhe: 3.114 ± 0.319
4.265LysGly: 4.265 ± 0.383
1.655LysHis: 1.655 ± 0.246
4.377LysIle: 4.377 ± 0.308
4.377LysLys: 4.377 ± 0.336
7.52LysLeu: 7.52 ± 0.572
2.385LysMet: 2.385 ± 0.3
2.694LysAsn: 2.694 ± 0.285
3.171LysPro: 3.171 ± 0.358
2.694LysGln: 2.694 ± 0.262
3.058LysArg: 3.058 ± 0.292
4.153LysSer: 4.153 ± 0.336
3.984LysThr: 3.984 ± 0.386
4.545LysVal: 4.545 ± 0.374
0.786LysTrp: 0.786 ± 0.138
2.862LysTyr: 2.862 ± 0.312
0.0LysXaa: 0.0 ± 0.0
Leu
7.52LeuAla: 7.52 ± 0.511
0.926LeuCys: 0.926 ± 0.169
6.257LeuAsp: 6.257 ± 0.423
6.51LeuGlu: 6.51 ± 0.409
2.413LeuPhe: 2.413 ± 0.24
5.752LeuGly: 5.752 ± 0.452
1.824LeuHis: 1.824 ± 0.22
4.742LeuIle: 4.742 ± 0.373
5.191LeuLys: 5.191 ± 0.463
6.229LeuLeu: 6.229 ± 0.477
2.301LeuMet: 2.301 ± 0.239
3.956LeuAsn: 3.956 ± 0.365
3.563LeuPro: 3.563 ± 0.42
3.171LeuGln: 3.171 ± 0.259
3.9LeuArg: 3.9 ± 0.334
4.742LeuSer: 4.742 ± 0.38
4.686LeuThr: 4.686 ± 0.344
6.229LeuVal: 6.229 ± 0.425
0.73LeuTrp: 0.73 ± 0.158
2.553LeuTyr: 2.553 ± 0.285
0.0LeuXaa: 0.0 ± 0.0
Met
2.329MetAla: 2.329 ± 0.245
0.393MetCys: 0.393 ± 0.097
1.543MetAsp: 1.543 ± 0.169
1.88MetGlu: 1.88 ± 0.253
1.207MetPhe: 1.207 ± 0.199
1.655MetGly: 1.655 ± 0.182
0.589MetHis: 0.589 ± 0.142
1.515MetIle: 1.515 ± 0.234
2.385MetLys: 2.385 ± 0.262
2.385MetLeu: 2.385 ± 0.272
0.673MetMet: 0.673 ± 0.161
1.459MetAsn: 1.459 ± 0.246
1.319MetPro: 1.319 ± 0.192
0.898MetGln: 0.898 ± 0.172
1.263MetArg: 1.263 ± 0.199
1.796MetSer: 1.796 ± 0.235
1.655MetThr: 1.655 ± 0.252
1.487MetVal: 1.487 ± 0.209
0.309MetTrp: 0.309 ± 0.094
0.87MetTyr: 0.87 ± 0.135
0.0MetXaa: 0.0 ± 0.0
Asn
3.704AsnAla: 3.704 ± 0.367
0.617AsnCys: 0.617 ± 0.16
2.217AsnAsp: 2.217 ± 0.199
2.918AsnGlu: 2.918 ± 0.251
1.599AsnPhe: 1.599 ± 0.228
3.76AsnGly: 3.76 ± 0.287
1.038AsnHis: 1.038 ± 0.23
3.367AsnIle: 3.367 ± 0.291
4.714AsnLys: 4.714 ± 0.339
4.433AsnLeu: 4.433 ± 0.296
1.291AsnMet: 1.291 ± 0.172
2.441AsnAsn: 2.441 ± 0.255
2.385AsnPro: 2.385 ± 0.278
1.571AsnGln: 1.571 ± 0.24
2.048AsnArg: 2.048 ± 0.294
3.9AsnSer: 3.9 ± 0.371
3.086AsnThr: 3.086 ± 0.318
3.283AsnVal: 3.283 ± 0.333
0.842AsnTrp: 0.842 ± 0.172
1.992AsnTyr: 1.992 ± 0.253
0.0AsnXaa: 0.0 ± 0.0
Pro
2.217ProAla: 2.217 ± 0.238
0.449ProCys: 0.449 ± 0.119
2.413ProAsp: 2.413 ± 0.254
3.03ProGlu: 3.03 ± 0.32
1.403ProPhe: 1.403 ± 0.182
1.908ProGly: 1.908 ± 0.295
0.73ProHis: 0.73 ± 0.131
2.104ProIle: 2.104 ± 0.264
2.16ProLys: 2.16 ± 0.273
2.441ProLeu: 2.441 ± 0.305
1.094ProMet: 1.094 ± 0.2
2.329ProAsn: 2.329 ± 0.317
0.926ProPro: 0.926 ± 0.205
0.982ProGln: 0.982 ± 0.154
1.403ProArg: 1.403 ± 0.198
2.132ProSer: 2.132 ± 0.289
2.217ProThr: 2.217 ± 0.292
2.469ProVal: 2.469 ± 0.263
0.477ProTrp: 0.477 ± 0.12
1.459ProTyr: 1.459 ± 0.227
0.0ProXaa: 0.0 ± 0.0
Gln
3.143GlnAla: 3.143 ± 0.358
0.561GlnCys: 0.561 ± 0.112
1.908GlnAsp: 1.908 ± 0.229
2.301GlnGlu: 2.301 ± 0.265
1.403GlnPhe: 1.403 ± 0.186
2.189GlnGly: 2.189 ± 0.234
0.673GlnHis: 0.673 ± 0.114
2.301GlnIle: 2.301 ± 0.254
1.936GlnLys: 1.936 ± 0.215
3.507GlnLeu: 3.507 ± 0.321
1.094GlnMet: 1.094 ± 0.198
1.178GlnAsn: 1.178 ± 0.217
1.066GlnPro: 1.066 ± 0.162
1.122GlnGln: 1.122 ± 0.231
1.627GlnArg: 1.627 ± 0.26
2.132GlnSer: 2.132 ± 0.348
1.936GlnThr: 1.936 ± 0.201
2.666GlnVal: 2.666 ± 0.275
0.589GlnTrp: 0.589 ± 0.144
1.543GlnTyr: 1.543 ± 0.207
0.0GlnXaa: 0.0 ± 0.0
Arg
3.563ArgAla: 3.563 ± 0.335
0.477ArgCys: 0.477 ± 0.119
3.367ArgAsp: 3.367 ± 0.293
2.778ArgGlu: 2.778 ± 0.338
1.768ArgPhe: 1.768 ± 0.255
3.002ArgGly: 3.002 ± 0.292
0.982ArgHis: 0.982 ± 0.151
3.03ArgIle: 3.03 ± 0.289
3.227ArgLys: 3.227 ± 0.301
3.62ArgLeu: 3.62 ± 0.316
1.066ArgMet: 1.066 ± 0.163
2.301ArgAsn: 2.301 ± 0.255
1.431ArgPro: 1.431 ± 0.269
1.487ArgGln: 1.487 ± 0.189
1.908ArgArg: 1.908 ± 0.213
2.301ArgSer: 2.301 ± 0.28
2.329ArgThr: 2.329 ± 0.258
2.918ArgVal: 2.918 ± 0.292
0.701ArgTrp: 0.701 ± 0.165
1.852ArgTyr: 1.852 ± 0.265
0.0ArgXaa: 0.0 ± 0.0
Ser
4.714SerAla: 4.714 ± 0.513
0.673SerCys: 0.673 ± 0.137
3.535SerAsp: 3.535 ± 0.32
3.563SerGlu: 3.563 ± 0.494
2.497SerPhe: 2.497 ± 0.284
5.443SerGly: 5.443 ± 0.424
1.178SerHis: 1.178 ± 0.18
3.395SerIle: 3.395 ± 0.32
4.545SerLys: 4.545 ± 0.385
5.584SerLeu: 5.584 ± 0.501
1.543SerMet: 1.543 ± 0.21
3.311SerAsn: 3.311 ± 0.305
2.02SerPro: 2.02 ± 0.223
1.992SerGln: 1.992 ± 0.292
2.385SerArg: 2.385 ± 0.261
4.209SerSer: 4.209 ± 0.533
3.676SerThr: 3.676 ± 0.393
4.153SerVal: 4.153 ± 0.376
0.842SerTrp: 0.842 ± 0.174
2.385SerTyr: 2.385 ± 0.275
0.0SerXaa: 0.0 ± 0.0
Thr
4.63ThrAla: 4.63 ± 0.432
0.645ThrCys: 0.645 ± 0.119
3.816ThrAsp: 3.816 ± 0.363
3.563ThrGlu: 3.563 ± 0.328
2.946ThrPhe: 2.946 ± 0.357
4.097ThrGly: 4.097 ± 0.37
1.291ThrHis: 1.291 ± 0.201
4.125ThrIle: 4.125 ± 0.316
4.293ThrLys: 4.293 ± 0.379
4.854ThrLeu: 4.854 ± 0.354
1.15ThrMet: 1.15 ± 0.173
3.283ThrAsn: 3.283 ± 0.4
2.245ThrPro: 2.245 ± 0.245
2.132ThrGln: 2.132 ± 0.279
2.385ThrArg: 2.385 ± 0.235
3.956ThrSer: 3.956 ± 0.533
3.928ThrThr: 3.928 ± 0.416
3.956ThrVal: 3.956 ± 0.314
0.786ThrTrp: 0.786 ± 0.163
2.862ThrTyr: 2.862 ± 0.223
0.0ThrXaa: 0.0 ± 0.0
Val
4.742ValAla: 4.742 ± 0.368
0.701ValCys: 0.701 ± 0.139
4.153ValAsp: 4.153 ± 0.38
4.882ValGlu: 4.882 ± 0.418
3.002ValPhe: 3.002 ± 0.261
3.872ValGly: 3.872 ± 0.334
1.515ValHis: 1.515 ± 0.257
2.89ValIle: 2.89 ± 0.35
4.854ValLys: 4.854 ± 0.336
4.854ValLeu: 4.854 ± 0.37
1.459ValMet: 1.459 ± 0.231
3.507ValAsn: 3.507 ± 0.281
2.16ValPro: 2.16 ± 0.237
2.217ValGln: 2.217 ± 0.221
3.311ValArg: 3.311 ± 0.275
4.882ValSer: 4.882 ± 0.334
4.265ValThr: 4.265 ± 0.386
4.714ValVal: 4.714 ± 0.41
0.73ValTrp: 0.73 ± 0.12
2.581ValTyr: 2.581 ± 0.273
0.0ValXaa: 0.0 ± 0.0
Trp
1.094TrpAla: 1.094 ± 0.186
0.196TrpCys: 0.196 ± 0.082
1.15TrpAsp: 1.15 ± 0.173
0.758TrpGlu: 0.758 ± 0.147
0.533TrpPhe: 0.533 ± 0.131
0.814TrpGly: 0.814 ± 0.143
0.505TrpHis: 0.505 ± 0.119
0.617TrpIle: 0.617 ± 0.167
0.926TrpLys: 0.926 ± 0.184
1.403TrpLeu: 1.403 ± 0.202
0.449TrpMet: 0.449 ± 0.123
0.673TrpAsn: 0.673 ± 0.146
0.224TrpPro: 0.224 ± 0.099
0.589TrpGln: 0.589 ± 0.126
0.645TrpArg: 0.645 ± 0.141
0.73TrpSer: 0.73 ± 0.174
0.533TrpThr: 0.533 ± 0.113
1.01TrpVal: 1.01 ± 0.172
0.196TrpTrp: 0.196 ± 0.072
0.73TrpTyr: 0.73 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.357TyrAla: 2.357 ± 0.278
0.617TyrCys: 0.617 ± 0.132
2.778TyrAsp: 2.778 ± 0.297
3.03TyrGlu: 3.03 ± 0.323
1.543TyrPhe: 1.543 ± 0.211
2.581TyrGly: 2.581 ± 0.247
0.982TyrHis: 0.982 ± 0.161
2.497TyrIle: 2.497 ± 0.293
2.441TyrLys: 2.441 ± 0.272
3.367TyrLeu: 3.367 ± 0.33
1.15TyrMet: 1.15 ± 0.174
2.104TyrAsn: 2.104 ± 0.238
1.403TyrPro: 1.403 ± 0.186
2.104TyrGln: 2.104 ± 0.276
1.684TyrArg: 1.684 ± 0.187
2.048TyrSer: 2.048 ± 0.311
2.666TyrThr: 2.666 ± 0.266
1.908TyrVal: 1.908 ± 0.215
0.421TyrTrp: 0.421 ± 0.121
1.768TyrTyr: 1.768 ± 0.248
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 181 proteins (35641 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski