Amino acid dipeptide frequency for Rheinheimera phage vB_RspM_Barba31A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
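As a minimal sketch of the idea, the snippet below counts overlapping adjacent residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, the one-letter residue codes, and the choice to normalise by the total number of counted pairs are all assumptions made for illustration; the database may normalise differently.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Pairs are counted within each protein only, never across two proteins.
    Dividing by the total number of counted pairs is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping adjacent pair
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes)
freqs = dipeptide_per_mille(["MKLLE", "MKV"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The order of the two residues matters here, so "KL" and "LK" are counted as different dipeptides, which matches the table listing both AlaCys and CysAla.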

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 1/400 = 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
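The comparison against the uniform expectation can be sketched as follows. The three example values are copied from the table; the function name `classify` and the plain greater-than/less-than rule are assumptions, since the page does not state whether the colouring takes the error estimate into account.

```python
N_STANDARD = 20                                  # standard amino acids
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2    # 1/400 = 0.25 % = 2.5 per mille


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency (per mille) relative to the uniform expectation."""
    if value > expected:
        return "over"    # would be marked in red
    if value < expected:
        return "under"   # would be marked in blue
    return "expected"


# Frequencies in per mille, taken from the table
examples = {"GluLeu": 7.361, "GlnVal": 2.204, "ProGly": 0.039}
labels = {pair: classify(value) for pair, value in examples.items()}

for pair, value in examples.items():
    ratio = value / EXPECTED_PER_MILLE
    print(f"{pair}: {value} per mille, {ratio:.2f}x expected, {labels[pair]}")
```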

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
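For readers who prefer fractions or percentages, the unit conversion is a one-liner. This is only an illustrative sketch; the helper names are hypothetical, and the LeuLeu value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


leu_leu = 6.968  # LeuLeu frequency in per mille, from the table
print(per_mille_to_fraction(leu_leu))
print(per_mille_to_percent(leu_leu))
```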

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.684 ± 0.482
AlaCys: 0.984 ± 0.159
AlaAsp: 3.779 ± 0.465
AlaGlu: 4.566 ± 0.454
AlaPhe: 2.598 ± 0.317
AlaGly: 3.976 ± 0.422
AlaHis: 0.827 ± 0.179
AlaIle: 4.094 ± 0.421
AlaLys: 4.724 ± 0.493
AlaLeu: 5.787 ± 0.726
AlaMet: 1.299 ± 0.232
AlaAsn: 2.952 ± 0.291
AlaPro: 1.771 ± 0.272
AlaGln: 2.204 ± 0.29
AlaArg: 2.008 ± 0.342
AlaSer: 3.818 ± 0.409
AlaThr: 3.976 ± 0.461
AlaVal: 3.976 ± 0.475
AlaTrp: 0.59 ± 0.154
AlaTyr: 3.425 ± 0.401
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.512 ± 0.146
CysCys: 0.276 ± 0.09
CysAsp: 1.063 ± 0.223
CysGlu: 0.669 ± 0.179
CysPhe: 0.748 ± 0.205
CysGly: 0.945 ± 0.183
CysHis: 0.276 ± 0.116
CysIle: 0.748 ± 0.176
CysLys: 1.457 ± 0.245
CysLeu: 0.945 ± 0.181
CysMet: 0.512 ± 0.141
CysAsn: 0.827 ± 0.197
CysPro: 0.827 ± 0.182
CysGln: 0.354 ± 0.109
CysArg: 0.709 ± 0.144
CysSer: 0.984 ± 0.219
CysThr: 0.905 ± 0.242
CysVal: 1.142 ± 0.19
CysTrp: 0.354 ± 0.13
CysTyr: 0.551 ± 0.137
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.385 ± 0.336
AspCys: 0.984 ± 0.164
AspAsp: 3.779 ± 0.419
AspGlu: 3.937 ± 0.404
AspPhe: 2.677 ± 0.385
AspGly: 5.59 ± 0.646
AspHis: 1.024 ± 0.195
AspIle: 4.999 ± 0.493
AspLys: 4.724 ± 0.475
AspLeu: 4.763 ± 0.449
AspMet: 1.299 ± 0.211
AspAsn: 3.228 ± 0.368
AspPro: 1.575 ± 0.258
AspGln: 1.575 ± 0.261
AspArg: 2.244 ± 0.283
AspSer: 3.661 ± 0.525
AspThr: 4.566 ± 0.402
AspVal: 5.039 ± 0.478
AspTrp: 1.771 ± 0.305
AspTyr: 3.228 ± 0.361
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.74 ± 0.433
GluCys: 1.142 ± 0.221
GluAsp: 4.055 ± 0.436
GluGlu: 3.818 ± 0.381
GluPhe: 2.756 ± 0.313
GluGly: 3.661 ± 0.421
GluHis: 1.693 ± 0.248
GluIle: 4.645 ± 0.396
GluLys: 4.33 ± 0.385
GluLeu: 7.361 ± 0.55
GluMet: 1.929 ± 0.272
GluAsn: 3.504 ± 0.347
GluPro: 1.535 ± 0.227
GluGln: 2.795 ± 0.359
GluArg: 2.952 ± 0.333
GluSer: 3.7 ± 0.415
GluThr: 3.11 ± 0.309
GluVal: 4.881 ± 0.404
GluTrp: 1.299 ± 0.223
GluTyr: 3.7 ± 0.365
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.716 ± 0.35
PheCys: 0.748 ± 0.163
PheAsp: 2.992 ± 0.296
PheGlu: 2.952 ± 0.354
PhePhe: 1.85 ± 0.284
PheGly: 2.362 ± 0.264
PheHis: 0.512 ± 0.138
PheIle: 3.071 ± 0.3
PheLys: 2.834 ± 0.335
PheLeu: 3.385 ± 0.32
PheMet: 1.26 ± 0.18
PheAsn: 3.228 ± 0.347
PhePro: 1.22 ± 0.202
PheGln: 1.338 ± 0.226
PheArg: 1.457 ± 0.245
PheSer: 3.307 ± 0.37
PheThr: 3.267 ± 0.364
PheVal: 2.913 ± 0.326
PheTrp: 0.551 ± 0.156
PheTyr: 1.299 ± 0.249
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.74 ± 0.436
GlyCys: 1.024 ± 0.206
GlyAsp: 4.37 ± 0.639
GlyGlu: 3.661 ± 0.437
GlyPhe: 3.267 ± 0.405
GlyGly: 4.606 ± 0.423
GlyHis: 0.787 ± 0.174
GlyIle: 3.149 ± 0.337
GlyLys: 4.999 ± 0.411
GlyLeu: 5.275 ± 0.485
GlyMet: 1.89 ± 0.268
GlyAsn: 4.527 ± 0.791
GlyPro: 0.236 ± 0.106
GlyGln: 1.929 ± 0.285
GlyArg: 2.48 ± 0.289
GlySer: 5.59 ± 0.513
GlyThr: 3.937 ± 0.601
GlyVal: 5.393 ± 0.481
GlyTrp: 1.496 ± 0.299
GlyTyr: 3.385 ± 0.708
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.709 ± 0.162
HisCys: 0.197 ± 0.099
HisAsp: 0.669 ± 0.175
HisGlu: 1.299 ± 0.236
HisPhe: 0.512 ± 0.148
HisGly: 0.866 ± 0.198
HisHis: 0.276 ± 0.111
HisIle: 1.024 ± 0.215
HisLys: 1.378 ± 0.231
HisLeu: 2.008 ± 0.306
HisMet: 0.276 ± 0.099
HisAsn: 0.866 ± 0.148
HisPro: 0.748 ± 0.224
HisGln: 0.433 ± 0.121
HisArg: 0.63 ± 0.132
HisSer: 1.22 ± 0.205
HisThr: 1.299 ± 0.196
HisVal: 1.142 ± 0.21
HisTrp: 0.236 ± 0.084
HisTyr: 0.787 ± 0.164
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.7 ± 0.398
IleCys: 0.669 ± 0.176
IleAsp: 4.527 ± 0.467
IleGlu: 3.385 ± 0.415
IlePhe: 1.968 ± 0.265
IleGly: 3.858 ± 0.406
IleHis: 1.181 ± 0.222
IleIle: 2.992 ± 0.347
IleLys: 5.078 ± 0.491
IleLeu: 3.818 ± 0.356
IleMet: 1.299 ± 0.221
IleAsn: 3.543 ± 0.352
IlePro: 2.165 ± 0.311
IleGln: 2.519 ± 0.309
IleArg: 2.874 ± 0.3
IleSer: 4.488 ± 0.389
IleThr: 4.684 ± 0.473
IleVal: 3.661 ± 0.398
IleTrp: 0.669 ± 0.156
IleTyr: 2.795 ± 0.326
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.055 ± 0.464
LysCys: 0.905 ± 0.222
LysAsp: 4.015 ± 0.418
LysGlu: 5.039 ± 0.493
LysPhe: 2.992 ± 0.346
LysGly: 3.504 ± 0.391
LysHis: 1.732 ± 0.258
LysIle: 4.409 ± 0.393
LysLys: 3.622 ± 0.398
LysLeu: 5.393 ± 0.464
LysMet: 2.323 ± 0.339
LysAsn: 3.031 ± 0.379
LysPro: 2.637 ± 0.338
LysGln: 4.055 ± 0.406
LysArg: 3.189 ± 0.419
LysSer: 4.094 ± 0.377
LysThr: 3.897 ± 0.454
LysVal: 5.393 ± 0.422
LysTrp: 1.142 ± 0.209
LysTyr: 3.307 ± 0.374
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.59 ± 0.576
LeuCys: 1.457 ± 0.215
LeuAsp: 4.96 ± 0.449
LeuGlu: 6.535 ± 0.454
LeuPhe: 3.858 ± 0.502
LeuGly: 4.881 ± 0.487
LeuHis: 1.26 ± 0.227
LeuIle: 4.684 ± 0.397
LeuLys: 5.551 ± 0.491
LeuLeu: 6.968 ± 0.596
LeuMet: 2.204 ± 0.286
LeuAsn: 4.842 ± 0.457
LeuPro: 3.346 ± 0.367
LeuGln: 3.661 ± 0.385
LeuArg: 3.464 ± 0.295
LeuSer: 6.377 ± 0.493
LeuThr: 5.393 ± 0.476
LeuVal: 5.472 ± 0.509
LeuTrp: 0.945 ± 0.222
LeuTyr: 3.11 ± 0.321
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.929 ± 0.263
MetCys: 0.433 ± 0.13
MetAsp: 0.984 ± 0.178
MetGlu: 1.771 ± 0.249
MetPhe: 1.338 ± 0.24
MetGly: 1.024 ± 0.207
MetHis: 0.118 ± 0.075
MetIle: 1.22 ± 0.241
MetLys: 2.008 ± 0.31
MetLeu: 2.323 ± 0.331
MetMet: 0.709 ± 0.171
MetAsn: 1.22 ± 0.202
MetPro: 0.669 ± 0.188
MetGln: 1.338 ± 0.203
MetArg: 0.748 ± 0.157
MetSer: 2.362 ± 0.356
MetThr: 1.457 ± 0.26
MetVal: 1.693 ± 0.301
MetTrp: 0.354 ± 0.123
MetTyr: 0.945 ± 0.179
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.149 ± 0.49
AsnCys: 0.905 ± 0.285
AsnAsp: 2.913 ± 0.346
AsnGlu: 2.874 ± 0.365
AsnPhe: 2.008 ± 0.303
AsnGly: 4.527 ± 0.628
AsnHis: 0.59 ± 0.171
AsnIle: 3.189 ± 0.315
AsnLys: 4.527 ± 0.424
AsnLeu: 4.133 ± 0.448
AsnMet: 1.378 ± 0.24
AsnAsn: 3.149 ± 0.345
AsnPro: 2.598 ± 0.329
AsnGln: 2.48 ± 0.328
AsnArg: 2.283 ± 0.293
AsnSer: 2.677 ± 0.287
AsnThr: 4.448 ± 1.101
AsnVal: 4.094 ± 0.425
AsnTrp: 0.709 ± 0.212
AsnTyr: 2.441 ± 0.316
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.929 ± 0.305
ProCys: 0.079 ± 0.053
ProAsp: 3.031 ± 0.328
ProGlu: 2.716 ± 0.337
ProPhe: 1.614 ± 0.245
ProGly: 0.039 ± 0.039
ProHis: 0.433 ± 0.111
ProIle: 2.323 ± 0.285
ProLys: 1.771 ± 0.265
ProLeu: 2.716 ± 0.321
ProMet: 0.472 ± 0.137
ProAsn: 1.732 ± 0.273
ProPro: 0.905 ± 0.224
ProGln: 1.063 ± 0.217
ProArg: 1.102 ± 0.233
ProSer: 2.047 ± 0.281
ProThr: 3.189 ± 0.368
ProVal: 3.189 ± 0.433
ProTrp: 0.315 ± 0.097
ProTyr: 1.968 ± 0.241
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.795 ± 0.406
GlnCys: 0.512 ± 0.131
GlnAsp: 1.929 ± 0.27
GlnGlu: 3.7 ± 0.383
GlnPhe: 1.575 ± 0.226
GlnGly: 1.614 ± 0.25
GlnHis: 0.787 ± 0.16
GlnIle: 2.047 ± 0.285
GlnLys: 2.283 ± 0.301
GlnLeu: 3.425 ± 0.34
GlnMet: 1.024 ± 0.274
GlnAsn: 2.204 ± 0.422
GlnPro: 1.535 ± 0.236
GlnGln: 1.811 ± 0.323
GlnArg: 2.047 ± 0.315
GlnSer: 2.441 ± 0.308
GlnThr: 1.929 ± 0.261
GlnVal: 2.204 ± 0.239
GlnTrp: 0.394 ± 0.115
GlnTyr: 2.008 ± 0.29
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.519 ± 0.346
ArgCys: 0.905 ± 0.216
ArgAsp: 2.283 ± 0.367
ArgGlu: 2.204 ± 0.255
ArgPhe: 1.85 ± 0.277
ArgGly: 2.559 ± 0.37
ArgHis: 0.827 ± 0.159
ArgIle: 2.48 ± 0.326
ArgLys: 2.519 ± 0.345
ArgLeu: 3.818 ± 0.372
ArgMet: 0.984 ± 0.175
ArgAsn: 2.323 ± 0.363
ArgPro: 1.496 ± 0.265
ArgGln: 1.732 ± 0.238
ArgArg: 1.693 ± 0.332
ArgSer: 2.204 ± 0.304
ArgThr: 2.244 ± 0.244
ArgVal: 2.952 ± 0.38
ArgTrp: 0.394 ± 0.114
ArgTyr: 2.008 ± 0.274
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.133 ± 0.458
SerCys: 1.063 ± 0.208
SerAsp: 5.236 ± 0.406
SerGlu: 3.897 ± 0.39
SerPhe: 3.149 ± 0.317
SerGly: 4.999 ± 0.454
SerHis: 1.102 ± 0.211
SerIle: 3.425 ± 0.351
SerLys: 4.803 ± 0.417
SerLeu: 5.551 ± 0.588
SerMet: 1.575 ± 0.237
SerAsn: 3.543 ± 0.379
SerPro: 2.323 ± 0.343
SerGln: 2.204 ± 0.312
SerArg: 2.441 ± 0.336
SerSer: 4.803 ± 0.527
SerThr: 3.897 ± 0.389
SerVal: 4.645 ± 0.472
SerTrp: 0.827 ± 0.169
SerTyr: 2.952 ± 0.333
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.921 ± 0.572
ThrCys: 0.512 ± 0.15
ThrAsp: 4.291 ± 0.355
ThrGlu: 3.937 ± 0.365
ThrPhe: 2.756 ± 0.309
ThrGly: 6.495 ± 1.136
ThrHis: 1.063 ± 0.236
ThrIle: 4.133 ± 0.436
ThrLys: 4.133 ± 0.432
ThrLeu: 4.842 ± 0.539
ThrMet: 0.984 ± 0.175
ThrAsn: 3.228 ± 0.412
ThrPro: 2.716 ± 0.389
ThrGln: 2.283 ± 0.289
ThrArg: 1.811 ± 0.256
ThrSer: 3.858 ± 0.497
ThrThr: 4.527 ± 0.511
ThrVal: 4.763 ± 0.418
ThrTrp: 0.63 ± 0.132
ThrTyr: 2.204 ± 0.3
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.527 ± 0.583
ValCys: 0.905 ± 0.188
ValAsp: 5.157 ± 0.449
ValGlu: 5.629 ± 0.504
ValPhe: 3.189 ± 0.331
ValGly: 5.472 ± 0.45
ValHis: 1.181 ± 0.267
ValIle: 3.622 ± 0.391
ValLys: 4.212 ± 0.426
ValLeu: 4.921 ± 0.409
ValMet: 1.968 ± 0.275
ValAsn: 3.818 ± 0.363
ValPro: 2.559 ± 0.296
ValGln: 2.165 ± 0.336
ValArg: 3.071 ± 0.341
ValSer: 5.118 ± 0.505
ValThr: 4.33 ± 0.477
ValVal: 6.259 ± 0.517
ValTrp: 1.378 ± 0.211
ValTyr: 2.795 ± 0.278
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.669 ± 0.15
TrpCys: 0.157 ± 0.075
TrpAsp: 0.59 ± 0.168
TrpGlu: 1.024 ± 0.191
TrpPhe: 0.748 ± 0.217
TrpGly: 0.709 ± 0.167
TrpHis: 0.197 ± 0.086
TrpIle: 0.787 ± 0.167
TrpLys: 0.984 ± 0.211
TrpLeu: 1.85 ± 0.333
TrpMet: 0.354 ± 0.12
TrpAsn: 0.748 ± 0.313
TrpPro: 0.394 ± 0.115
TrpGln: 0.945 ± 0.179
TrpArg: 0.787 ± 0.176
TrpSer: 0.827 ± 0.183
TrpThr: 0.787 ± 0.179
TrpVal: 1.22 ± 0.205
TrpTrp: 0.354 ± 0.126
TrpTyr: 0.472 ± 0.167
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.637 ± 0.3
TyrCys: 0.945 ± 0.212
TyrAsp: 3.425 ± 0.403
TyrGlu: 2.992 ± 0.317
TyrPhe: 1.811 ± 0.254
TyrGly: 4.055 ± 0.705
TyrHis: 0.709 ± 0.188
TyrIle: 2.834 ± 0.33
TyrLys: 2.795 ± 0.32
TyrLeu: 5.275 ± 0.521
TyrMet: 0.866 ± 0.175
TyrAsn: 2.598 ± 0.349
TyrPro: 1.378 ± 0.236
TyrGln: 1.378 ± 0.214
TyrArg: 2.008 ± 0.267
TyrSer: 2.992 ± 0.311
TyrThr: 2.244 ± 0.317
TyrVal: 2.244 ± 0.297
TyrTrp: 0.197 ± 0.088
TyrTyr: 2.244 ± 0.296
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 143 proteins (25404 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
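A protein-level bootstrap can be sketched roughly as below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are assumptions for illustration; the exact procedure used by the database is not described here.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes)
proteins = ["MKLLEAG", "MKVLLE", "MAGLLK", "MSTLE"]
value = pair_per_mille(proteins, "LL")
error = bootstrap_error(proteins, "LL")
print(f"LL: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.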

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski