Amino acid dipeptide frequency for Vibrio phage QH

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
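As an illustration, the counting step could be sketched as below. This is a minimal sketch, not the code used to produce this page: the function names, the toy sequences, the choice to count overlapping pairs within each protein only, and the normalisation by the total number of counted pairs are all assumptions.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def three_letter(residue):
    """Map a one-letter code to three letters; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein.

    Assumption: pairs are never formed across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[three_letter(first) + three_letter(second)] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille.

    Assumption: frequencies are normalised by the total number of pairs.
    """
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    codes = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(codes, repeat=2)
    }


# Hypothetical toy proteome, not taken from Vibrio phage QH.
toy = ["MAAG", "GGA"]
freqs = dipeptide_per_mille(toy)
print(freqs["AlaAla"], freqs["GlyGly"], len(freqs))
```

In a real run the sequences would come from the proteome FASTA file rather than a hard-coded list.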

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
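The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the exact rule the page uses for colouring cells is an assumption here (a plain comparison with 1000/400 = 2.5‰); the two example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation for one of the 400 dipeptides, in per mille: 2.5
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD**2


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a simple threshold comparison, ignoring the error estimate.
    """
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table: AlaAla 10.423, CysCys 0.239
print(per_mille_to_fraction(10.423), classify(10.423))
print(per_mille_to_fraction(0.239), classify(0.239))
```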

For more information, see the sequence space article on Wikipedia.

Dipeptide (first residue followed by second residue): frequency in ‰ ± error

AlaAla: 10.423 ± 1.23
AlaCys: 0.637 ± 0.24
AlaAsp: 5.013 ± 0.546
AlaGlu: 6.922 ± 0.888
AlaPhe: 2.785 ± 0.37
AlaGly: 5.808 ± 0.63
AlaHis: 1.591 ± 0.393
AlaIle: 5.411 ± 0.642
AlaLys: 6.365 ± 1.106
AlaLeu: 7.32 ± 0.754
AlaMet: 3.024 ± 0.454
AlaAsn: 2.944 ± 0.409
AlaPro: 3.024 ± 0.452
AlaGln: 3.899 ± 0.696
AlaArg: 4.535 ± 0.497
AlaSer: 6.206 ± 0.697
AlaThr: 5.808 ± 0.685
AlaVal: 6.843 ± 0.734
AlaTrp: 1.353 ± 0.351
AlaTyr: 2.705 ± 0.481
AlaXaa: 0.0 ± 0.0

CysAla: 0.477 ± 0.194
CysCys: 0.239 ± 0.108
CysAsp: 0.318 ± 0.158
CysGlu: 0.637 ± 0.182
CysPhe: 0.318 ± 0.113
CysGly: 1.034 ± 0.324
CysHis: 0.159 ± 0.098
CysIle: 0.637 ± 0.183
CysLys: 0.557 ± 0.2
CysLeu: 0.477 ± 0.181
CysMet: 0.239 ± 0.14
CysAsn: 0.159 ± 0.108
CysPro: 0.557 ± 0.265
CysGln: 0.398 ± 0.154
CysArg: 0.398 ± 0.192
CysSer: 0.477 ± 0.202
CysThr: 0.796 ± 0.318
CysVal: 0.398 ± 0.174
CysTrp: 0.08 ± 0.07
CysTyr: 0.159 ± 0.108
CysXaa: 0.0 ± 0.0

AspAla: 5.251 ± 0.674
AspCys: 0.875 ± 0.271
AspAsp: 3.66 ± 0.618
AspGlu: 4.933 ± 0.848
AspPhe: 2.944 ± 0.503
AspGly: 5.331 ± 0.941
AspHis: 0.716 ± 0.288
AspIle: 3.501 ± 0.479
AspLys: 4.297 ± 0.525
AspLeu: 5.57 ± 0.721
AspMet: 1.989 ± 0.474
AspAsn: 2.069 ± 0.407
AspPro: 3.421 ± 0.495
AspGln: 0.955 ± 0.272
AspArg: 3.024 ± 0.55
AspSer: 3.978 ± 0.596
AspThr: 3.899 ± 0.58
AspVal: 4.854 ± 0.585
AspTrp: 1.194 ± 0.35
AspTyr: 1.83 ± 0.444
AspXaa: 0.0 ± 0.0

GluAla: 5.331 ± 0.634
GluCys: 0.477 ± 0.164
GluAsp: 4.615 ± 0.558
GluGlu: 5.331 ± 0.781
GluPhe: 1.83 ± 0.323
GluGly: 3.819 ± 0.628
GluHis: 1.591 ± 0.395
GluIle: 3.501 ± 0.56
GluLys: 3.342 ± 0.601
GluLeu: 5.649 ± 0.741
GluMet: 2.467 ± 0.418
GluAsn: 2.467 ± 0.497
GluPro: 1.91 ± 0.385
GluGln: 3.581 ± 0.437
GluArg: 4.058 ± 0.589
GluSer: 3.581 ± 0.48
GluThr: 2.785 ± 0.43
GluVal: 4.774 ± 0.596
GluTrp: 1.75 ± 0.397
GluTyr: 2.148 ± 0.426
GluXaa: 0.0 ± 0.0

PheAla: 2.705 ± 0.506
PheCys: 0.239 ± 0.129
PheAsp: 2.546 ± 0.41
PheGlu: 1.353 ± 0.326
PhePhe: 0.716 ± 0.267
PheGly: 3.421 ± 0.628
PheHis: 0.398 ± 0.159
PheIle: 1.432 ± 0.3
PheLys: 1.75 ± 0.48
PheLeu: 2.705 ± 0.394
PheMet: 0.875 ± 0.32
PheAsn: 2.307 ± 0.397
PhePro: 1.273 ± 0.31
PheGln: 1.273 ± 0.405
PheArg: 1.91 ± 0.405
PheSer: 2.307 ± 0.516
PheThr: 2.307 ± 0.429
PheVal: 1.91 ± 0.469
PheTrp: 0.318 ± 0.176
PheTyr: 1.034 ± 0.323
PheXaa: 0.0 ± 0.0

GlyAla: 6.206 ± 0.877
GlyCys: 0.637 ± 0.264
GlyAsp: 4.933 ± 0.557
GlyGlu: 3.581 ± 0.405
GlyPhe: 2.626 ± 0.441
GlyGly: 9.946 ± 3.538
GlyHis: 1.432 ± 0.368
GlyIle: 3.74 ± 0.477
GlyLys: 4.297 ± 0.585
GlyLeu: 4.774 ± 0.65
GlyMet: 2.467 ± 0.506
GlyAsn: 5.49 ± 1.073
GlyPro: 1.83 ± 0.375
GlyGln: 3.183 ± 0.432
GlyArg: 3.899 ± 0.529
GlySer: 6.365 ± 0.983
GlyThr: 5.172 ± 0.711
GlyVal: 5.57 ± 0.93
GlyTrp: 1.114 ± 0.228
GlyTyr: 3.74 ± 0.441
GlyXaa: 0.0 ± 0.0

HisAla: 1.273 ± 0.335
HisCys: 0.159 ± 0.106
HisAsp: 1.034 ± 0.252
HisGlu: 1.034 ± 0.216
HisPhe: 0.318 ± 0.18
HisGly: 1.591 ± 0.507
HisHis: 0.159 ± 0.109
HisIle: 0.955 ± 0.335
HisLys: 0.716 ± 0.234
HisLeu: 1.591 ± 0.352
HisMet: 0.875 ± 0.292
HisAsn: 0.875 ± 0.179
HisPro: 0.716 ± 0.224
HisGln: 0.637 ± 0.212
HisArg: 1.91 ± 0.538
HisSer: 0.955 ± 0.295
HisThr: 0.637 ± 0.181
HisVal: 1.432 ± 0.293
HisTrp: 0.159 ± 0.109
HisTyr: 0.716 ± 0.261
HisXaa: 0.0 ± 0.0

IleAla: 6.763 ± 0.677
IleCys: 0.398 ± 0.162
IleAsp: 4.376 ± 0.606
IleGlu: 3.978 ± 0.697
IlePhe: 1.034 ± 0.434
IleGly: 3.74 ± 0.476
IleHis: 0.398 ± 0.173
IleIle: 2.705 ± 0.416
IleLys: 3.262 ± 0.47
IleLeu: 2.546 ± 0.397
IleMet: 1.512 ± 0.309
IleAsn: 2.864 ± 0.449
IlePro: 3.262 ± 0.355
IleGln: 3.103 ± 0.466
IleArg: 3.66 ± 0.575
IleSer: 3.501 ± 0.602
IleThr: 2.626 ± 0.556
IleVal: 2.864 ± 0.395
IleTrp: 1.114 ± 0.295
IleTyr: 0.716 ± 0.187
IleXaa: 0.0 ± 0.0

LysAla: 5.888 ± 0.938
LysCys: 0.239 ± 0.188
LysAsp: 3.342 ± 0.606
LysGlu: 3.262 ± 0.617
LysPhe: 1.75 ± 0.301
LysGly: 4.933 ± 0.778
LysHis: 1.194 ± 0.319
LysIle: 2.864 ± 0.375
LysLys: 4.535 ± 0.723
LysLeu: 3.819 ± 0.556
LysMet: 2.467 ± 0.496
LysAsn: 2.705 ± 0.527
LysPro: 2.307 ± 0.501
LysGln: 1.91 ± 0.338
LysArg: 2.944 ± 0.569
LysSer: 3.978 ± 0.609
LysThr: 2.228 ± 0.47
LysVal: 3.66 ± 0.558
LysTrp: 1.034 ± 0.266
LysTyr: 2.864 ± 0.532
LysXaa: 0.0 ± 0.0

LeuAla: 6.684 ± 0.654
LeuCys: 0.557 ± 0.221
LeuAsp: 5.251 ± 0.578
LeuGlu: 3.899 ± 0.611
LeuPhe: 1.83 ± 0.365
LeuGly: 6.604 ± 0.862
LeuHis: 1.75 ± 0.404
LeuIle: 4.694 ± 0.574
LeuLys: 3.74 ± 0.615
LeuLeu: 3.342 ± 0.431
LeuMet: 2.546 ± 0.545
LeuAsn: 3.581 ± 0.571
LeuPro: 3.183 ± 0.489
LeuGln: 2.626 ± 0.328
LeuArg: 4.456 ± 0.516
LeuSer: 4.456 ± 0.511
LeuThr: 4.217 ± 0.504
LeuVal: 5.411 ± 0.853
LeuTrp: 1.114 ± 0.355
LeuTyr: 1.034 ± 0.293
LeuXaa: 0.0 ± 0.0

MetAla: 3.421 ± 0.824
MetCys: 0.477 ± 0.176
MetAsp: 1.83 ± 0.293
MetGlu: 2.546 ± 0.533
MetPhe: 0.796 ± 0.224
MetGly: 2.228 ± 0.426
MetHis: 0.716 ± 0.208
MetIle: 1.194 ± 0.25
MetLys: 1.91 ± 0.33
MetLeu: 2.069 ± 0.414
MetMet: 1.353 ± 0.379
MetAsn: 1.432 ± 0.302
MetPro: 1.353 ± 0.329
MetGln: 1.273 ± 0.388
MetArg: 1.83 ± 0.452
MetSer: 2.626 ± 0.487
MetThr: 1.512 ± 0.36
MetVal: 1.989 ± 0.503
MetTrp: 0.318 ± 0.142
MetTyr: 1.114 ± 0.301
MetXaa: 0.0 ± 0.0

AsnAla: 5.092 ± 0.609
AsnCys: 0.477 ± 0.236
AsnAsp: 2.467 ± 0.548
AsnGlu: 3.103 ± 0.54
AsnPhe: 1.512 ± 0.249
AsnGly: 5.331 ± 1.227
AsnHis: 0.637 ± 0.224
AsnIle: 2.069 ± 0.34
AsnLys: 2.626 ± 0.359
AsnLeu: 2.148 ± 0.337
AsnMet: 1.273 ± 0.309
AsnAsn: 2.387 ± 0.373
AsnPro: 1.671 ± 0.409
AsnGln: 1.83 ± 0.381
AsnArg: 3.024 ± 0.555
AsnSer: 3.262 ± 0.642
AsnThr: 3.183 ± 0.464
AsnVal: 2.467 ± 0.408
AsnTrp: 1.512 ± 0.435
AsnTyr: 1.353 ± 0.382
AsnXaa: 0.0 ± 0.0

ProAla: 4.137 ± 0.602
ProCys: 0.239 ± 0.122
ProAsp: 3.581 ± 0.445
ProGlu: 2.864 ± 0.482
ProPhe: 1.512 ± 0.266
ProGly: 0.239 ± 0.126
ProHis: 0.955 ± 0.319
ProIle: 2.228 ± 0.354
ProLys: 1.512 ± 0.287
ProLeu: 2.228 ± 0.428
ProMet: 1.273 ± 0.317
ProAsn: 2.546 ± 0.448
ProPro: 1.432 ± 0.472
ProGln: 1.432 ± 0.241
ProArg: 1.91 ± 0.399
ProSer: 3.581 ± 0.466
ProThr: 3.183 ± 0.496
ProVal: 2.705 ± 0.61
ProTrp: 0.637 ± 0.25
ProTyr: 1.114 ± 0.397
ProXaa: 0.0 ± 0.0

GlnAla: 3.581 ± 0.45
GlnCys: 0.08 ± 0.068
GlnAsp: 1.83 ± 0.341
GlnGlu: 2.864 ± 0.47
GlnPhe: 1.034 ± 0.224
GlnGly: 2.705 ± 0.364
GlnHis: 0.716 ± 0.245
GlnIle: 2.467 ± 0.332
GlnLys: 1.432 ± 0.26
GlnLeu: 3.183 ± 0.543
GlnMet: 1.432 ± 0.303
GlnAsn: 1.353 ± 0.394
GlnPro: 1.83 ± 0.364
GlnGln: 1.671 ± 0.394
GlnArg: 2.307 ± 0.542
GlnSer: 2.387 ± 0.365
GlnThr: 3.024 ± 0.383
GlnVal: 2.228 ± 0.344
GlnTrp: 1.194 ± 0.27
GlnTyr: 1.432 ± 0.298
GlnXaa: 0.0 ± 0.0

ArgAla: 4.137 ± 0.503
ArgCys: 0.557 ± 0.201
ArgAsp: 2.944 ± 0.399
ArgGlu: 3.183 ± 0.432
ArgPhe: 2.705 ± 0.621
ArgGly: 4.456 ± 0.854
ArgHis: 0.716 ± 0.342
ArgIle: 3.342 ± 0.458
ArgLys: 3.581 ± 0.563
ArgLeu: 5.172 ± 0.699
ArgMet: 2.626 ± 0.367
ArgAsn: 3.183 ± 0.569
ArgPro: 2.148 ± 0.521
ArgGln: 1.671 ± 0.351
ArgArg: 3.421 ± 0.617
ArgSer: 3.024 ± 0.455
ArgThr: 2.387 ± 0.448
ArgVal: 3.342 ± 0.614
ArgTrp: 1.353 ± 0.418
ArgTyr: 2.228 ± 0.382
ArgXaa: 0.0 ± 0.0

SerAla: 5.729 ± 0.736
SerCys: 0.239 ± 0.131
SerAsp: 4.933 ± 0.601
SerGlu: 4.137 ± 0.509
SerPhe: 2.467 ± 0.454
SerGly: 5.331 ± 0.756
SerHis: 1.194 ± 0.263
SerIle: 4.376 ± 0.554
SerLys: 3.819 ± 0.725
SerLeu: 4.535 ± 0.579
SerMet: 2.148 ± 0.357
SerAsn: 3.66 ± 0.769
SerPro: 3.103 ± 0.516
SerGln: 2.069 ± 0.321
SerArg: 3.183 ± 0.456
SerSer: 4.137 ± 0.571
SerThr: 3.103 ± 0.546
SerVal: 4.694 ± 0.734
SerTrp: 0.955 ± 0.276
SerTyr: 2.228 ± 0.378
SerXaa: 0.0 ± 0.0

ThrAla: 5.729 ± 0.755
ThrCys: 0.477 ± 0.191
ThrAsp: 3.024 ± 0.525
ThrGlu: 3.581 ± 0.471
ThrPhe: 2.705 ± 0.509
ThrGly: 5.411 ± 0.726
ThrHis: 0.716 ± 0.246
ThrIle: 3.024 ± 0.45
ThrLys: 2.705 ± 0.447
ThrLeu: 4.694 ± 0.474
ThrMet: 1.194 ± 0.362
ThrAsn: 2.148 ± 0.433
ThrPro: 2.626 ± 0.308
ThrGln: 2.546 ± 0.396
ThrArg: 2.546 ± 0.427
ThrSer: 3.024 ± 0.543
ThrThr: 2.626 ± 0.54
ThrVal: 4.376 ± 0.653
ThrTrp: 0.796 ± 0.405
ThrTyr: 1.671 ± 0.336
ThrXaa: 0.0 ± 0.0

ValAla: 5.411 ± 0.613
ValCys: 0.716 ± 0.242
ValAsp: 4.933 ± 0.661
ValGlu: 4.535 ± 0.405
ValPhe: 1.83 ± 0.418
ValGly: 5.968 ± 0.908
ValHis: 1.91 ± 0.412
ValIle: 3.501 ± 0.587
ValLys: 4.456 ± 0.687
ValLeu: 5.331 ± 0.58
ValMet: 1.591 ± 0.396
ValAsn: 2.546 ± 0.662
ValPro: 2.069 ± 0.344
ValGln: 2.626 ± 0.548
ValArg: 3.978 ± 0.629
ValSer: 4.933 ± 0.85
ValThr: 3.342 ± 0.647
ValVal: 5.251 ± 0.756
ValTrp: 1.194 ± 0.293
ValTyr: 2.705 ± 0.565
ValXaa: 0.0 ± 0.0

TrpAla: 1.273 ± 0.26
TrpCys: 0.318 ± 0.131
TrpAsp: 1.194 ± 0.338
TrpGlu: 1.034 ± 0.284
TrpPhe: 0.875 ± 0.41
TrpGly: 0.716 ± 0.178
TrpHis: 0.239 ± 0.151
TrpIle: 1.034 ± 0.282
TrpLys: 1.034 ± 0.276
TrpLeu: 1.671 ± 0.389
TrpMet: 0.398 ± 0.177
TrpAsn: 0.557 ± 0.204
TrpPro: 0.239 ± 0.161
TrpGln: 0.557 ± 0.206
TrpArg: 1.353 ± 0.281
TrpSer: 1.671 ± 0.36
TrpThr: 1.353 ± 0.274
TrpVal: 1.432 ± 0.322
TrpTrp: 0.318 ± 0.136
TrpTyr: 0.716 ± 0.213
TrpXaa: 0.0 ± 0.0

TyrAla: 3.103 ± 0.425
TyrCys: 0.477 ± 0.184
TyrAsp: 2.467 ± 0.31
TyrGlu: 2.307 ± 0.455
TyrPhe: 1.194 ± 0.296
TyrGly: 2.387 ± 0.359
TyrHis: 0.557 ± 0.194
TyrIle: 1.75 ± 0.346
TyrLys: 1.989 ± 0.519
TyrLeu: 2.148 ± 0.407
TyrMet: 0.159 ± 0.111
TyrAsn: 2.148 ± 0.36
TyrPro: 1.353 ± 0.296
TyrGln: 1.512 ± 0.311
TyrArg: 1.83 ± 0.423
TyrSer: 1.591 ± 0.358
TyrThr: 1.432 ± 0.33
TyrVal: 2.626 ± 0.407
TyrTrp: 0.477 ± 0.246
TyrTyr: 1.114 ± 0.333
TyrXaa: 0.0 ± 0.0

XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 48 proteins (12569 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
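A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the error is taken as the standard deviation across resamples. The page does not say which spread statistic it reports, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumptions: proteins are resampled with replacement as whole units,
    and the error is the standard deviation over the resampled estimates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    key = tuple(dipeptide)  # e.g. "AA" -> ("A", "A")
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[key] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code, not the phage proteome.
toy = ["MAAG", "GGA", "AAAA", "MKV"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein together, so variation between proteins is reflected in the error.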

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski