Amino acid dipepetide frequency for Vibrio phage CP-T1 (Bacteriophage CP-T1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.73AlaAla: 6.73 ± 0.89
0.43AlaCys: 0.43 ± 0.19
4.582AlaAsp: 4.582 ± 0.635
4.296AlaGlu: 4.296 ± 0.459
2.148AlaPhe: 2.148 ± 0.423
4.511AlaGly: 4.511 ± 0.587
1.862AlaHis: 1.862 ± 0.353
5.012AlaIle: 5.012 ± 0.582
4.081AlaLys: 4.081 ± 0.548
6.802AlaLeu: 6.802 ± 0.741
1.933AlaMet: 1.933 ± 0.333
3.079AlaAsn: 3.079 ± 0.502
2.864AlaPro: 2.864 ± 0.472
3.795AlaGln: 3.795 ± 0.609
4.654AlaArg: 4.654 ± 0.509
4.224AlaSer: 4.224 ± 0.523
5.441AlaThr: 5.441 ± 0.589
5.513AlaVal: 5.513 ± 0.683
0.859AlaTrp: 0.859 ± 0.286
2.363AlaTyr: 2.363 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
0.931CysAla: 0.931 ± 0.215
0.072CysCys: 0.072 ± 0.072
1.002CysAsp: 1.002 ± 0.249
0.931CysGlu: 0.931 ± 0.278
0.501CysPhe: 0.501 ± 0.194
1.504CysGly: 1.504 ± 0.363
0.501CysHis: 0.501 ± 0.225
0.788CysIle: 0.788 ± 0.27
1.002CysLys: 1.002 ± 0.298
0.859CysLeu: 0.859 ± 0.234
0.358CysMet: 0.358 ± 0.169
0.644CysAsn: 0.644 ± 0.282
0.859CysPro: 0.859 ± 0.242
0.358CysGln: 0.358 ± 0.169
0.43CysArg: 0.43 ± 0.171
1.074CysSer: 1.074 ± 0.244
1.074CysThr: 1.074 ± 0.253
1.289CysVal: 1.289 ± 0.285
0.215CysTrp: 0.215 ± 0.166
0.286CysTyr: 0.286 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
5.799AspAla: 5.799 ± 0.598
0.788AspCys: 0.788 ± 0.248
5.513AspAsp: 5.513 ± 0.595
5.083AspGlu: 5.083 ± 0.553
1.862AspPhe: 1.862 ± 0.478
5.656AspGly: 5.656 ± 0.551
1.36AspHis: 1.36 ± 0.251
5.298AspIle: 5.298 ± 0.663
3.15AspLys: 3.15 ± 0.434
4.296AspLeu: 4.296 ± 0.624
1.718AspMet: 1.718 ± 0.378
3.651AspAsn: 3.651 ± 0.639
3.651AspPro: 3.651 ± 0.523
1.074AspGln: 1.074 ± 0.289
2.578AspArg: 2.578 ± 0.395
3.58AspSer: 3.58 ± 0.581
5.012AspThr: 5.012 ± 0.682
5.012AspVal: 5.012 ± 0.589
1.36AspTrp: 1.36 ± 0.274
2.578AspTyr: 2.578 ± 0.476
0.0AspXaa: 0.0 ± 0.0
Glu
3.795GluAla: 3.795 ± 0.639
1.146GluCys: 1.146 ± 0.298
2.291GluAsp: 2.291 ± 0.444
2.434GluGlu: 2.434 ± 0.494
2.363GluPhe: 2.363 ± 0.374
2.721GluGly: 2.721 ± 0.385
1.002GluHis: 1.002 ± 0.241
3.508GluIle: 3.508 ± 0.348
2.363GluLys: 2.363 ± 0.418
6.515GluLeu: 6.515 ± 0.705
1.933GluMet: 1.933 ± 0.385
1.933GluAsn: 1.933 ± 0.389
1.647GluPro: 1.647 ± 0.342
2.935GluGln: 2.935 ± 0.451
3.58GluArg: 3.58 ± 0.553
4.224GluSer: 4.224 ± 0.476
3.866GluThr: 3.866 ± 0.472
3.293GluVal: 3.293 ± 0.479
0.931GluTrp: 0.931 ± 0.262
2.363GluTyr: 2.363 ± 0.397
0.0GluXaa: 0.0 ± 0.0
Phe
2.578PheAla: 2.578 ± 0.488
0.644PheCys: 0.644 ± 0.201
3.293PheAsp: 3.293 ± 0.448
1.933PheGlu: 1.933 ± 0.379
0.931PhePhe: 0.931 ± 0.365
3.079PheGly: 3.079 ± 0.331
0.358PheHis: 0.358 ± 0.141
2.578PheIle: 2.578 ± 0.403
2.506PheLys: 2.506 ± 0.571
1.074PheLeu: 1.074 ± 0.247
0.286PheMet: 0.286 ± 0.145
2.291PheAsn: 2.291 ± 0.44
1.289PhePro: 1.289 ± 0.36
0.716PheGln: 0.716 ± 0.254
1.432PheArg: 1.432 ± 0.311
3.007PheSer: 3.007 ± 0.47
2.721PheThr: 2.721 ± 0.513
2.649PheVal: 2.649 ± 0.514
0.43PheTrp: 0.43 ± 0.175
1.575PheTyr: 1.575 ± 0.283
0.0PheXaa: 0.0 ± 0.0
Gly
5.441GlyAla: 5.441 ± 0.66
0.788GlyCys: 0.788 ± 0.244
5.513GlyAsp: 5.513 ± 0.526
4.296GlyGlu: 4.296 ± 0.488
2.363GlyPhe: 2.363 ± 0.52
5.943GlyGly: 5.943 ± 0.73
1.432GlyHis: 1.432 ± 0.289
4.511GlyIle: 4.511 ± 0.458
3.795GlyLys: 3.795 ± 0.479
5.728GlyLeu: 5.728 ± 0.665
2.005GlyMet: 2.005 ± 0.538
4.081GlyAsn: 4.081 ± 0.694
1.36GlyPro: 1.36 ± 0.342
2.291GlyGln: 2.291 ± 0.377
4.009GlyArg: 4.009 ± 0.49
4.511GlySer: 4.511 ± 0.549
5.441GlyThr: 5.441 ± 0.836
6.157GlyVal: 6.157 ± 0.649
1.146GlyTrp: 1.146 ± 0.262
2.792GlyTyr: 2.792 ± 0.541
0.0GlyXaa: 0.0 ± 0.0
His
1.432HisAla: 1.432 ± 0.336
0.573HisCys: 0.573 ± 0.18
1.575HisAsp: 1.575 ± 0.363
1.36HisGlu: 1.36 ± 0.267
0.788HisPhe: 0.788 ± 0.254
1.002HisGly: 1.002 ± 0.305
0.788HisHis: 0.788 ± 0.185
1.146HisIle: 1.146 ± 0.26
0.644HisLys: 0.644 ± 0.221
1.575HisLeu: 1.575 ± 0.413
0.43HisMet: 0.43 ± 0.15
1.289HisAsn: 1.289 ± 0.239
0.859HisPro: 0.859 ± 0.23
0.501HisGln: 0.501 ± 0.169
0.931HisArg: 0.931 ± 0.25
0.716HisSer: 0.716 ± 0.239
1.36HisThr: 1.36 ± 0.297
1.146HisVal: 1.146 ± 0.296
0.358HisTrp: 0.358 ± 0.153
0.859HisTyr: 0.859 ± 0.266
0.0HisXaa: 0.0 ± 0.0
Ile
5.728IleAla: 5.728 ± 0.507
0.43IleCys: 0.43 ± 0.163
5.728IleAsp: 5.728 ± 0.4
4.582IleGlu: 4.582 ± 0.554
1.79IlePhe: 1.79 ± 0.322
5.083IleGly: 5.083 ± 0.663
1.146IleHis: 1.146 ± 0.234
5.083IleIle: 5.083 ± 0.569
4.153IleLys: 4.153 ± 0.626
4.009IleLeu: 4.009 ± 0.629
1.36IleMet: 1.36 ± 0.354
4.153IleAsn: 4.153 ± 0.647
2.721IlePro: 2.721 ± 0.626
2.363IleGln: 2.363 ± 0.448
2.935IleArg: 2.935 ± 0.483
3.795IleSer: 3.795 ± 0.479
4.94IleThr: 4.94 ± 0.571
5.441IleVal: 5.441 ± 0.595
0.501IleTrp: 0.501 ± 0.178
2.291IleTyr: 2.291 ± 0.423
0.0IleXaa: 0.0 ± 0.0
Lys
3.866LysAla: 3.866 ± 0.549
1.217LysCys: 1.217 ± 0.313
3.15LysAsp: 3.15 ± 0.478
2.291LysGlu: 2.291 ± 0.343
2.363LysPhe: 2.363 ± 0.493
3.293LysGly: 3.293 ± 0.355
1.289LysHis: 1.289 ± 0.331
2.721LysIle: 2.721 ± 0.466
2.005LysLys: 2.005 ± 0.428
4.153LysLeu: 4.153 ± 0.478
2.291LysMet: 2.291 ± 0.41
2.434LysAsn: 2.434 ± 0.426
2.363LysPro: 2.363 ± 0.382
2.148LysGln: 2.148 ± 0.299
4.009LysArg: 4.009 ± 0.651
4.009LysSer: 4.009 ± 0.682
2.291LysThr: 2.291 ± 0.41
3.651LysVal: 3.651 ± 0.639
0.859LysTrp: 0.859 ± 0.232
2.22LysTyr: 2.22 ± 0.42
0.0LysXaa: 0.0 ± 0.0
Leu
5.585LeuAla: 5.585 ± 0.628
1.146LeuCys: 1.146 ± 0.305
5.298LeuAsp: 5.298 ± 0.628
3.651LeuGlu: 3.651 ± 0.604
2.935LeuPhe: 2.935 ± 0.454
5.656LeuGly: 5.656 ± 0.585
1.432LeuHis: 1.432 ± 0.381
5.656LeuIle: 5.656 ± 0.554
3.651LeuLys: 3.651 ± 0.547
5.585LeuLeu: 5.585 ± 0.79
2.935LeuMet: 2.935 ± 0.513
4.009LeuAsn: 4.009 ± 0.599
3.723LeuPro: 3.723 ± 0.499
2.721LeuGln: 2.721 ± 0.477
5.012LeuArg: 5.012 ± 0.426
4.725LeuSer: 4.725 ± 0.623
5.585LeuThr: 5.585 ± 0.687
3.938LeuVal: 3.938 ± 0.506
1.146LeuTrp: 1.146 ± 0.345
2.22LeuTyr: 2.22 ± 0.388
0.0LeuXaa: 0.0 ± 0.0
Met
1.718MetAla: 1.718 ± 0.429
0.358MetCys: 0.358 ± 0.158
1.217MetAsp: 1.217 ± 0.36
1.002MetGlu: 1.002 ± 0.24
0.931MetPhe: 0.931 ± 0.236
1.217MetGly: 1.217 ± 0.279
0.286MetHis: 0.286 ± 0.148
2.005MetIle: 2.005 ± 0.417
1.862MetLys: 1.862 ± 0.431
1.718MetLeu: 1.718 ± 0.309
0.716MetMet: 0.716 ± 0.25
1.718MetAsn: 1.718 ± 0.438
0.931MetPro: 0.931 ± 0.244
1.146MetGln: 1.146 ± 0.292
2.005MetArg: 2.005 ± 0.342
2.148MetSer: 2.148 ± 0.377
1.862MetThr: 1.862 ± 0.297
2.506MetVal: 2.506 ± 0.351
0.072MetTrp: 0.072 ± 0.068
0.573MetTyr: 0.573 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
4.582AsnAla: 4.582 ± 0.593
0.716AsnCys: 0.716 ± 0.243
3.723AsnAsp: 3.723 ± 0.534
2.578AsnGlu: 2.578 ± 0.472
1.36AsnPhe: 1.36 ± 0.251
4.725AsnGly: 4.725 ± 0.564
1.002AsnHis: 1.002 ± 0.285
3.437AsnIle: 3.437 ± 0.461
2.935AsnLys: 2.935 ± 0.411
3.58AsnLeu: 3.58 ± 0.651
1.289AsnMet: 1.289 ± 0.294
3.15AsnAsn: 3.15 ± 0.471
2.649AsnPro: 2.649 ± 0.456
2.363AsnGln: 2.363 ± 0.374
1.933AsnArg: 1.933 ± 0.293
2.649AsnSer: 2.649 ± 0.41
3.007AsnThr: 3.007 ± 0.406
4.153AsnVal: 4.153 ± 0.578
0.644AsnTrp: 0.644 ± 0.175
1.289AsnTyr: 1.289 ± 0.322
0.0AsnXaa: 0.0 ± 0.0
Pro
3.079ProAla: 3.079 ± 0.453
0.859ProCys: 0.859 ± 0.215
3.079ProAsp: 3.079 ± 0.38
2.792ProGlu: 2.792 ± 0.479
1.504ProPhe: 1.504 ± 0.305
2.649ProGly: 2.649 ± 0.526
1.217ProHis: 1.217 ± 0.265
2.363ProIle: 2.363 ± 0.415
2.22ProLys: 2.22 ± 0.344
2.935ProLeu: 2.935 ± 0.417
0.644ProMet: 0.644 ± 0.255
2.363ProAsn: 2.363 ± 0.364
3.437ProPro: 3.437 ± 1.33
2.076ProGln: 2.076 ± 0.389
2.148ProArg: 2.148 ± 0.399
2.864ProSer: 2.864 ± 0.488
2.721ProThr: 2.721 ± 0.449
3.007ProVal: 3.007 ± 0.463
0.716ProTrp: 0.716 ± 0.242
1.217ProTyr: 1.217 ± 0.304
0.0ProXaa: 0.0 ± 0.0
Gln
2.291GlnAla: 2.291 ± 0.399
1.146GlnCys: 1.146 ± 0.294
2.005GlnAsp: 2.005 ± 0.359
1.504GlnGlu: 1.504 ± 0.297
1.647GlnPhe: 1.647 ± 0.285
1.933GlnGly: 1.933 ± 0.426
0.931GlnHis: 0.931 ± 0.293
2.22GlnIle: 2.22 ± 0.38
1.575GlnLys: 1.575 ± 0.287
4.153GlnLeu: 4.153 ± 0.546
0.788GlnMet: 0.788 ± 0.184
1.504GlnAsn: 1.504 ± 0.316
1.933GlnPro: 1.933 ± 0.339
1.933GlnGln: 1.933 ± 0.365
3.365GlnArg: 3.365 ± 0.573
2.22GlnSer: 2.22 ± 0.424
1.862GlnThr: 1.862 ± 0.352
1.718GlnVal: 1.718 ± 0.29
0.931GlnTrp: 0.931 ± 0.222
1.575GlnTyr: 1.575 ± 0.321
0.0GlnXaa: 0.0 ± 0.0
Arg
4.009ArgAla: 4.009 ± 0.517
0.788ArgCys: 0.788 ± 0.215
4.009ArgAsp: 4.009 ± 0.461
2.005ArgGlu: 2.005 ± 0.345
2.076ArgPhe: 2.076 ± 0.467
4.582ArgGly: 4.582 ± 0.505
1.002ArgHis: 1.002 ± 0.225
4.296ArgIle: 4.296 ± 0.499
4.654ArgLys: 4.654 ± 0.689
4.296ArgLeu: 4.296 ± 0.59
2.076ArgMet: 2.076 ± 0.434
3.007ArgAsn: 3.007 ± 0.414
2.005ArgPro: 2.005 ± 0.435
1.862ArgGln: 1.862 ± 0.374
2.721ArgArg: 2.721 ± 0.445
2.434ArgSer: 2.434 ± 0.4
2.22ArgThr: 2.22 ± 0.364
3.437ArgVal: 3.437 ± 0.518
1.217ArgTrp: 1.217 ± 0.329
2.22ArgTyr: 2.22 ± 0.442
0.0ArgXaa: 0.0 ± 0.0
Ser
5.155SerAla: 5.155 ± 0.531
0.215SerCys: 0.215 ± 0.09
3.938SerAsp: 3.938 ± 0.473
3.079SerGlu: 3.079 ± 0.48
2.434SerPhe: 2.434 ± 0.342
5.227SerGly: 5.227 ± 0.607
0.788SerHis: 0.788 ± 0.209
4.224SerIle: 4.224 ± 0.618
3.079SerLys: 3.079 ± 0.503
5.227SerLeu: 5.227 ± 0.548
1.79SerMet: 1.79 ± 0.323
3.866SerAsn: 3.866 ± 0.544
2.22SerPro: 2.22 ± 0.407
2.363SerGln: 2.363 ± 0.434
3.723SerArg: 3.723 ± 0.403
3.795SerSer: 3.795 ± 0.504
3.58SerThr: 3.58 ± 0.532
4.009SerVal: 4.009 ± 0.554
0.716SerTrp: 0.716 ± 0.199
1.862SerTyr: 1.862 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
3.866ThrAla: 3.866 ± 0.43
1.217ThrCys: 1.217 ± 0.328
4.296ThrAsp: 4.296 ± 0.503
3.58ThrGlu: 3.58 ± 0.551
2.22ThrPhe: 2.22 ± 0.411
5.585ThrGly: 5.585 ± 0.559
1.217ThrHis: 1.217 ± 0.327
4.869ThrIle: 4.869 ± 0.537
3.508ThrLys: 3.508 ± 0.522
5.513ThrLeu: 5.513 ± 0.669
1.217ThrMet: 1.217 ± 0.278
3.007ThrAsn: 3.007 ± 0.397
4.511ThrPro: 4.511 ± 0.665
3.079ThrGln: 3.079 ± 0.473
3.15ThrArg: 3.15 ± 0.48
3.651ThrSer: 3.651 ± 0.492
4.654ThrThr: 4.654 ± 0.64
4.797ThrVal: 4.797 ± 0.585
0.859ThrTrp: 0.859 ± 0.202
2.291ThrTyr: 2.291 ± 0.424
0.0ThrXaa: 0.0 ± 0.0
Val
5.656ValAla: 5.656 ± 0.719
1.002ValCys: 1.002 ± 0.212
5.227ValAsp: 5.227 ± 0.568
4.296ValGlu: 4.296 ± 0.7
2.792ValPhe: 2.792 ± 0.41
5.298ValGly: 5.298 ± 0.591
0.716ValHis: 0.716 ± 0.198
4.511ValIle: 4.511 ± 0.482
3.15ValLys: 3.15 ± 0.463
4.797ValLeu: 4.797 ± 0.584
1.289ValMet: 1.289 ± 0.365
3.437ValAsn: 3.437 ± 0.453
2.935ValPro: 2.935 ± 0.516
1.79ValGln: 1.79 ± 0.382
3.508ValArg: 3.508 ± 0.446
4.94ValSer: 4.94 ± 0.561
6.587ValThr: 6.587 ± 0.853
4.439ValVal: 4.439 ± 0.686
0.644ValTrp: 0.644 ± 0.192
2.578ValTyr: 2.578 ± 0.39
0.0ValXaa: 0.0 ± 0.0
Trp
0.716TrpAla: 0.716 ± 0.217
0.501TrpCys: 0.501 ± 0.168
0.931TrpAsp: 0.931 ± 0.208
0.716TrpGlu: 0.716 ± 0.243
0.716TrpPhe: 0.716 ± 0.165
1.146TrpGly: 1.146 ± 0.245
0.072TrpHis: 0.072 ± 0.072
0.859TrpIle: 0.859 ± 0.233
0.931TrpLys: 0.931 ± 0.301
1.647TrpLeu: 1.647 ± 0.324
0.286TrpMet: 0.286 ± 0.137
0.573TrpAsn: 0.573 ± 0.173
0.573TrpPro: 0.573 ± 0.186
0.286TrpGln: 0.286 ± 0.118
0.716TrpArg: 0.716 ± 0.219
1.074TrpSer: 1.074 ± 0.267
0.931TrpThr: 0.931 ± 0.25
0.931TrpVal: 0.931 ± 0.204
0.286TrpTrp: 0.286 ± 0.118
0.358TrpTyr: 0.358 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.22TyrAla: 2.22 ± 0.394
0.644TyrCys: 0.644 ± 0.265
2.649TyrAsp: 2.649 ± 0.498
2.434TyrGlu: 2.434 ± 0.331
1.647TyrPhe: 1.647 ± 0.325
2.792TyrGly: 2.792 ± 0.532
0.931TyrHis: 0.931 ± 0.258
3.15TyrIle: 3.15 ± 0.367
1.36TyrLys: 1.36 ± 0.262
2.148TyrLeu: 2.148 ± 0.315
0.573TyrMet: 0.573 ± 0.237
1.575TyrAsn: 1.575 ± 0.315
1.432TyrPro: 1.432 ± 0.339
1.432TyrGln: 1.432 ± 0.366
2.076TyrArg: 2.076 ± 0.339
1.647TyrSer: 1.647 ± 0.334
1.933TyrThr: 1.933 ± 0.358
2.506TyrVal: 2.506 ± 0.333
0.358TyrTrp: 0.358 ± 0.166
1.002TyrTyr: 1.002 ± 0.259
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13968 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski