Amino acid dipepetide frequency for Sulfitobacter phage pCB2047-C

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.676AlaAla: 16.676 ± 1.771
0.939AlaCys: 0.939 ± 0.317
7.751AlaAsp: 7.751 ± 0.8
8.377AlaGlu: 8.377 ± 0.931
3.836AlaPhe: 3.836 ± 0.493
10.804AlaGly: 10.804 ± 1.293
1.801AlaHis: 1.801 ± 0.402
6.185AlaIle: 6.185 ± 0.757
4.932AlaLys: 4.932 ± 0.602
10.569AlaLeu: 10.569 ± 1.028
4.463AlaMet: 4.463 ± 0.639
4.541AlaAsn: 4.541 ± 0.578
5.011AlaPro: 5.011 ± 0.747
5.324AlaGln: 5.324 ± 0.833
7.986AlaArg: 7.986 ± 0.997
6.655AlaSer: 6.655 ± 0.648
6.42AlaThr: 6.42 ± 0.858
6.811AlaVal: 6.811 ± 0.778
2.349AlaTrp: 2.349 ± 0.548
2.192AlaTyr: 2.192 ± 0.401
0.0AlaXaa: 0.0 ± 0.0
Cys
0.939CysAla: 0.939 ± 0.308
0.078CysCys: 0.078 ± 0.088
1.096CysAsp: 1.096 ± 0.343
0.548CysGlu: 0.548 ± 0.21
0.157CysPhe: 0.157 ± 0.117
0.705CysGly: 0.705 ± 0.244
0.078CysHis: 0.078 ± 0.088
0.313CysIle: 0.313 ± 0.163
0.548CysLys: 0.548 ± 0.203
0.861CysLeu: 0.861 ± 0.244
0.235CysMet: 0.235 ± 0.174
0.47CysAsn: 0.47 ± 0.237
0.391CysPro: 0.391 ± 0.22
0.313CysGln: 0.313 ± 0.175
0.783CysArg: 0.783 ± 0.235
0.939CysSer: 0.939 ± 0.298
0.235CysThr: 0.235 ± 0.135
0.47CysVal: 0.47 ± 0.202
0.078CysTrp: 0.078 ± 0.07
0.313CysTyr: 0.313 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
9.786AspAla: 9.786 ± 0.823
0.626AspCys: 0.626 ± 0.22
4.228AspAsp: 4.228 ± 0.76
4.776AspGlu: 4.776 ± 0.786
2.27AspPhe: 2.27 ± 0.382
5.872AspGly: 5.872 ± 0.682
1.096AspHis: 1.096 ± 0.285
2.74AspIle: 2.74 ± 0.402
1.957AspLys: 1.957 ± 0.346
5.559AspLeu: 5.559 ± 0.652
2.349AspMet: 2.349 ± 0.553
3.288AspAsn: 3.288 ± 0.565
3.366AspPro: 3.366 ± 0.455
2.192AspGln: 2.192 ± 0.436
4.306AspArg: 4.306 ± 0.524
2.584AspSer: 2.584 ± 0.382
3.445AspThr: 3.445 ± 0.484
4.071AspVal: 4.071 ± 0.592
1.488AspTrp: 1.488 ± 0.378
1.644AspTyr: 1.644 ± 0.374
0.0AspXaa: 0.0 ± 0.0
Glu
6.811GluAla: 6.811 ± 1.074
0.47GluCys: 0.47 ± 0.195
4.228GluAsp: 4.228 ± 0.689
2.662GluGlu: 2.662 ± 0.558
1.879GluPhe: 1.879 ± 0.384
5.245GluGly: 5.245 ± 0.636
1.096GluHis: 1.096 ± 0.326
2.192GluIle: 2.192 ± 0.373
2.74GluLys: 2.74 ± 0.51
4.149GluLeu: 4.149 ± 0.482
2.349GluMet: 2.349 ± 0.444
2.036GluAsn: 2.036 ± 0.335
1.957GluPro: 1.957 ± 0.371
2.584GluGln: 2.584 ± 0.513
3.68GluArg: 3.68 ± 0.561
2.349GluSer: 2.349 ± 0.397
4.071GluThr: 4.071 ± 0.561
4.149GluVal: 4.149 ± 0.713
0.939GluTrp: 0.939 ± 0.241
2.036GluTyr: 2.036 ± 0.336
0.0GluXaa: 0.0 ± 0.0
Phe
2.584PheAla: 2.584 ± 0.353
0.235PheCys: 0.235 ± 0.121
3.366PheAsp: 3.366 ± 0.493
1.644PheGlu: 1.644 ± 0.379
0.939PhePhe: 0.939 ± 0.254
4.071PheGly: 4.071 ± 0.534
0.861PheHis: 0.861 ± 0.281
0.626PheIle: 0.626 ± 0.182
1.488PheLys: 1.488 ± 0.337
1.957PheLeu: 1.957 ± 0.368
1.488PheMet: 1.488 ± 0.332
1.096PheAsn: 1.096 ± 0.264
1.018PhePro: 1.018 ± 0.274
0.783PheGln: 0.783 ± 0.24
2.036PheArg: 2.036 ± 0.393
1.722PheSer: 1.722 ± 0.442
1.488PheThr: 1.488 ± 0.385
2.192PheVal: 2.192 ± 0.384
0.705PheTrp: 0.705 ± 0.229
0.783PheTyr: 0.783 ± 0.25
0.0PheXaa: 0.0 ± 0.0
Gly
10.726GlyAla: 10.726 ± 1.792
0.783GlyCys: 0.783 ± 0.224
5.324GlyAsp: 5.324 ± 0.658
4.463GlyGlu: 4.463 ± 0.579
3.053GlyPhe: 3.053 ± 0.547
9.865GlyGly: 9.865 ± 1.464
1.488GlyHis: 1.488 ± 0.346
4.071GlyIle: 4.071 ± 0.515
4.228GlyLys: 4.228 ± 0.623
6.811GlyLeu: 6.811 ± 0.767
2.27GlyMet: 2.27 ± 0.33
3.132GlyAsn: 3.132 ± 0.409
2.897GlyPro: 2.897 ± 0.352
3.68GlyGln: 3.68 ± 0.76
4.306GlyArg: 4.306 ± 0.616
5.011GlySer: 5.011 ± 0.679
4.854GlyThr: 4.854 ± 0.804
6.185GlyVal: 6.185 ± 0.834
1.566GlyTrp: 1.566 ± 0.31
3.366GlyTyr: 3.366 ± 0.533
0.0GlyXaa: 0.0 ± 0.0
His
1.488HisAla: 1.488 ± 0.37
0.235HisCys: 0.235 ± 0.134
1.722HisAsp: 1.722 ± 0.401
1.174HisGlu: 1.174 ± 0.34
0.705HisPhe: 0.705 ± 0.273
2.036HisGly: 2.036 ± 0.452
0.47HisHis: 0.47 ± 0.226
1.644HisIle: 1.644 ± 0.403
0.626HisLys: 0.626 ± 0.253
1.174HisLeu: 1.174 ± 0.353
0.391HisMet: 0.391 ± 0.159
0.626HisAsn: 0.626 ± 0.34
1.096HisPro: 1.096 ± 0.274
0.705HisGln: 0.705 ± 0.271
1.096HisArg: 1.096 ± 0.346
1.018HisSer: 1.018 ± 0.373
1.174HisThr: 1.174 ± 0.283
1.566HisVal: 1.566 ± 0.372
0.548HisTrp: 0.548 ± 0.242
0.47HisTyr: 0.47 ± 0.171
0.0HisXaa: 0.0 ± 0.0
Ile
5.793IleAla: 5.793 ± 0.793
0.313IleCys: 0.313 ± 0.196
3.053IleAsp: 3.053 ± 0.457
3.366IleGlu: 3.366 ± 0.491
1.018IlePhe: 1.018 ± 0.299
4.149IleGly: 4.149 ± 0.601
1.174IleHis: 1.174 ± 0.28
2.036IleIle: 2.036 ± 0.398
2.505IleLys: 2.505 ± 0.489
2.349IleLeu: 2.349 ± 0.299
0.939IleMet: 0.939 ± 0.251
1.253IleAsn: 1.253 ± 0.293
1.488IlePro: 1.488 ± 0.286
1.722IleGln: 1.722 ± 0.286
2.192IleArg: 2.192 ± 0.473
3.523IleSer: 3.523 ± 0.482
2.584IleThr: 2.584 ± 0.482
2.818IleVal: 2.818 ± 0.631
1.096IleTrp: 1.096 ± 0.339
1.331IleTyr: 1.331 ± 0.281
0.0IleXaa: 0.0 ± 0.0
Lys
6.968LysAla: 6.968 ± 0.88
0.861LysCys: 0.861 ± 0.276
1.331LysAsp: 1.331 ± 0.29
2.114LysGlu: 2.114 ± 0.425
1.409LysPhe: 1.409 ± 0.292
3.445LysGly: 3.445 ± 0.655
1.096LysHis: 1.096 ± 0.32
2.27LysIle: 2.27 ± 0.418
2.114LysLys: 2.114 ± 0.432
3.68LysLeu: 3.68 ± 0.422
1.409LysMet: 1.409 ± 0.355
1.174LysAsn: 1.174 ± 0.293
2.036LysPro: 2.036 ± 0.359
1.722LysGln: 1.722 ± 0.381
3.053LysArg: 3.053 ± 0.446
2.74LysSer: 2.74 ± 0.439
2.505LysThr: 2.505 ± 0.404
2.584LysVal: 2.584 ± 0.463
0.861LysTrp: 0.861 ± 0.254
1.331LysTyr: 1.331 ± 0.287
0.0LysXaa: 0.0 ± 0.0
Leu
8.455LeuAla: 8.455 ± 0.781
1.174LeuCys: 1.174 ± 0.343
3.68LeuAsp: 3.68 ± 0.464
3.366LeuGlu: 3.366 ± 0.471
1.644LeuPhe: 1.644 ± 0.381
6.968LeuGly: 6.968 ± 0.835
1.644LeuHis: 1.644 ± 0.364
3.993LeuIle: 3.993 ± 0.6
2.505LeuLys: 2.505 ± 0.382
5.637LeuLeu: 5.637 ± 0.707
2.192LeuMet: 2.192 ± 0.388
3.053LeuAsn: 3.053 ± 0.593
3.366LeuPro: 3.366 ± 0.47
2.897LeuGln: 2.897 ± 0.675
6.028LeuArg: 6.028 ± 0.76
6.107LeuSer: 6.107 ± 0.683
4.306LeuThr: 4.306 ± 0.671
3.836LeuVal: 3.836 ± 0.61
0.626LeuTrp: 0.626 ± 0.212
2.27LeuTyr: 2.27 ± 0.389
0.0LeuXaa: 0.0 ± 0.0
Met
3.288MetAla: 3.288 ± 0.577
0.157MetCys: 0.157 ± 0.111
2.114MetAsp: 2.114 ± 0.331
1.253MetGlu: 1.253 ± 0.338
0.626MetPhe: 0.626 ± 0.222
3.053MetGly: 3.053 ± 0.487
0.313MetHis: 0.313 ± 0.175
0.861MetIle: 0.861 ± 0.269
0.939MetLys: 0.939 ± 0.304
1.331MetLeu: 1.331 ± 0.265
0.783MetMet: 0.783 ± 0.266
1.018MetAsn: 1.018 ± 0.257
1.722MetPro: 1.722 ± 0.318
2.27MetGln: 2.27 ± 0.471
1.879MetArg: 1.879 ± 0.452
1.879MetSer: 1.879 ± 0.396
2.349MetThr: 2.349 ± 0.378
1.644MetVal: 1.644 ± 0.42
0.391MetTrp: 0.391 ± 0.16
0.391MetTyr: 0.391 ± 0.174
0.0MetXaa: 0.0 ± 0.0
Asn
4.463AsnAla: 4.463 ± 0.677
0.47AsnCys: 0.47 ± 0.206
3.366AsnAsp: 3.366 ± 0.561
1.879AsnGlu: 1.879 ± 0.348
0.783AsnPhe: 0.783 ± 0.256
3.523AsnGly: 3.523 ± 0.514
0.861AsnHis: 0.861 ± 0.271
1.174AsnIle: 1.174 ± 0.297
1.409AsnLys: 1.409 ± 0.366
2.036AsnLeu: 2.036 ± 0.324
0.548AsnMet: 0.548 ± 0.216
1.096AsnAsn: 1.096 ± 0.264
2.505AsnPro: 2.505 ± 0.407
1.957AsnGln: 1.957 ± 0.44
2.427AsnArg: 2.427 ± 0.353
1.722AsnSer: 1.722 ± 0.377
1.957AsnThr: 1.957 ± 0.346
2.427AsnVal: 2.427 ± 0.375
1.018AsnTrp: 1.018 ± 0.253
0.861AsnTyr: 0.861 ± 0.232
0.0AsnXaa: 0.0 ± 0.0
Pro
5.48ProAla: 5.48 ± 0.753
0.313ProCys: 0.313 ± 0.158
4.697ProAsp: 4.697 ± 0.775
3.445ProGlu: 3.445 ± 0.543
1.331ProPhe: 1.331 ± 0.298
2.349ProGly: 2.349 ± 0.403
1.331ProHis: 1.331 ± 0.375
1.566ProIle: 1.566 ± 0.346
2.192ProLys: 2.192 ± 0.421
3.053ProLeu: 3.053 ± 0.439
0.939ProMet: 0.939 ± 0.246
1.879ProAsn: 1.879 ± 0.441
1.644ProPro: 1.644 ± 0.4
1.879ProGln: 1.879 ± 0.312
2.74ProArg: 2.74 ± 0.4
2.505ProSer: 2.505 ± 0.564
2.74ProThr: 2.74 ± 0.476
2.27ProVal: 2.27 ± 0.391
0.705ProTrp: 0.705 ± 0.248
1.018ProTyr: 1.018 ± 0.264
0.0ProXaa: 0.0 ± 0.0
Gln
5.245GlnAla: 5.245 ± 0.824
0.391GlnCys: 0.391 ± 0.192
1.801GlnAsp: 1.801 ± 0.495
2.505GlnGlu: 2.505 ± 0.521
1.488GlnPhe: 1.488 ± 0.352
3.523GlnGly: 3.523 ± 0.76
0.548GlnHis: 0.548 ± 0.229
1.722GlnIle: 1.722 ± 0.342
2.27GlnLys: 2.27 ± 0.439
2.662GlnLeu: 2.662 ± 0.582
1.409GlnMet: 1.409 ± 0.443
1.957GlnAsn: 1.957 ± 0.41
2.114GlnPro: 2.114 ± 0.451
2.192GlnGln: 2.192 ± 0.571
3.132GlnArg: 3.132 ± 0.437
2.74GlnSer: 2.74 ± 0.536
3.132GlnThr: 3.132 ± 0.612
1.957GlnVal: 1.957 ± 0.46
1.096GlnTrp: 1.096 ± 0.336
0.783GlnTyr: 0.783 ± 0.22
0.0GlnXaa: 0.0 ± 0.0
Arg
8.22ArgAla: 8.22 ± 0.913
0.705ArgCys: 0.705 ± 0.226
5.245ArgAsp: 5.245 ± 0.564
3.601ArgGlu: 3.601 ± 0.54
1.644ArgPhe: 1.644 ± 0.328
4.306ArgGly: 4.306 ± 0.621
1.409ArgHis: 1.409 ± 0.412
2.662ArgIle: 2.662 ± 0.452
4.071ArgLys: 4.071 ± 0.758
5.559ArgLeu: 5.559 ± 0.597
1.331ArgMet: 1.331 ± 0.28
2.427ArgAsn: 2.427 ± 0.404
2.349ArgPro: 2.349 ± 0.41
2.27ArgGln: 2.27 ± 0.596
5.48ArgArg: 5.48 ± 0.905
3.915ArgSer: 3.915 ± 0.588
2.427ArgThr: 2.427 ± 0.406
4.697ArgVal: 4.697 ± 0.649
1.331ArgTrp: 1.331 ± 0.389
1.722ArgTyr: 1.722 ± 0.348
0.0ArgXaa: 0.0 ± 0.0
Ser
7.672SerAla: 7.672 ± 0.797
0.391SerCys: 0.391 ± 0.193
4.071SerAsp: 4.071 ± 0.543
2.975SerGlu: 2.975 ± 0.587
1.722SerPhe: 1.722 ± 0.42
5.872SerGly: 5.872 ± 0.822
1.331SerHis: 1.331 ± 0.358
3.053SerIle: 3.053 ± 0.45
2.897SerLys: 2.897 ± 0.487
4.619SerLeu: 4.619 ± 0.676
1.722SerMet: 1.722 ± 0.333
2.349SerAsn: 2.349 ± 0.495
1.566SerPro: 1.566 ± 0.366
2.74SerGln: 2.74 ± 0.578
3.915SerArg: 3.915 ± 0.477
2.897SerSer: 2.897 ± 0.604
3.601SerThr: 3.601 ± 0.601
3.445SerVal: 3.445 ± 0.639
0.783SerTrp: 0.783 ± 0.282
1.174SerTyr: 1.174 ± 0.328
0.0SerXaa: 0.0 ± 0.0
Thr
6.498ThrAla: 6.498 ± 0.787
0.313ThrCys: 0.313 ± 0.15
3.445ThrAsp: 3.445 ± 0.714
2.897ThrGlu: 2.897 ± 0.487
3.053ThrPhe: 3.053 ± 0.5
5.715ThrGly: 5.715 ± 0.847
1.331ThrHis: 1.331 ± 0.323
2.427ThrIle: 2.427 ± 0.505
2.192ThrLys: 2.192 ± 0.503
3.915ThrLeu: 3.915 ± 0.579
0.783ThrMet: 0.783 ± 0.268
1.957ThrAsn: 1.957 ± 0.52
5.48ThrPro: 5.48 ± 0.738
3.053ThrGln: 3.053 ± 0.553
2.897ThrArg: 2.897 ± 0.48
2.662ThrSer: 2.662 ± 0.582
3.366ThrThr: 3.366 ± 0.537
4.463ThrVal: 4.463 ± 0.735
0.705ThrTrp: 0.705 ± 0.222
0.861ThrTyr: 0.861 ± 0.277
0.0ThrXaa: 0.0 ± 0.0
Val
7.359ValAla: 7.359 ± 0.794
0.548ValCys: 0.548 ± 0.235
3.836ValAsp: 3.836 ± 0.548
4.228ValGlu: 4.228 ± 0.572
2.036ValPhe: 2.036 ± 0.428
3.523ValGly: 3.523 ± 0.658
0.861ValHis: 0.861 ± 0.248
3.288ValIle: 3.288 ± 0.599
3.288ValLys: 3.288 ± 0.546
4.776ValLeu: 4.776 ± 0.694
1.957ValMet: 1.957 ± 0.425
2.114ValAsn: 2.114 ± 0.325
2.427ValPro: 2.427 ± 0.439
2.74ValGln: 2.74 ± 0.487
3.993ValArg: 3.993 ± 0.642
4.932ValSer: 4.932 ± 0.798
5.011ValThr: 5.011 ± 0.626
3.601ValVal: 3.601 ± 0.591
0.626ValTrp: 0.626 ± 0.221
1.801ValTyr: 1.801 ± 0.426
0.0ValXaa: 0.0 ± 0.0
Trp
2.27TrpAla: 2.27 ± 0.455
0.391TrpCys: 0.391 ± 0.177
1.331TrpAsp: 1.331 ± 0.424
1.018TrpGlu: 1.018 ± 0.199
0.391TrpPhe: 0.391 ± 0.209
1.174TrpGly: 1.174 ± 0.329
0.47TrpHis: 0.47 ± 0.179
0.783TrpIle: 0.783 ± 0.241
0.783TrpLys: 0.783 ± 0.272
1.409TrpLeu: 1.409 ± 0.391
0.157TrpMet: 0.157 ± 0.113
0.47TrpAsn: 0.47 ± 0.234
0.391TrpPro: 0.391 ± 0.16
0.783TrpGln: 0.783 ± 0.225
1.331TrpArg: 1.331 ± 0.315
1.174TrpSer: 1.174 ± 0.366
0.939TrpThr: 0.939 ± 0.285
1.488TrpVal: 1.488 ± 0.385
0.235TrpTrp: 0.235 ± 0.122
0.391TrpTyr: 0.391 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.897TyrAla: 2.897 ± 0.429
0.157TyrCys: 0.157 ± 0.115
1.957TyrAsp: 1.957 ± 0.517
1.331TyrGlu: 1.331 ± 0.31
1.096TyrPhe: 1.096 ± 0.376
1.879TyrGly: 1.879 ± 0.371
0.705TyrHis: 0.705 ± 0.298
1.096TyrIle: 1.096 ± 0.244
1.018TyrLys: 1.018 ± 0.285
1.644TyrLeu: 1.644 ± 0.323
0.47TyrMet: 0.47 ± 0.187
0.548TyrAsn: 0.548 ± 0.206
1.253TyrPro: 1.253 ± 0.297
0.939TyrGln: 0.939 ± 0.269
2.114TyrArg: 2.114 ± 0.36
1.722TyrSer: 1.722 ± 0.298
1.409TyrThr: 1.409 ± 0.313
2.27TyrVal: 2.27 ± 0.418
0.235TyrTrp: 0.235 ± 0.119
0.548TyrTyr: 0.548 ± 0.234
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (12774 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski