Amino acid dipeptide frequency for Achromobacter phage vB_AxyP_19-32_Axy09

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
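A minimal sketch of how such frequencies could be computed is shown here. It is not the Proteome-pI implementation: the function name `dipeptide_permille` is hypothetical, and two assumptions are made that the page does not state, namely that pairs are counted with overlap within each protein only, and that counts are normalised by the total number of pairs counted.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumptions (not taken from the source): overlapping pairs are counted
    within each protein, never across protein boundaries, and the counts are
    divided by the total number of pairs.
    """
    alphabet = STANDARD + "X"  # X stands for Xaa (non-standard or unknown)
    counts = Counter()
    for seq in proteins:
        # Map every character outside the 20 standard residues to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping adjacent pairs: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input: "MAAK" gives MA, AA, AK and "AAC" gives AA, AC (5 pairs in total).
freqs = dipeptide_permille(["MAAK", "AAC"])
print(len(freqs))   # 441
print(freqs["AA"])  # 400.0
```

With real data the input would be the protein sequences of the proteome, for example read from a FASTA file.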

All values are presented in per mille (‰); to obtain a fraction, multiply them by 10^-3.
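As a small worked example of the unit conversion and of the comparison against the uniform expectation, the sketch below converts a per mille value to a fraction and labels it as over- or underrepresented. The helper names are hypothetical, and the simple "greater than / less than expected" rule is an assumption; the exact threshold used for the colouring on the page is not stated.

```python
# Uniform expectation in per mille: 0.25% = 2.5 per mille for 400 dipeptides.
EXPECTED_400 = 1000.0 / 400
# Alternative expectation if Xaa is treated as a 21st residue (about 2.27 per mille).
EXPECTED_441 = 1000.0 / 441


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_400):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Values taken from the table: AlaAla = 20.301, CysCys = 0.077 (both in per mille).
print(classify(20.301))  # over
print(classify(0.077))   # under
```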

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 20.301 ± 1.792
AlaCys: 1.158 ± 0.37
AlaAsp: 7.024 ± 0.743
AlaGlu: 6.484 ± 0.614
AlaPhe: 3.396 ± 0.587
AlaGly: 9.88 ± 1.286
AlaHis: 2.161 ± 0.311
AlaIle: 3.782 ± 0.559
AlaLys: 6.407 ± 0.681
AlaLeu: 10.498 ± 1.012
AlaMet: 3.86 ± 0.603
AlaAsn: 5.017 ± 0.527
AlaPro: 5.944 ± 2.022
AlaGln: 7.487 ± 1.065
AlaArg: 7.024 ± 0.786
AlaSer: 6.098 ± 1.07
AlaThr: 6.484 ± 0.895
AlaVal: 9.031 ± 1.167
AlaTrp: 1.081 ± 0.383
AlaTyr: 4.014 ± 0.701
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.463 ± 0.185
CysCys: 0.077 ± 0.084
CysAsp: 0.386 ± 0.193
CysGlu: 0.309 ± 0.16
CysPhe: 0.154 ± 0.116
CysGly: 0.54 ± 0.257
CysHis: 0.077 ± 0.078
CysIle: 0.386 ± 0.162
CysLys: 0.154 ± 0.111
CysLeu: 0.695 ± 0.253
CysMet: 0.077 ± 0.072
CysAsn: 0.232 ± 0.137
CysPro: 0.463 ± 0.199
CysGln: 0.232 ± 0.185
CysArg: 0.772 ± 0.307
CysSer: 0.232 ± 0.153
CysThr: 0.386 ± 0.21
CysVal: 0.618 ± 0.266
CysTrp: 0.077 ± 0.083
CysTyr: 0.463 ± 0.207
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.796 ± 0.789
AspCys: 0.618 ± 0.23
AspAsp: 2.933 ± 0.588
AspGlu: 2.702 ± 0.415
AspPhe: 1.312 ± 0.326
AspGly: 4.554 ± 0.643
AspHis: 1.003 ± 0.299
AspIle: 3.705 ± 0.6
AspLys: 2.624 ± 0.538
AspLeu: 5.635 ± 0.72
AspMet: 1.698 ± 0.375
AspAsn: 1.775 ± 0.321
AspPro: 3.705 ± 0.464
AspGln: 2.007 ± 0.318
AspArg: 3.86 ± 0.587
AspSer: 4.554 ± 0.578
AspThr: 2.239 ± 0.513
AspVal: 4.168 ± 0.412
AspTrp: 0.772 ± 0.193
AspTyr: 2.007 ± 0.372
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.796 ± 0.74
GluCys: 0.309 ± 0.168
GluAsp: 2.779 ± 0.458
GluGlu: 2.624 ± 0.567
GluPhe: 2.624 ± 0.428
GluGly: 3.474 ± 0.476
GluHis: 1.467 ± 0.475
GluIle: 2.393 ± 0.524
GluLys: 2.084 ± 0.38
GluLeu: 4.631 ± 0.582
GluMet: 2.007 ± 0.296
GluAsn: 1.775 ± 0.327
GluPro: 1.003 ± 0.26
GluGln: 2.007 ± 0.475
GluArg: 4.245 ± 0.594
GluSer: 2.47 ± 0.542
GluThr: 3.088 ± 0.386
GluVal: 3.782 ± 0.588
GluTrp: 0.849 ± 0.257
GluTyr: 2.161 ± 0.394
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.782 ± 0.496
PheCys: 0.309 ± 0.134
PheAsp: 2.47 ± 0.423
PheGlu: 1.389 ± 0.319
PhePhe: 0.772 ± 0.265
PheGly: 2.007 ± 0.329
PheHis: 0.618 ± 0.203
PheIle: 1.312 ± 0.318
PheLys: 1.389 ± 0.309
PheLeu: 2.393 ± 0.46
PheMet: 0.695 ± 0.147
PheAsn: 1.853 ± 0.306
PhePro: 2.084 ± 0.354
PheGln: 0.926 ± 0.273
PheArg: 1.698 ± 0.389
PheSer: 1.853 ± 0.469
PheThr: 2.239 ± 0.35
PheVal: 2.316 ± 0.341
PheTrp: 0.463 ± 0.154
PheTyr: 0.849 ± 0.278
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.487 ± 0.735
GlyCys: 0.154 ± 0.102
GlyAsp: 4.709 ± 0.469
GlyGlu: 3.165 ± 0.51
GlyPhe: 1.93 ± 0.49
GlyGly: 7.102 ± 1.063
GlyHis: 1.081 ± 0.275
GlyIle: 4.4 ± 0.492
GlyLys: 5.095 ± 0.868
GlyLeu: 6.021 ± 0.691
GlyMet: 2.933 ± 0.471
GlyAsn: 3.165 ± 0.469
GlyPro: 2.547 ± 0.539
GlyGln: 4.323 ± 0.655
GlyArg: 6.407 ± 0.694
GlySer: 3.705 ± 0.418
GlyThr: 5.017 ± 0.697
GlyVal: 6.175 ± 0.604
GlyTrp: 1.003 ± 0.282
GlyTyr: 2.007 ± 0.36
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.702 ± 0.42
HisCys: 0.154 ± 0.125
HisAsp: 1.158 ± 0.403
HisGlu: 1.003 ± 0.245
HisPhe: 0.309 ± 0.146
HisGly: 1.775 ± 0.418
HisHis: 0.695 ± 0.191
HisIle: 1.389 ± 0.339
HisLys: 1.158 ± 0.296
HisLeu: 2.161 ± 0.383
HisMet: 0.54 ± 0.198
HisAsn: 0.54 ± 0.218
HisPro: 0.695 ± 0.269
HisGln: 0.926 ± 0.292
HisArg: 1.235 ± 0.259
HisSer: 0.772 ± 0.268
HisThr: 1.158 ± 0.365
HisVal: 1.467 ± 0.399
HisTrp: 0.618 ± 0.25
HisTyr: 0.772 ± 0.346
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.631 ± 0.69
IleCys: 0.386 ± 0.169
IleAsp: 3.628 ± 0.542
IleGlu: 2.007 ± 0.389
IlePhe: 0.463 ± 0.143
IleGly: 3.319 ± 0.472
IleHis: 1.158 ± 0.375
IleIle: 2.161 ± 0.324
IleLys: 2.702 ± 0.427
IleLeu: 2.779 ± 0.485
IleMet: 0.926 ± 0.348
IleAsn: 1.698 ± 0.446
IlePro: 1.93 ± 0.41
IleGln: 2.007 ± 0.501
IleArg: 2.624 ± 0.478
IleSer: 1.698 ± 0.315
IleThr: 3.396 ± 0.675
IleVal: 2.856 ± 0.466
IleTrp: 0.463 ± 0.185
IleTyr: 0.849 ± 0.218
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.33 ± 0.782
LysCys: 0.154 ± 0.117
LysAsp: 2.779 ± 0.351
LysGlu: 2.239 ± 0.393
LysPhe: 1.467 ± 0.3
LysGly: 3.242 ± 0.571
LysHis: 1.081 ± 0.357
LysIle: 1.389 ± 0.355
LysLys: 1.621 ± 0.505
LysLeu: 4.709 ± 0.643
LysMet: 0.926 ± 0.279
LysAsn: 1.467 ± 0.292
LysPro: 2.007 ± 0.388
LysGln: 2.161 ± 0.371
LysArg: 3.242 ± 0.618
LysSer: 2.316 ± 0.416
LysThr: 2.084 ± 0.437
LysVal: 3.319 ± 0.531
LysTrp: 1.003 ± 0.236
LysTyr: 1.621 ± 0.34
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 11.424 ± 1.144
LeuCys: 0.618 ± 0.255
LeuAsp: 6.098 ± 0.618
LeuGlu: 4.863 ± 0.53
LeuPhe: 2.624 ± 0.351
LeuGly: 5.712 ± 0.684
LeuHis: 1.93 ± 0.385
LeuIle: 4.245 ± 0.684
LeuLys: 3.165 ± 0.62
LeuLeu: 6.561 ± 0.781
LeuMet: 2.161 ± 0.369
LeuAsn: 3.474 ± 0.411
LeuPro: 3.396 ± 0.526
LeuGln: 3.628 ± 0.511
LeuArg: 6.021 ± 0.684
LeuSer: 4.323 ± 0.593
LeuThr: 5.481 ± 0.556
LeuVal: 4.709 ± 0.534
LeuTrp: 1.158 ± 0.221
LeuTyr: 2.007 ± 0.457
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.474 ± 0.575
MetCys: 0.154 ± 0.091
MetAsp: 1.312 ± 0.292
MetGlu: 1.467 ± 0.444
MetPhe: 1.081 ± 0.214
MetGly: 1.621 ± 0.403
MetHis: 1.003 ± 0.321
MetIle: 1.158 ± 0.283
MetLys: 1.389 ± 0.39
MetLeu: 2.161 ± 0.411
MetMet: 0.54 ± 0.154
MetAsn: 0.926 ± 0.329
MetPro: 1.235 ± 0.248
MetGln: 2.084 ± 0.511
MetArg: 2.316 ± 0.484
MetSer: 1.544 ± 0.384
MetThr: 1.621 ± 0.324
MetVal: 1.698 ± 0.261
MetTrp: 0.463 ± 0.178
MetTyr: 1.081 ± 0.235
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.551 ± 0.69
AsnCys: 0.232 ± 0.159
AsnAsp: 2.161 ± 0.307
AsnGlu: 2.393 ± 0.391
AsnPhe: 1.235 ± 0.283
AsnGly: 3.165 ± 0.435
AsnHis: 0.695 ± 0.247
AsnIle: 1.93 ± 0.403
AsnLys: 1.235 ± 0.302
AsnLeu: 3.088 ± 0.423
AsnMet: 1.003 ± 0.242
AsnAsn: 0.695 ± 0.254
AsnPro: 2.007 ± 0.362
AsnGln: 1.621 ± 0.307
AsnArg: 2.624 ± 0.513
AsnSer: 1.544 ± 0.295
AsnThr: 2.239 ± 0.386
AsnVal: 2.856 ± 0.557
AsnTrp: 0.695 ± 0.235
AsnTyr: 1.081 ± 0.251
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.947 ± 1.654
ProCys: 0.309 ± 0.145
ProAsp: 3.474 ± 0.4
ProGlu: 3.628 ± 0.643
ProPhe: 1.775 ± 0.222
ProGly: 4.091 ± 0.608
ProHis: 0.54 ± 0.208
ProIle: 1.389 ± 0.343
ProLys: 2.007 ± 0.458
ProLeu: 3.242 ± 0.558
ProMet: 0.926 ± 0.211
ProAsn: 1.081 ± 0.332
ProPro: 2.393 ± 0.551
ProGln: 2.007 ± 0.397
ProArg: 2.084 ± 0.427
ProSer: 2.084 ± 0.298
ProThr: 3.01 ± 0.474
ProVal: 4.323 ± 0.957
ProTrp: 0.232 ± 0.131
ProTyr: 1.467 ± 0.446
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.175 ± 0.553
GlnCys: 0.309 ± 0.203
GlnAsp: 2.393 ± 0.37
GlnGlu: 3.01 ± 0.91
GlnPhe: 1.544 ± 0.274
GlnGly: 3.782 ± 0.393
GlnHis: 0.926 ± 0.25
GlnIle: 1.698 ± 0.274
GlnLys: 1.621 ± 0.338
GlnLeu: 5.095 ± 0.657
GlnMet: 1.93 ± 0.27
GlnAsn: 1.621 ± 0.366
GlnPro: 2.393 ± 0.639
GlnGln: 3.242 ± 0.662
GlnArg: 4.168 ± 0.507
GlnSer: 2.547 ± 0.476
GlnThr: 2.316 ± 0.496
GlnVal: 2.084 ± 0.364
GlnTrp: 0.772 ± 0.218
GlnTyr: 1.312 ± 0.363
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 8.028 ± 0.917
ArgCys: 0.54 ± 0.238
ArgAsp: 3.782 ± 0.487
ArgGlu: 3.628 ± 0.592
ArgPhe: 1.698 ± 0.337
ArgGly: 4.477 ± 0.554
ArgHis: 1.312 ± 0.364
ArgIle: 2.933 ± 0.465
ArgLys: 3.705 ± 0.488
ArgLeu: 5.789 ± 0.71
ArgMet: 2.393 ± 0.451
ArgAsn: 2.47 ± 0.369
ArgPro: 2.933 ± 0.401
ArgGln: 2.856 ± 0.644
ArgArg: 4.014 ± 0.635
ArgSer: 2.856 ± 0.522
ArgThr: 3.628 ± 0.585
ArgVal: 4.786 ± 0.662
ArgTrp: 1.544 ± 0.375
ArgTyr: 1.853 ± 0.39
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.252 ± 0.837
SerCys: 0.232 ± 0.161
SerAsp: 3.242 ± 0.398
SerGlu: 2.702 ± 0.423
SerPhe: 1.853 ± 0.329
SerGly: 5.481 ± 0.587
SerHis: 1.312 ± 0.33
SerIle: 1.312 ± 0.291
SerLys: 2.239 ± 0.407
SerLeu: 3.782 ± 0.503
SerMet: 1.158 ± 0.279
SerAsn: 2.702 ± 0.426
SerPro: 2.239 ± 0.4
SerGln: 2.702 ± 0.31
SerArg: 2.779 ± 0.387
SerSer: 2.856 ± 0.533
SerThr: 3.242 ± 0.527
SerVal: 3.782 ± 0.54
SerTrp: 1.003 ± 0.339
SerTyr: 1.081 ± 0.271
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.638 ± 0.716
ThrCys: 0.232 ± 0.127
ThrAsp: 3.474 ± 0.509
ThrGlu: 3.782 ± 0.515
ThrPhe: 1.853 ± 0.363
ThrGly: 5.481 ± 0.904
ThrHis: 0.695 ± 0.22
ThrIle: 2.624 ± 0.656
ThrLys: 2.084 ± 0.468
ThrLeu: 4.863 ± 0.631
ThrMet: 1.235 ± 0.306
ThrAsn: 2.316 ± 0.563
ThrPro: 3.937 ± 0.62
ThrGln: 3.165 ± 0.457
ThrArg: 2.393 ± 0.313
ThrSer: 3.705 ± 0.66
ThrThr: 3.86 ± 0.851
ThrVal: 5.249 ± 0.848
ThrTrp: 0.695 ± 0.159
ThrTyr: 1.775 ± 0.395
ThrXaa: 0.0 ± 0.0

Val
ValAla: 9.494 ± 1.123
ValCys: 0.386 ± 0.219
ValAsp: 3.242 ± 0.467
ValGlu: 3.474 ± 0.511
ValPhe: 2.933 ± 0.461
ValGly: 5.017 ± 0.795
ValHis: 2.161 ± 0.515
ValIle: 2.393 ± 0.693
ValLys: 2.702 ± 0.4
ValLeu: 5.172 ± 0.618
ValMet: 2.316 ± 0.43
ValAsn: 1.775 ± 0.391
ValPro: 4.477 ± 0.709
ValGln: 3.319 ± 0.408
ValArg: 4.631 ± 0.537
ValSer: 3.705 ± 0.497
ValThr: 5.249 ± 0.863
ValVal: 4.786 ± 0.629
ValTrp: 0.772 ± 0.23
ValTyr: 1.93 ± 0.347
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.235 ± 0.326
TrpCys: 0.232 ± 0.131
TrpAsp: 0.618 ± 0.181
TrpGlu: 0.926 ± 0.218
TrpPhe: 0.618 ± 0.198
TrpGly: 1.235 ± 0.328
TrpHis: 0.386 ± 0.16
TrpIle: 0.232 ± 0.111
TrpLys: 0.386 ± 0.146
TrpLeu: 2.007 ± 0.519
TrpMet: 0.232 ± 0.197
TrpAsn: 0.54 ± 0.186
TrpPro: 0.386 ± 0.149
TrpGln: 0.695 ± 0.203
TrpArg: 0.849 ± 0.346
TrpSer: 1.081 ± 0.264
TrpThr: 1.235 ± 0.248
TrpVal: 0.695 ± 0.315
TrpTrp: 0.309 ± 0.123
TrpTyr: 0.54 ± 0.211
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.702 ± 0.356
TyrCys: 0.232 ± 0.153
TyrAsp: 2.007 ± 0.485
TyrGlu: 1.775 ± 0.438
TyrPhe: 1.775 ± 0.413
TyrGly: 2.47 ± 0.635
TyrHis: 0.926 ± 0.232
TyrIle: 0.849 ± 0.282
TyrLys: 1.389 ± 0.337
TyrLeu: 2.316 ± 0.469
TyrMet: 0.695 ± 0.254
TyrAsn: 0.926 ± 0.322
TyrPro: 1.235 ± 0.298
TyrGln: 1.544 ± 0.268
TyrArg: 2.239 ± 0.367
TyrSer: 2.007 ± 0.373
TyrThr: 2.084 ± 0.371
TyrVal: 1.235 ± 0.416
TyrTrp: 0.463 ± 0.213
TyrTyr: 0.54 ± 0.223
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 49 proteins (12956 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
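Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. This is only an illustration under stated assumptions: the function names are hypothetical, the use of the sample standard deviation as the error measure is an assumption, and the database's actual procedure may differ in detail.

```python
import random
import statistics


def dipeptide_freq(proteins, pair):
    """Frequency (per mille) of one dipeptide, counted within proteins only."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation across `rounds`
    bootstrap replicates (100 by default, matching the note above).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences (one-letter codes), purely illustrative.
toy = ["MAAK", "AAC", "MKKL", "GAAAV"]
print(dipeptide_freq(toy, "AA"), "+/-", bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.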

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski