Amino acid dipepetide frequency for Arthrobacter phage Constance

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.816AlaAla: 18.816 ± 1.708
1.153AlaCys: 1.153 ± 0.361
7.642AlaAsp: 7.642 ± 0.788
6.921AlaGlu: 6.921 ± 0.871
2.884AlaPhe: 2.884 ± 0.392
13.049AlaGly: 13.049 ± 1.726
2.307AlaHis: 2.307 ± 0.549
5.263AlaIle: 5.263 ± 0.658
6.2AlaLys: 6.2 ± 0.672
10.165AlaLeu: 10.165 ± 0.915
2.235AlaMet: 2.235 ± 0.572
4.542AlaAsn: 4.542 ± 0.831
6.272AlaPro: 6.272 ± 0.91
5.119AlaGln: 5.119 ± 0.665
7.281AlaArg: 7.281 ± 0.756
5.984AlaSer: 5.984 ± 0.75
7.714AlaThr: 7.714 ± 0.82
6.488AlaVal: 6.488 ± 0.8
1.874AlaTrp: 1.874 ± 0.392
2.019AlaTyr: 2.019 ± 0.409
0.0AlaXaa: 0.0 ± 0.0
Cys
0.865CysAla: 0.865 ± 0.384
0.144CysCys: 0.144 ± 0.092
0.505CysAsp: 0.505 ± 0.192
0.577CysGlu: 0.577 ± 0.214
0.072CysPhe: 0.072 ± 0.073
0.793CysGly: 0.793 ± 0.233
0.216CysHis: 0.216 ± 0.109
0.505CysIle: 0.505 ± 0.234
0.721CysLys: 0.721 ± 0.214
0.36CysLeu: 0.36 ± 0.166
0.0CysMet: 0.0 ± 0.0
0.216CysAsn: 0.216 ± 0.136
0.793CysPro: 0.793 ± 0.262
0.433CysGln: 0.433 ± 0.163
0.721CysArg: 0.721 ± 0.325
0.793CysSer: 0.793 ± 0.228
0.721CysThr: 0.721 ± 0.264
0.288CysVal: 0.288 ± 0.161
0.072CysTrp: 0.072 ± 0.069
0.144CysTyr: 0.144 ± 0.098
0.0CysXaa: 0.0 ± 0.0
Asp
7.209AspAla: 7.209 ± 0.754
0.505AspCys: 0.505 ± 0.218
4.253AspAsp: 4.253 ± 0.711
3.172AspGlu: 3.172 ± 0.503
2.523AspPhe: 2.523 ± 0.435
6.344AspGly: 6.344 ± 0.904
1.153AspHis: 1.153 ± 0.298
2.956AspIle: 2.956 ± 0.386
2.595AspLys: 2.595 ± 0.381
3.965AspLeu: 3.965 ± 0.423
1.586AspMet: 1.586 ± 0.309
1.514AspAsn: 1.514 ± 0.364
3.965AspPro: 3.965 ± 0.552
1.947AspGln: 1.947 ± 0.405
4.109AspArg: 4.109 ± 0.615
3.172AspSer: 3.172 ± 0.684
3.1AspThr: 3.1 ± 0.468
3.605AspVal: 3.605 ± 0.493
1.153AspTrp: 1.153 ± 0.252
1.298AspTyr: 1.298 ± 0.256
0.0AspXaa: 0.0 ± 0.0
Glu
6.777GluAla: 6.777 ± 0.829
0.433GluCys: 0.433 ± 0.183
3.244GluAsp: 3.244 ± 0.513
4.037GluGlu: 4.037 ± 0.615
1.514GluPhe: 1.514 ± 0.329
4.758GluGly: 4.758 ± 0.566
1.009GluHis: 1.009 ± 0.247
2.667GluIle: 2.667 ± 0.381
2.091GluLys: 2.091 ± 0.388
5.84GluLeu: 5.84 ± 0.612
1.658GluMet: 1.658 ± 0.282
1.226GluAsn: 1.226 ± 0.327
2.595GluPro: 2.595 ± 0.452
3.1GluGln: 3.1 ± 0.357
4.037GluArg: 4.037 ± 0.557
2.667GluSer: 2.667 ± 0.483
3.172GluThr: 3.172 ± 0.384
3.533GluVal: 3.533 ± 0.472
1.73GluTrp: 1.73 ± 0.427
1.37GluTyr: 1.37 ± 0.349
0.0GluXaa: 0.0 ± 0.0
Phe
3.244PheAla: 3.244 ± 0.478
0.649PheCys: 0.649 ± 0.253
2.307PheAsp: 2.307 ± 0.356
1.226PheGlu: 1.226 ± 0.264
0.721PhePhe: 0.721 ± 0.242
2.019PheGly: 2.019 ± 0.352
0.649PheHis: 0.649 ± 0.215
1.442PheIle: 1.442 ± 0.305
1.658PheLys: 1.658 ± 0.351
1.658PheLeu: 1.658 ± 0.352
0.577PheMet: 0.577 ± 0.209
0.937PheAsn: 0.937 ± 0.262
1.226PhePro: 1.226 ± 0.298
0.649PheGln: 0.649 ± 0.148
1.514PheArg: 1.514 ± 0.368
0.721PheSer: 0.721 ± 0.234
2.307PheThr: 2.307 ± 0.384
1.298PheVal: 1.298 ± 0.272
0.288PheTrp: 0.288 ± 0.154
0.793PheTyr: 0.793 ± 0.226
0.0PheXaa: 0.0 ± 0.0
Gly
9.372GlyAla: 9.372 ± 1.264
0.505GlyCys: 0.505 ± 0.201
4.758GlyAsp: 4.758 ± 0.556
4.47GlyGlu: 4.47 ± 0.478
2.451GlyPhe: 2.451 ± 0.4
7.353GlyGly: 7.353 ± 1.071
1.658GlyHis: 1.658 ± 0.317
3.677GlyIle: 3.677 ± 0.615
4.181GlyLys: 4.181 ± 0.579
7.353GlyLeu: 7.353 ± 0.812
2.307GlyMet: 2.307 ± 0.39
3.316GlyAsn: 3.316 ± 0.626
3.677GlyPro: 3.677 ± 0.575
3.244GlyGln: 3.244 ± 0.444
6.128GlyArg: 6.128 ± 0.803
5.046GlySer: 5.046 ± 0.81
7.209GlyThr: 7.209 ± 0.784
5.623GlyVal: 5.623 ± 0.66
2.307GlyTrp: 2.307 ± 0.487
2.091GlyTyr: 2.091 ± 0.447
0.0GlyXaa: 0.0 ± 0.0
His
1.73HisAla: 1.73 ± 0.432
0.0HisCys: 0.0 ± 0.0
1.442HisAsp: 1.442 ± 0.354
1.153HisGlu: 1.153 ± 0.275
0.433HisPhe: 0.433 ± 0.177
1.442HisGly: 1.442 ± 0.405
0.505HisHis: 0.505 ± 0.208
0.433HisIle: 0.433 ± 0.218
0.793HisLys: 0.793 ± 0.248
1.586HisLeu: 1.586 ± 0.375
0.577HisMet: 0.577 ± 0.217
0.288HisAsn: 0.288 ± 0.138
1.298HisPro: 1.298 ± 0.306
0.937HisGln: 0.937 ± 0.279
1.226HisArg: 1.226 ± 0.364
1.226HisSer: 1.226 ± 0.333
1.226HisThr: 1.226 ± 0.414
1.226HisVal: 1.226 ± 0.239
0.433HisTrp: 0.433 ± 0.207
0.649HisTyr: 0.649 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
6.056IleAla: 6.056 ± 0.601
0.505IleCys: 0.505 ± 0.178
3.533IleAsp: 3.533 ± 0.541
3.244IleGlu: 3.244 ± 0.419
0.937IlePhe: 0.937 ± 0.217
2.451IleGly: 2.451 ± 0.351
0.865IleHis: 0.865 ± 0.279
1.73IleIle: 1.73 ± 0.349
1.874IleLys: 1.874 ± 0.28
1.802IleLeu: 1.802 ± 0.304
0.793IleMet: 0.793 ± 0.184
1.442IleAsn: 1.442 ± 0.312
3.388IlePro: 3.388 ± 0.532
1.442IleGln: 1.442 ± 0.307
3.1IleArg: 3.1 ± 0.454
3.028IleSer: 3.028 ± 0.53
3.965IleThr: 3.965 ± 0.547
2.812IleVal: 2.812 ± 0.585
0.505IleTrp: 0.505 ± 0.2
1.226IleTyr: 1.226 ± 0.288
0.0IleXaa: 0.0 ± 0.0
Lys
7.281LysAla: 7.281 ± 0.991
0.36LysCys: 0.36 ± 0.15
2.523LysAsp: 2.523 ± 0.441
2.091LysGlu: 2.091 ± 0.362
0.937LysPhe: 0.937 ± 0.28
3.172LysGly: 3.172 ± 0.456
0.865LysHis: 0.865 ± 0.269
1.586LysIle: 1.586 ± 0.323
2.451LysLys: 2.451 ± 0.386
4.037LysLeu: 4.037 ± 0.467
1.009LysMet: 1.009 ± 0.242
1.947LysAsn: 1.947 ± 0.437
3.028LysPro: 3.028 ± 0.512
2.091LysGln: 2.091 ± 0.523
2.667LysArg: 2.667 ± 0.374
2.379LysSer: 2.379 ± 0.515
3.172LysThr: 3.172 ± 0.383
3.172LysVal: 3.172 ± 0.536
0.505LysTrp: 0.505 ± 0.182
1.514LysTyr: 1.514 ± 0.36
0.0LysXaa: 0.0 ± 0.0
Leu
9.733LeuAla: 9.733 ± 0.84
0.793LeuCys: 0.793 ± 0.265
5.551LeuAsp: 5.551 ± 0.719
4.181LeuGlu: 4.181 ± 0.57
1.73LeuPhe: 1.73 ± 0.344
6.344LeuGly: 6.344 ± 0.662
1.586LeuHis: 1.586 ± 0.409
3.749LeuIle: 3.749 ± 0.451
4.326LeuLys: 4.326 ± 0.699
5.551LeuLeu: 5.551 ± 0.601
1.586LeuMet: 1.586 ± 0.278
2.091LeuAsn: 2.091 ± 0.292
4.109LeuPro: 4.109 ± 0.535
3.172LeuGln: 3.172 ± 0.643
4.902LeuArg: 4.902 ± 0.798
4.614LeuSer: 4.614 ± 0.615
5.623LeuThr: 5.623 ± 0.618
4.902LeuVal: 4.902 ± 0.662
0.793LeuTrp: 0.793 ± 0.276
1.802LeuTyr: 1.802 ± 0.291
0.0LeuXaa: 0.0 ± 0.0
Met
2.956MetAla: 2.956 ± 0.399
0.072MetCys: 0.072 ± 0.065
1.37MetAsp: 1.37 ± 0.328
0.865MetGlu: 0.865 ± 0.244
0.433MetPhe: 0.433 ± 0.143
1.586MetGly: 1.586 ± 0.362
0.072MetHis: 0.072 ± 0.073
0.577MetIle: 0.577 ± 0.177
0.865MetLys: 0.865 ± 0.241
1.874MetLeu: 1.874 ± 0.362
0.144MetMet: 0.144 ± 0.099
0.288MetAsn: 0.288 ± 0.145
1.081MetPro: 1.081 ± 0.32
0.865MetGln: 0.865 ± 0.219
1.37MetArg: 1.37 ± 0.351
2.379MetSer: 2.379 ± 0.322
2.667MetThr: 2.667 ± 0.391
0.865MetVal: 0.865 ± 0.256
0.144MetTrp: 0.144 ± 0.09
0.505MetTyr: 0.505 ± 0.174
0.0MetXaa: 0.0 ± 0.0
Asn
3.677AsnAla: 3.677 ± 0.507
0.072AsnCys: 0.072 ± 0.073
1.153AsnAsp: 1.153 ± 0.254
1.226AsnGlu: 1.226 ± 0.311
0.577AsnPhe: 0.577 ± 0.187
3.533AsnGly: 3.533 ± 0.55
0.865AsnHis: 0.865 ± 0.211
2.091AsnIle: 2.091 ± 0.382
1.081AsnLys: 1.081 ± 0.294
2.091AsnLeu: 2.091 ± 0.415
0.505AsnMet: 0.505 ± 0.211
0.721AsnAsn: 0.721 ± 0.29
2.235AsnPro: 2.235 ± 0.586
1.874AsnGln: 1.874 ± 0.358
1.658AsnArg: 1.658 ± 0.354
1.802AsnSer: 1.802 ± 0.46
1.802AsnThr: 1.802 ± 0.374
2.235AsnVal: 2.235 ± 0.357
0.577AsnTrp: 0.577 ± 0.266
1.009AsnTyr: 1.009 ± 0.228
0.0AsnXaa: 0.0 ± 0.0
Pro
7.065ProAla: 7.065 ± 0.858
0.288ProCys: 0.288 ± 0.164
2.667ProAsp: 2.667 ± 0.534
4.47ProGlu: 4.47 ± 0.608
1.947ProPhe: 1.947 ± 0.413
4.758ProGly: 4.758 ± 0.592
0.937ProHis: 0.937 ± 0.284
1.73ProIle: 1.73 ± 0.322
1.874ProLys: 1.874 ± 0.393
4.109ProLeu: 4.109 ± 0.62
0.937ProMet: 0.937 ± 0.26
1.658ProAsn: 1.658 ± 0.421
3.388ProPro: 3.388 ± 0.846
2.523ProGln: 2.523 ± 0.623
3.46ProArg: 3.46 ± 0.487
3.749ProSer: 3.749 ± 0.591
3.677ProThr: 3.677 ± 0.644
4.974ProVal: 4.974 ± 0.649
0.721ProTrp: 0.721 ± 0.26
1.514ProTyr: 1.514 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
4.686GlnAla: 4.686 ± 0.799
0.288GlnCys: 0.288 ± 0.166
2.523GlnAsp: 2.523 ± 0.5
1.947GlnGlu: 1.947 ± 0.292
1.442GlnPhe: 1.442 ± 0.289
3.172GlnGly: 3.172 ± 0.51
0.36GlnHis: 0.36 ± 0.156
1.73GlnIle: 1.73 ± 0.319
1.874GlnLys: 1.874 ± 0.401
3.244GlnLeu: 3.244 ± 0.522
1.226GlnMet: 1.226 ± 0.283
1.009GlnAsn: 1.009 ± 0.239
2.74GlnPro: 2.74 ± 0.628
3.605GlnGln: 3.605 ± 1.339
2.812GlnArg: 2.812 ± 0.418
2.595GlnSer: 2.595 ± 0.484
2.667GlnThr: 2.667 ± 0.517
2.595GlnVal: 2.595 ± 0.46
0.36GlnTrp: 0.36 ± 0.196
1.226GlnTyr: 1.226 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
6.993ArgAla: 6.993 ± 0.598
0.721ArgCys: 0.721 ± 0.221
3.1ArgAsp: 3.1 ± 0.388
3.965ArgGlu: 3.965 ± 0.616
1.081ArgPhe: 1.081 ± 0.289
5.263ArgGly: 5.263 ± 0.653
1.37ArgHis: 1.37 ± 0.482
3.172ArgIle: 3.172 ± 0.431
3.316ArgLys: 3.316 ± 0.6
5.551ArgLeu: 5.551 ± 0.624
1.73ArgMet: 1.73 ± 0.279
2.163ArgAsn: 2.163 ± 0.34
3.172ArgPro: 3.172 ± 0.626
1.73ArgGln: 1.73 ± 0.309
4.902ArgArg: 4.902 ± 0.668
3.533ArgSer: 3.533 ± 0.38
4.326ArgThr: 4.326 ± 0.476
5.263ArgVal: 5.263 ± 0.778
2.091ArgTrp: 2.091 ± 0.478
1.298ArgTyr: 1.298 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
6.056SerAla: 6.056 ± 0.775
0.505SerCys: 0.505 ± 0.164
3.388SerAsp: 3.388 ± 0.513
3.46SerGlu: 3.46 ± 0.499
1.514SerPhe: 1.514 ± 0.333
5.407SerGly: 5.407 ± 0.576
0.793SerHis: 0.793 ± 0.234
3.028SerIle: 3.028 ± 0.626
3.172SerLys: 3.172 ± 0.431
3.821SerLeu: 3.821 ± 0.57
1.153SerMet: 1.153 ± 0.318
1.153SerAsn: 1.153 ± 0.316
2.667SerPro: 2.667 ± 0.397
2.595SerGln: 2.595 ± 0.391
3.1SerArg: 3.1 ± 0.506
1.802SerSer: 1.802 ± 0.395
3.749SerThr: 3.749 ± 0.461
4.47SerVal: 4.47 ± 0.559
1.081SerTrp: 1.081 ± 0.296
1.658SerTyr: 1.658 ± 0.278
0.0SerXaa: 0.0 ± 0.0
Thr
8.435ThrAla: 8.435 ± 0.833
0.577ThrCys: 0.577 ± 0.235
4.109ThrAsp: 4.109 ± 0.387
3.244ThrGlu: 3.244 ± 0.612
2.307ThrPhe: 2.307 ± 0.398
7.209ThrGly: 7.209 ± 0.702
1.586ThrHis: 1.586 ± 0.371
3.172ThrIle: 3.172 ± 0.546
2.235ThrLys: 2.235 ± 0.52
6.344ThrLeu: 6.344 ± 0.692
0.793ThrMet: 0.793 ± 0.214
2.091ThrAsn: 2.091 ± 0.43
5.046ThrPro: 5.046 ± 0.895
1.947ThrGln: 1.947 ± 0.367
4.253ThrArg: 4.253 ± 0.582
3.893ThrSer: 3.893 ± 0.698
6.056ThrThr: 6.056 ± 0.725
5.551ThrVal: 5.551 ± 0.724
0.937ThrTrp: 0.937 ± 0.259
2.451ThrTyr: 2.451 ± 0.406
0.0ThrXaa: 0.0 ± 0.0
Val
8.435ValAla: 8.435 ± 0.91
0.505ValCys: 0.505 ± 0.173
3.677ValAsp: 3.677 ± 0.455
4.542ValGlu: 4.542 ± 0.642
1.298ValPhe: 1.298 ± 0.251
5.119ValGly: 5.119 ± 0.698
1.226ValHis: 1.226 ± 0.268
3.172ValIle: 3.172 ± 0.573
3.821ValLys: 3.821 ± 0.476
4.398ValLeu: 4.398 ± 0.638
1.298ValMet: 1.298 ± 0.26
2.379ValAsn: 2.379 ± 0.435
3.749ValPro: 3.749 ± 0.475
2.235ValGln: 2.235 ± 0.584
4.326ValArg: 4.326 ± 0.54
3.533ValSer: 3.533 ± 0.434
5.623ValThr: 5.623 ± 0.737
4.253ValVal: 4.253 ± 0.66
0.937ValTrp: 0.937 ± 0.242
1.442ValTyr: 1.442 ± 0.306
0.0ValXaa: 0.0 ± 0.0
Trp
2.091TrpAla: 2.091 ± 0.462
0.505TrpCys: 0.505 ± 0.17
0.937TrpAsp: 0.937 ± 0.28
1.081TrpGlu: 1.081 ± 0.318
0.505TrpPhe: 0.505 ± 0.163
1.298TrpGly: 1.298 ± 0.323
0.36TrpHis: 0.36 ± 0.147
1.009TrpIle: 1.009 ± 0.317
0.577TrpLys: 0.577 ± 0.221
1.081TrpLeu: 1.081 ± 0.236
0.36TrpMet: 0.36 ± 0.136
0.721TrpAsn: 0.721 ± 0.221
0.577TrpPro: 0.577 ± 0.239
1.081TrpGln: 1.081 ± 0.331
1.298TrpArg: 1.298 ± 0.327
0.865TrpSer: 0.865 ± 0.232
0.865TrpThr: 0.865 ± 0.264
1.442TrpVal: 1.442 ± 0.375
0.505TrpTrp: 0.505 ± 0.245
0.072TrpTyr: 0.072 ± 0.063
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.523TyrAla: 2.523 ± 0.393
0.36TyrCys: 0.36 ± 0.19
1.442TyrAsp: 1.442 ± 0.299
1.586TyrGlu: 1.586 ± 0.258
0.577TyrPhe: 0.577 ± 0.195
1.658TyrGly: 1.658 ± 0.274
0.216TyrHis: 0.216 ± 0.132
1.009TyrIle: 1.009 ± 0.264
1.081TyrLys: 1.081 ± 0.295
2.307TyrLeu: 2.307 ± 0.487
0.433TyrMet: 0.433 ± 0.153
1.081TyrAsn: 1.081 ± 0.277
1.442TyrPro: 1.442 ± 0.323
1.514TyrGln: 1.514 ± 0.43
1.874TyrArg: 1.874 ± 0.344
0.793TyrSer: 0.793 ± 0.207
2.595TyrThr: 2.595 ± 0.403
1.37TyrVal: 1.37 ± 0.28
0.216TyrTrp: 0.216 ± 0.12
0.577TyrTyr: 0.577 ± 0.2
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13872 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski