Amino acid dipepetide frequency for Arthrobacter phage Adolin

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.868AlaAla: 16.868 ± 1.59
0.304AlaCys: 0.304 ± 0.142
7.522AlaAsp: 7.522 ± 0.867
9.194AlaGlu: 9.194 ± 0.786
3.419AlaPhe: 3.419 ± 0.473
8.51AlaGly: 8.51 ± 0.96
1.824AlaHis: 1.824 ± 0.37
5.319AlaIle: 5.319 ± 0.734
5.927AlaLys: 5.927 ± 0.775
11.777AlaLeu: 11.777 ± 1.234
3.343AlaMet: 3.343 ± 0.512
3.115AlaAsn: 3.115 ± 0.436
6.99AlaPro: 6.99 ± 1.259
4.255AlaGln: 4.255 ± 0.633
9.27AlaArg: 9.27 ± 1.028
6.079AlaSer: 6.079 ± 0.644
6.079AlaThr: 6.079 ± 0.766
8.586AlaVal: 8.586 ± 0.954
2.355AlaTrp: 2.355 ± 0.332
3.343AlaTyr: 3.343 ± 0.549
0.0AlaXaa: 0.0 ± 0.0
Cys
0.608CysAla: 0.608 ± 0.209
0.152CysCys: 0.152 ± 0.113
0.532CysAsp: 0.532 ± 0.202
0.608CysGlu: 0.608 ± 0.218
0.304CysPhe: 0.304 ± 0.141
0.608CysGly: 0.608 ± 0.183
0.228CysHis: 0.228 ± 0.129
0.152CysIle: 0.152 ± 0.117
0.228CysLys: 0.228 ± 0.169
0.228CysLeu: 0.228 ± 0.118
0.0CysMet: 0.0 ± 0.0
0.456CysAsn: 0.456 ± 0.196
0.684CysPro: 0.684 ± 0.218
0.228CysGln: 0.228 ± 0.129
0.456CysArg: 0.456 ± 0.18
0.228CysSer: 0.228 ± 0.129
0.228CysThr: 0.228 ± 0.147
0.38CysVal: 0.38 ± 0.189
0.076CysTrp: 0.076 ± 0.072
0.152CysTyr: 0.152 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
8.282AspAla: 8.282 ± 0.763
0.38AspCys: 0.38 ± 0.156
4.635AspAsp: 4.635 ± 0.537
4.787AspGlu: 4.787 ± 0.533
1.672AspPhe: 1.672 ± 0.314
6.99AspGly: 6.99 ± 0.855
1.824AspHis: 1.824 ± 0.376
1.368AspIle: 1.368 ± 0.309
1.672AspLys: 1.672 ± 0.401
7.142AspLeu: 7.142 ± 1.171
0.532AspMet: 0.532 ± 0.179
1.672AspAsn: 1.672 ± 0.337
4.103AspPro: 4.103 ± 0.516
1.292AspGln: 1.292 ± 0.301
3.951AspArg: 3.951 ± 0.597
3.647AspSer: 3.647 ± 0.413
3.419AspThr: 3.419 ± 0.467
4.483AspVal: 4.483 ± 0.575
0.988AspTrp: 0.988 ± 0.227
1.976AspTyr: 1.976 ± 0.377
0.0AspXaa: 0.0 ± 0.0
Glu
9.574GluAla: 9.574 ± 1.027
0.608GluCys: 0.608 ± 0.218
3.343GluAsp: 3.343 ± 0.546
4.407GluGlu: 4.407 ± 0.638
1.748GluPhe: 1.748 ± 0.423
4.863GluGly: 4.863 ± 0.505
1.672GluHis: 1.672 ± 0.395
3.571GluIle: 3.571 ± 0.435
2.355GluLys: 2.355 ± 0.42
5.319GluLeu: 5.319 ± 0.796
1.14GluMet: 1.14 ± 0.246
1.824GluAsn: 1.824 ± 0.335
3.343GluPro: 3.343 ± 0.641
0.988GluGln: 0.988 ± 0.264
5.091GluArg: 5.091 ± 0.731
2.735GluSer: 2.735 ± 0.399
4.483GluThr: 4.483 ± 0.636
4.787GluVal: 4.787 ± 0.635
1.596GluTrp: 1.596 ± 0.343
1.292GluTyr: 1.292 ± 0.283
0.0GluXaa: 0.0 ± 0.0
Phe
3.191PheAla: 3.191 ± 0.453
0.456PheCys: 0.456 ± 0.189
2.811PheAsp: 2.811 ± 0.44
2.355PheGlu: 2.355 ± 0.339
0.836PhePhe: 0.836 ± 0.233
2.887PheGly: 2.887 ± 0.443
0.456PheHis: 0.456 ± 0.173
1.368PheIle: 1.368 ± 0.296
1.216PheLys: 1.216 ± 0.307
2.355PheLeu: 2.355 ± 0.482
0.836PheMet: 0.836 ± 0.202
0.684PheAsn: 0.684 ± 0.223
1.444PhePro: 1.444 ± 0.319
1.064PheGln: 1.064 ± 0.356
1.672PheArg: 1.672 ± 0.334
1.824PheSer: 1.824 ± 0.355
2.279PheThr: 2.279 ± 0.4
1.824PheVal: 1.824 ± 0.416
0.304PheTrp: 0.304 ± 0.147
0.912PheTyr: 0.912 ± 0.255
0.0PheXaa: 0.0 ± 0.0
Gly
7.978GlyAla: 7.978 ± 0.865
0.456GlyCys: 0.456 ± 0.175
5.623GlyAsp: 5.623 ± 0.547
5.319GlyGlu: 5.319 ± 0.543
3.267GlyPhe: 3.267 ± 0.504
6.003GlyGly: 6.003 ± 0.67
1.444GlyHis: 1.444 ± 0.256
3.647GlyIle: 3.647 ± 0.664
4.711GlyLys: 4.711 ± 0.634
8.358GlyLeu: 8.358 ± 0.729
1.748GlyMet: 1.748 ± 0.368
3.039GlyAsn: 3.039 ± 0.59
4.255GlyPro: 4.255 ± 0.621
1.824GlyGln: 1.824 ± 0.389
6.003GlyArg: 6.003 ± 0.714
4.483GlySer: 4.483 ± 0.585
5.395GlyThr: 5.395 ± 1.002
5.851GlyVal: 5.851 ± 0.731
1.976GlyTrp: 1.976 ± 0.454
3.647GlyTyr: 3.647 ± 0.524
0.0GlyXaa: 0.0 ± 0.0
His
2.052HisAla: 2.052 ± 0.38
0.076HisCys: 0.076 ± 0.067
1.824HisAsp: 1.824 ± 0.453
1.368HisGlu: 1.368 ± 0.35
0.912HisPhe: 0.912 ± 0.245
1.748HisGly: 1.748 ± 0.341
0.456HisHis: 0.456 ± 0.177
0.76HisIle: 0.76 ± 0.229
0.608HisLys: 0.608 ± 0.245
1.292HisLeu: 1.292 ± 0.318
0.228HisMet: 0.228 ± 0.119
0.304HisAsn: 0.304 ± 0.15
1.444HisPro: 1.444 ± 0.301
0.456HisGln: 0.456 ± 0.163
1.52HisArg: 1.52 ± 0.297
0.608HisSer: 0.608 ± 0.177
1.064HisThr: 1.064 ± 0.271
1.14HisVal: 1.14 ± 0.257
0.38HisTrp: 0.38 ± 0.187
0.228HisTyr: 0.228 ± 0.121
0.0HisXaa: 0.0 ± 0.0
Ile
3.875IleAla: 3.875 ± 0.44
0.304IleCys: 0.304 ± 0.155
2.355IleAsp: 2.355 ± 0.497
2.735IleGlu: 2.735 ± 0.421
1.596IlePhe: 1.596 ± 0.357
3.571IleGly: 3.571 ± 0.732
1.14IleHis: 1.14 ± 0.243
1.064IleIle: 1.064 ± 0.295
1.368IleLys: 1.368 ± 0.364
3.343IleLeu: 3.343 ± 0.545
0.532IleMet: 0.532 ± 0.187
1.216IleAsn: 1.216 ± 0.255
2.279IlePro: 2.279 ± 0.425
2.127IleGln: 2.127 ± 0.512
4.179IleArg: 4.179 ± 0.445
2.052IleSer: 2.052 ± 0.504
2.963IleThr: 2.963 ± 0.45
2.887IleVal: 2.887 ± 0.521
0.608IleTrp: 0.608 ± 0.224
0.988IleTyr: 0.988 ± 0.286
0.0IleXaa: 0.0 ± 0.0
Lys
5.851LysAla: 5.851 ± 0.764
0.304LysCys: 0.304 ± 0.214
2.052LysAsp: 2.052 ± 0.432
2.431LysGlu: 2.431 ± 0.435
1.064LysPhe: 1.064 ± 0.26
4.027LysGly: 4.027 ± 0.527
1.14LysHis: 1.14 ± 0.337
1.976LysIle: 1.976 ± 0.322
1.976LysLys: 1.976 ± 0.62
3.419LysLeu: 3.419 ± 0.561
0.912LysMet: 0.912 ± 0.281
1.216LysAsn: 1.216 ± 0.282
2.583LysPro: 2.583 ± 0.392
0.988LysGln: 0.988 ± 0.323
3.267LysArg: 3.267 ± 0.534
1.976LysSer: 1.976 ± 0.384
2.507LysThr: 2.507 ± 0.409
3.267LysVal: 3.267 ± 0.471
0.912LysTrp: 0.912 ± 0.23
0.532LysTyr: 0.532 ± 0.184
0.0LysXaa: 0.0 ± 0.0
Leu
13.221LeuAla: 13.221 ± 1.072
0.532LeuCys: 0.532 ± 0.2
6.99LeuAsp: 6.99 ± 0.661
3.723LeuGlu: 3.723 ± 0.572
2.127LeuPhe: 2.127 ± 0.362
7.75LeuGly: 7.75 ± 0.847
1.52LeuHis: 1.52 ± 0.417
3.799LeuIle: 3.799 ± 0.523
2.431LeuLys: 2.431 ± 0.439
6.838LeuLeu: 6.838 ± 0.638
1.216LeuMet: 1.216 ± 0.273
2.431LeuAsn: 2.431 ± 0.367
4.787LeuPro: 4.787 ± 0.572
2.963LeuGln: 2.963 ± 0.495
7.294LeuArg: 7.294 ± 0.691
5.091LeuSer: 5.091 ± 0.637
6.382LeuThr: 6.382 ± 0.667
5.091LeuVal: 5.091 ± 0.529
1.368LeuTrp: 1.368 ± 0.26
2.052LeuTyr: 2.052 ± 0.413
0.0LeuXaa: 0.0 ± 0.0
Met
3.115MetAla: 3.115 ± 0.58
0.152MetCys: 0.152 ± 0.107
1.064MetAsp: 1.064 ± 0.287
0.836MetGlu: 0.836 ± 0.216
0.304MetPhe: 0.304 ± 0.153
1.216MetGly: 1.216 ± 0.438
0.228MetHis: 0.228 ± 0.12
1.444MetIle: 1.444 ± 0.308
1.064MetLys: 1.064 ± 0.28
1.368MetLeu: 1.368 ± 0.352
0.228MetMet: 0.228 ± 0.117
0.76MetAsn: 0.76 ± 0.236
1.064MetPro: 1.064 ± 0.316
0.532MetGln: 0.532 ± 0.16
1.216MetArg: 1.216 ± 0.259
1.444MetSer: 1.444 ± 0.279
2.127MetThr: 2.127 ± 0.46
1.064MetVal: 1.064 ± 0.222
0.0MetTrp: 0.0 ± 0.0
0.076MetTyr: 0.076 ± 0.087
0.0MetXaa: 0.0 ± 0.0
Asn
3.495AsnAla: 3.495 ± 0.504
0.0AsnCys: 0.0 ± 0.0
1.976AsnAsp: 1.976 ± 0.273
0.532AsnGlu: 0.532 ± 0.199
0.456AsnPhe: 0.456 ± 0.154
3.571AsnGly: 3.571 ± 0.463
0.38AsnHis: 0.38 ± 0.189
0.988AsnIle: 0.988 ± 0.314
0.76AsnLys: 0.76 ± 0.2
2.659AsnLeu: 2.659 ± 0.386
1.14AsnMet: 1.14 ± 0.278
0.76AsnAsn: 0.76 ± 0.245
2.507AsnPro: 2.507 ± 0.48
0.836AsnGln: 0.836 ± 0.309
1.14AsnArg: 1.14 ± 0.267
1.216AsnSer: 1.216 ± 0.275
2.203AsnThr: 2.203 ± 0.552
1.976AsnVal: 1.976 ± 0.597
0.38AsnTrp: 0.38 ± 0.159
0.684AsnTyr: 0.684 ± 0.204
0.0AsnXaa: 0.0 ± 0.0
Pro
7.826ProAla: 7.826 ± 1.227
0.456ProCys: 0.456 ± 0.163
3.723ProAsp: 3.723 ± 0.472
4.711ProGlu: 4.711 ± 0.72
1.976ProPhe: 1.976 ± 0.381
5.547ProGly: 5.547 ± 0.696
0.912ProHis: 0.912 ± 0.259
2.583ProIle: 2.583 ± 0.522
2.811ProLys: 2.811 ± 0.478
3.723ProLeu: 3.723 ± 0.587
1.14ProMet: 1.14 ± 0.26
0.988ProAsn: 0.988 ± 0.229
1.976ProPro: 1.976 ± 0.613
1.064ProGln: 1.064 ± 0.274
2.963ProArg: 2.963 ± 0.493
2.735ProSer: 2.735 ± 0.427
2.963ProThr: 2.963 ± 0.558
4.407ProVal: 4.407 ± 0.547
0.836ProTrp: 0.836 ± 0.307
1.216ProTyr: 1.216 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
2.887GlnAla: 2.887 ± 0.482
0.608GlnCys: 0.608 ± 0.216
1.064GlnAsp: 1.064 ± 0.321
1.596GlnGlu: 1.596 ± 0.322
0.988GlnPhe: 0.988 ± 0.306
1.596GlnGly: 1.596 ± 0.398
0.684GlnHis: 0.684 ± 0.206
1.52GlnIle: 1.52 ± 0.32
1.596GlnLys: 1.596 ± 0.346
1.824GlnLeu: 1.824 ± 0.343
1.14GlnMet: 1.14 ± 0.493
0.836GlnAsn: 0.836 ± 0.287
0.836GlnPro: 0.836 ± 0.222
0.228GlnGln: 0.228 ± 0.12
2.127GlnArg: 2.127 ± 0.402
1.824GlnSer: 1.824 ± 0.339
2.659GlnThr: 2.659 ± 0.51
2.052GlnVal: 2.052 ± 0.401
0.532GlnTrp: 0.532 ± 0.201
0.38GlnTyr: 0.38 ± 0.163
0.0GlnXaa: 0.0 ± 0.0
Arg
7.37ArgAla: 7.37 ± 0.777
0.304ArgCys: 0.304 ± 0.128
5.547ArgAsp: 5.547 ± 0.596
5.623ArgGlu: 5.623 ± 0.624
2.279ArgPhe: 2.279 ± 0.37
5.015ArgGly: 5.015 ± 0.674
0.988ArgHis: 0.988 ± 0.254
2.659ArgIle: 2.659 ± 0.511
3.723ArgLys: 3.723 ± 0.52
7.446ArgLeu: 7.446 ± 0.917
1.292ArgMet: 1.292 ± 0.319
1.976ArgAsn: 1.976 ± 0.371
3.115ArgPro: 3.115 ± 0.45
2.583ArgGln: 2.583 ± 0.388
6.307ArgArg: 6.307 ± 1.164
3.875ArgSer: 3.875 ± 0.748
3.647ArgThr: 3.647 ± 0.56
5.699ArgVal: 5.699 ± 0.563
1.14ArgTrp: 1.14 ± 0.268
2.735ArgTyr: 2.735 ± 0.613
0.0ArgXaa: 0.0 ± 0.0
Ser
6.534SerAla: 6.534 ± 0.788
0.228SerCys: 0.228 ± 0.13
2.887SerAsp: 2.887 ± 0.431
2.507SerGlu: 2.507 ± 0.369
1.9SerPhe: 1.9 ± 0.377
6.079SerGly: 6.079 ± 0.783
0.532SerHis: 0.532 ± 0.171
2.507SerIle: 2.507 ± 0.572
2.583SerLys: 2.583 ± 0.538
4.027SerLeu: 4.027 ± 0.589
0.988SerMet: 0.988 ± 0.273
1.444SerAsn: 1.444 ± 0.299
3.495SerPro: 3.495 ± 0.667
1.292SerGln: 1.292 ± 0.319
3.039SerArg: 3.039 ± 0.45
2.811SerSer: 2.811 ± 0.536
4.103SerThr: 4.103 ± 0.484
4.635SerVal: 4.635 ± 0.711
0.912SerTrp: 0.912 ± 0.245
1.444SerTyr: 1.444 ± 0.314
0.0SerXaa: 0.0 ± 0.0
Thr
7.446ThrAla: 7.446 ± 0.732
0.456ThrCys: 0.456 ± 0.248
3.799ThrAsp: 3.799 ± 0.363
4.939ThrGlu: 4.939 ± 0.736
2.507ThrPhe: 2.507 ± 0.368
6.003ThrGly: 6.003 ± 0.722
0.76ThrHis: 0.76 ± 0.215
1.748ThrIle: 1.748 ± 0.352
2.735ThrLys: 2.735 ± 0.36
6.382ThrLeu: 6.382 ± 0.635
0.684ThrMet: 0.684 ± 0.246
1.292ThrAsn: 1.292 ± 0.263
4.179ThrPro: 4.179 ± 0.699
1.14ThrGln: 1.14 ± 0.308
5.091ThrArg: 5.091 ± 0.783
4.255ThrSer: 4.255 ± 0.631
4.787ThrThr: 4.787 ± 0.83
4.483ThrVal: 4.483 ± 0.584
0.684ThrTrp: 0.684 ± 0.293
1.292ThrTyr: 1.292 ± 0.287
0.0ThrXaa: 0.0 ± 0.0
Val
9.194ValAla: 9.194 ± 0.864
0.608ValCys: 0.608 ± 0.246
4.027ValAsp: 4.027 ± 0.484
4.179ValGlu: 4.179 ± 0.682
2.279ValPhe: 2.279 ± 0.39
4.787ValGly: 4.787 ± 0.619
1.368ValHis: 1.368 ± 0.363
2.659ValIle: 2.659 ± 0.493
2.887ValLys: 2.887 ± 0.42
6.686ValLeu: 6.686 ± 0.574
1.216ValMet: 1.216 ± 0.284
2.279ValAsn: 2.279 ± 0.497
3.951ValPro: 3.951 ± 0.501
2.052ValGln: 2.052 ± 0.413
5.395ValArg: 5.395 ± 0.745
4.483ValSer: 4.483 ± 0.605
3.647ValThr: 3.647 ± 0.533
5.091ValVal: 5.091 ± 0.773
1.292ValTrp: 1.292 ± 0.322
2.507ValTyr: 2.507 ± 0.476
0.0ValXaa: 0.0 ± 0.0
Trp
1.976TrpAla: 1.976 ± 0.327
0.0TrpCys: 0.0 ± 0.0
1.216TrpAsp: 1.216 ± 0.314
1.368TrpGlu: 1.368 ± 0.337
0.304TrpPhe: 0.304 ± 0.175
1.368TrpGly: 1.368 ± 0.324
0.304TrpHis: 0.304 ± 0.139
0.836TrpIle: 0.836 ± 0.243
0.608TrpLys: 0.608 ± 0.209
1.14TrpLeu: 1.14 ± 0.265
0.38TrpMet: 0.38 ± 0.147
0.684TrpAsn: 0.684 ± 0.214
0.076TrpPro: 0.076 ± 0.072
0.532TrpGln: 0.532 ± 0.189
1.596TrpArg: 1.596 ± 0.291
1.52TrpSer: 1.52 ± 0.444
1.596TrpThr: 1.596 ± 0.316
0.988TrpVal: 0.988 ± 0.281
0.152TrpTrp: 0.152 ± 0.106
0.38TrpTyr: 0.38 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.735TyrAla: 2.735 ± 0.533
0.152TyrCys: 0.152 ± 0.104
1.824TyrAsp: 1.824 ± 0.413
1.52TyrGlu: 1.52 ± 0.366
0.836TyrPhe: 0.836 ± 0.235
2.887TyrGly: 2.887 ± 0.554
0.608TyrHis: 0.608 ± 0.177
1.216TyrIle: 1.216 ± 0.343
1.14TyrLys: 1.14 ± 0.274
2.279TyrLeu: 2.279 ± 0.401
0.532TyrMet: 0.532 ± 0.193
0.684TyrAsn: 0.684 ± 0.338
1.444TyrPro: 1.444 ± 0.322
0.532TyrGln: 0.532 ± 0.169
1.748TyrArg: 1.748 ± 0.343
1.14TyrSer: 1.14 ± 0.314
2.127TyrThr: 2.127 ± 0.409
1.9TyrVal: 1.9 ± 0.431
0.532TyrTrp: 0.532 ± 0.197
0.532TyrTyr: 0.532 ± 0.245
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (13162 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski