Amino acid dipepetide frequency for Tetrasphaera phage TJE1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.135AlaAla: 9.135 ± 0.97
1.133AlaCys: 1.133 ± 0.333
4.674AlaAsp: 4.674 ± 0.687
5.736AlaGlu: 5.736 ± 1.283
3.328AlaPhe: 3.328 ± 0.486
6.444AlaGly: 6.444 ± 0.795
1.345AlaHis: 1.345 ± 0.296
4.815AlaIle: 4.815 ± 0.634
5.382AlaLys: 5.382 ± 0.977
8.922AlaLeu: 8.922 ± 0.856
2.337AlaMet: 2.337 ± 0.419
3.753AlaAsn: 3.753 ± 0.557
3.753AlaPro: 3.753 ± 0.6
3.895AlaGln: 3.895 ± 0.553
4.461AlaArg: 4.461 ± 0.642
6.585AlaSer: 6.585 ± 0.826
5.098AlaThr: 5.098 ± 0.56
5.736AlaVal: 5.736 ± 0.587
0.921AlaTrp: 0.921 ± 0.24
3.187AlaTyr: 3.187 ± 0.518
0.0AlaXaa: 0.0 ± 0.0
Cys
0.779CysAla: 0.779 ± 0.295
0.071CysCys: 0.071 ± 0.06
0.496CysAsp: 0.496 ± 0.169
0.566CysGlu: 0.566 ± 0.201
0.566CysPhe: 0.566 ± 0.181
0.921CysGly: 0.921 ± 0.257
0.212CysHis: 0.212 ± 0.123
0.496CysIle: 0.496 ± 0.187
0.354CysLys: 0.354 ± 0.154
0.779CysLeu: 0.779 ± 0.202
0.212CysMet: 0.212 ± 0.118
0.212CysAsn: 0.212 ± 0.121
0.283CysPro: 0.283 ± 0.143
0.212CysGln: 0.212 ± 0.113
0.566CysArg: 0.566 ± 0.171
0.354CysSer: 0.354 ± 0.171
0.496CysThr: 0.496 ± 0.166
0.425CysVal: 0.425 ± 0.199
0.212CysTrp: 0.212 ± 0.12
0.212CysTyr: 0.212 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
6.161AspAla: 6.161 ± 0.707
0.921AspCys: 0.921 ± 0.236
1.983AspAsp: 1.983 ± 0.4
3.611AspGlu: 3.611 ± 0.628
2.762AspPhe: 2.762 ± 0.386
4.603AspGly: 4.603 ± 0.581
1.204AspHis: 1.204 ± 0.333
2.337AspIle: 2.337 ± 0.438
2.832AspLys: 2.832 ± 0.489
5.736AspLeu: 5.736 ± 0.674
1.062AspMet: 1.062 ± 0.317
1.629AspAsn: 1.629 ± 0.294
3.682AspPro: 3.682 ± 0.56
2.408AspGln: 2.408 ± 0.409
2.408AspArg: 2.408 ± 0.353
3.116AspSer: 3.116 ± 0.462
2.974AspThr: 2.974 ± 0.434
3.257AspVal: 3.257 ± 0.4
1.345AspTrp: 1.345 ± 0.279
1.699AspTyr: 1.699 ± 0.417
0.0AspXaa: 0.0 ± 0.0
Glu
7.364GluAla: 7.364 ± 1.444
0.283GluCys: 0.283 ± 0.142
3.682GluAsp: 3.682 ± 0.55
6.798GluGlu: 6.798 ± 1.268
3.257GluPhe: 3.257 ± 0.529
3.187GluGly: 3.187 ± 0.475
0.85GluHis: 0.85 ± 0.209
3.045GluIle: 3.045 ± 0.39
3.257GluLys: 3.257 ± 0.61
5.169GluLeu: 5.169 ± 0.626
1.77GluMet: 1.77 ± 0.353
2.054GluAsn: 2.054 ± 0.384
2.762GluPro: 2.762 ± 0.775
1.841GluGln: 1.841 ± 0.28
4.744GluArg: 4.744 ± 0.913
3.399GluSer: 3.399 ± 0.573
3.116GluThr: 3.116 ± 0.443
4.461GluVal: 4.461 ± 0.646
1.345GluTrp: 1.345 ± 0.351
1.487GluTyr: 1.487 ± 0.359
0.0GluXaa: 0.0 ± 0.0
Phe
3.47PheAla: 3.47 ± 0.601
0.566PheCys: 0.566 ± 0.167
3.47PheAsp: 3.47 ± 0.504
2.62PheGlu: 2.62 ± 0.5
1.629PhePhe: 1.629 ± 0.367
3.328PheGly: 3.328 ± 0.534
0.85PheHis: 0.85 ± 0.198
2.054PheIle: 2.054 ± 0.393
2.549PheLys: 2.549 ± 0.425
3.682PheLeu: 3.682 ± 0.544
0.779PheMet: 0.779 ± 0.219
1.487PheAsn: 1.487 ± 0.36
1.983PhePro: 1.983 ± 0.399
1.345PheGln: 1.345 ± 0.36
2.054PheArg: 2.054 ± 0.413
2.832PheSer: 2.832 ± 0.517
2.903PheThr: 2.903 ± 0.445
2.266PheVal: 2.266 ± 0.362
0.708PheTrp: 0.708 ± 0.209
1.345PheTyr: 1.345 ± 0.324
0.0PheXaa: 0.0 ± 0.0
Gly
6.019GlyAla: 6.019 ± 0.668
0.496GlyCys: 0.496 ± 0.192
4.178GlyAsp: 4.178 ± 0.508
3.965GlyGlu: 3.965 ± 0.547
3.47GlyPhe: 3.47 ± 0.424
6.161GlyGly: 6.161 ± 0.686
1.416GlyHis: 1.416 ± 0.402
5.098GlyIle: 5.098 ± 0.643
4.744GlyLys: 4.744 ± 0.557
6.869GlyLeu: 6.869 ± 0.573
2.62GlyMet: 2.62 ± 0.42
3.682GlyAsn: 3.682 ± 0.57
3.682GlyPro: 3.682 ± 0.551
3.257GlyGln: 3.257 ± 0.459
4.32GlyArg: 4.32 ± 0.74
4.036GlySer: 4.036 ± 0.623
5.169GlyThr: 5.169 ± 0.681
6.585GlyVal: 6.585 ± 0.736
1.487GlyTrp: 1.487 ± 0.354
3.257GlyTyr: 3.257 ± 0.555
0.0GlyXaa: 0.0 ± 0.0
His
0.85HisAla: 0.85 ± 0.216
0.142HisCys: 0.142 ± 0.091
0.708HisAsp: 0.708 ± 0.207
1.275HisGlu: 1.275 ± 0.316
0.85HisPhe: 0.85 ± 0.208
1.983HisGly: 1.983 ± 0.421
0.779HisHis: 0.779 ± 0.259
0.566HisIle: 0.566 ± 0.196
0.85HisLys: 0.85 ± 0.275
1.629HisLeu: 1.629 ± 0.309
0.425HisMet: 0.425 ± 0.186
0.637HisAsn: 0.637 ± 0.216
1.204HisPro: 1.204 ± 0.303
0.425HisGln: 0.425 ± 0.171
1.275HisArg: 1.275 ± 0.34
0.566HisSer: 0.566 ± 0.202
0.921HisThr: 0.921 ± 0.253
0.991HisVal: 0.991 ± 0.295
0.354HisTrp: 0.354 ± 0.145
0.779HisTyr: 0.779 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
4.036IleAla: 4.036 ± 0.502
0.85IleCys: 0.85 ± 0.233
2.903IleAsp: 2.903 ± 0.472
3.116IleGlu: 3.116 ± 0.414
1.416IlePhe: 1.416 ± 0.301
4.603IleGly: 4.603 ± 0.565
1.133IleHis: 1.133 ± 0.263
2.266IleIle: 2.266 ± 0.478
1.912IleLys: 1.912 ± 0.344
6.019IleLeu: 6.019 ± 0.557
0.708IleMet: 0.708 ± 0.23
1.558IleAsn: 1.558 ± 0.261
2.691IlePro: 2.691 ± 0.443
2.762IleGln: 2.762 ± 0.343
3.045IleArg: 3.045 ± 0.42
3.824IleSer: 3.824 ± 0.509
2.974IleThr: 2.974 ± 0.522
2.903IleVal: 2.903 ± 0.486
0.991IleTrp: 0.991 ± 0.238
1.629IleTyr: 1.629 ± 0.396
0.0IleXaa: 0.0 ± 0.0
Lys
6.444LysAla: 6.444 ± 0.805
0.142LysCys: 0.142 ± 0.105
3.257LysAsp: 3.257 ± 0.481
4.744LysGlu: 4.744 ± 0.794
1.77LysPhe: 1.77 ± 0.389
4.107LysGly: 4.107 ± 0.445
0.637LysHis: 0.637 ± 0.197
2.549LysIle: 2.549 ± 0.442
4.32LysLys: 4.32 ± 0.823
3.045LysLeu: 3.045 ± 0.571
2.195LysMet: 2.195 ± 0.41
2.195LysAsn: 2.195 ± 0.402
2.903LysPro: 2.903 ± 0.476
2.691LysGln: 2.691 ± 0.575
3.257LysArg: 3.257 ± 0.64
2.762LysSer: 2.762 ± 0.319
3.611LysThr: 3.611 ± 0.457
3.47LysVal: 3.47 ± 0.498
0.85LysTrp: 0.85 ± 0.23
0.991LysTyr: 0.991 ± 0.246
0.0LysXaa: 0.0 ± 0.0
Leu
8.781LeuAla: 8.781 ± 0.863
0.637LeuCys: 0.637 ± 0.296
4.674LeuAsp: 4.674 ± 0.506
5.382LeuGlu: 5.382 ± 0.625
2.832LeuPhe: 2.832 ± 0.416
5.594LeuGly: 5.594 ± 0.706
1.062LeuHis: 1.062 ± 0.26
4.674LeuIle: 4.674 ± 0.669
5.24LeuLys: 5.24 ± 0.661
6.019LeuLeu: 6.019 ± 0.752
1.699LeuMet: 1.699 ± 0.426
3.541LeuAsn: 3.541 ± 0.509
4.036LeuPro: 4.036 ± 0.605
3.895LeuGln: 3.895 ± 0.687
3.965LeuArg: 3.965 ± 0.593
5.807LeuSer: 5.807 ± 0.607
4.886LeuThr: 4.886 ± 0.492
6.727LeuVal: 6.727 ± 0.752
1.487LeuTrp: 1.487 ± 0.256
2.762LeuTyr: 2.762 ± 0.46
0.0LeuXaa: 0.0 ± 0.0
Met
2.408MetAla: 2.408 ± 0.389
0.142MetCys: 0.142 ± 0.093
1.416MetAsp: 1.416 ± 0.268
1.062MetGlu: 1.062 ± 0.272
0.708MetPhe: 0.708 ± 0.25
1.77MetGly: 1.77 ± 0.376
0.212MetHis: 0.212 ± 0.124
1.77MetIle: 1.77 ± 0.395
1.133MetLys: 1.133 ± 0.269
1.841MetLeu: 1.841 ± 0.303
0.566MetMet: 0.566 ± 0.195
1.062MetAsn: 1.062 ± 0.384
1.345MetPro: 1.345 ± 0.301
0.708MetGln: 0.708 ± 0.206
1.487MetArg: 1.487 ± 0.268
2.054MetSer: 2.054 ± 0.338
2.408MetThr: 2.408 ± 0.408
2.054MetVal: 2.054 ± 0.387
0.142MetTrp: 0.142 ± 0.087
0.637MetTyr: 0.637 ± 0.223
0.0MetXaa: 0.0 ± 0.0
Asn
3.965AsnAla: 3.965 ± 0.608
0.425AsnCys: 0.425 ± 0.167
1.912AsnAsp: 1.912 ± 0.333
2.691AsnGlu: 2.691 ± 0.468
1.345AsnPhe: 1.345 ± 0.326
3.47AsnGly: 3.47 ± 0.477
0.425AsnHis: 0.425 ± 0.155
1.629AsnIle: 1.629 ± 0.315
1.133AsnLys: 1.133 ± 0.304
3.611AsnLeu: 3.611 ± 0.614
0.779AsnMet: 0.779 ± 0.285
1.133AsnAsn: 1.133 ± 0.239
2.762AsnPro: 2.762 ± 0.516
1.841AsnGln: 1.841 ± 0.356
2.054AsnArg: 2.054 ± 0.408
2.266AsnSer: 2.266 ± 0.446
1.629AsnThr: 1.629 ± 0.396
2.549AsnVal: 2.549 ± 0.451
0.85AsnTrp: 0.85 ± 0.278
1.345AsnTyr: 1.345 ± 0.31
0.0AsnXaa: 0.0 ± 0.0
Pro
4.107ProAla: 4.107 ± 0.528
0.566ProCys: 0.566 ± 0.202
3.541ProAsp: 3.541 ± 0.523
4.461ProGlu: 4.461 ± 0.868
1.77ProPhe: 1.77 ± 0.309
5.452ProGly: 5.452 ± 0.542
0.637ProHis: 0.637 ± 0.21
2.195ProIle: 2.195 ± 0.415
2.478ProLys: 2.478 ± 0.463
3.257ProLeu: 3.257 ± 0.541
1.345ProMet: 1.345 ± 0.271
1.629ProAsn: 1.629 ± 0.319
2.974ProPro: 2.974 ± 0.614
2.124ProGln: 2.124 ± 0.376
2.266ProArg: 2.266 ± 0.495
3.965ProSer: 3.965 ± 0.676
3.541ProThr: 3.541 ± 0.493
3.045ProVal: 3.045 ± 0.45
1.133ProTrp: 1.133 ± 0.304
1.062ProTyr: 1.062 ± 0.226
0.0ProXaa: 0.0 ± 0.0
Gln
4.603GlnAla: 4.603 ± 0.715
0.071GlnCys: 0.071 ± 0.067
2.549GlnAsp: 2.549 ± 0.379
2.762GlnGlu: 2.762 ± 0.401
1.699GlnPhe: 1.699 ± 0.447
4.178GlnGly: 4.178 ± 0.453
0.991GlnHis: 0.991 ± 0.207
1.699GlnIle: 1.699 ± 0.255
2.62GlnLys: 2.62 ± 0.664
3.116GlnLeu: 3.116 ± 0.46
0.85GlnMet: 0.85 ± 0.281
1.912GlnAsn: 1.912 ± 0.443
1.983GlnPro: 1.983 ± 0.39
1.912GlnGln: 1.912 ± 0.397
1.416GlnArg: 1.416 ± 0.293
2.691GlnSer: 2.691 ± 0.412
2.903GlnThr: 2.903 ± 0.55
2.478GlnVal: 2.478 ± 0.4
0.921GlnTrp: 0.921 ± 0.282
1.062GlnTyr: 1.062 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
3.399ArgAla: 3.399 ± 0.586
0.354ArgCys: 0.354 ± 0.176
3.045ArgAsp: 3.045 ± 0.455
3.116ArgGlu: 3.116 ± 0.688
2.408ArgPhe: 2.408 ± 0.41
4.249ArgGly: 4.249 ± 0.575
1.204ArgHis: 1.204 ± 0.331
2.974ArgIle: 2.974 ± 0.377
4.178ArgLys: 4.178 ± 0.568
4.532ArgLeu: 4.532 ± 0.693
2.337ArgMet: 2.337 ± 0.38
1.983ArgAsn: 1.983 ± 0.455
2.549ArgPro: 2.549 ± 0.517
2.337ArgGln: 2.337 ± 0.375
3.753ArgArg: 3.753 ± 0.597
2.974ArgSer: 2.974 ± 0.453
2.054ArgThr: 2.054 ± 0.378
3.328ArgVal: 3.328 ± 0.416
0.921ArgTrp: 0.921 ± 0.24
1.912ArgTyr: 1.912 ± 0.334
0.0ArgXaa: 0.0 ± 0.0
Ser
4.532SerAla: 4.532 ± 0.545
0.496SerCys: 0.496 ± 0.237
3.682SerAsp: 3.682 ± 0.46
1.912SerGlu: 1.912 ± 0.463
3.47SerPhe: 3.47 ± 0.476
6.373SerGly: 6.373 ± 0.908
0.85SerHis: 0.85 ± 0.218
3.257SerIle: 3.257 ± 0.525
3.611SerLys: 3.611 ± 0.566
4.815SerLeu: 4.815 ± 0.626
1.558SerMet: 1.558 ± 0.364
2.195SerAsn: 2.195 ± 0.402
2.549SerPro: 2.549 ± 0.479
3.328SerGln: 3.328 ± 0.473
3.116SerArg: 3.116 ± 0.607
3.47SerSer: 3.47 ± 0.531
4.532SerThr: 4.532 ± 0.707
4.886SerVal: 4.886 ± 0.562
0.921SerTrp: 0.921 ± 0.24
2.124SerTyr: 2.124 ± 0.326
0.0SerXaa: 0.0 ± 0.0
Thr
4.815ThrAla: 4.815 ± 0.662
0.425ThrCys: 0.425 ± 0.185
3.328ThrAsp: 3.328 ± 0.498
2.903ThrGlu: 2.903 ± 0.463
2.832ThrPhe: 2.832 ± 0.459
5.24ThrGly: 5.24 ± 0.643
0.991ThrHis: 0.991 ± 0.262
4.178ThrIle: 4.178 ± 0.455
3.541ThrLys: 3.541 ± 0.505
4.036ThrLeu: 4.036 ± 0.614
1.558ThrMet: 1.558 ± 0.313
2.337ThrAsn: 2.337 ± 0.419
3.328ThrPro: 3.328 ± 0.556
2.266ThrGln: 2.266 ± 0.405
3.045ThrArg: 3.045 ± 0.729
3.824ThrSer: 3.824 ± 0.61
4.249ThrThr: 4.249 ± 0.627
4.815ThrVal: 4.815 ± 0.656
1.416ThrTrp: 1.416 ± 0.271
2.266ThrTyr: 2.266 ± 0.454
0.0ThrXaa: 0.0 ± 0.0
Val
5.877ValAla: 5.877 ± 0.65
0.425ValCys: 0.425 ± 0.207
3.257ValAsp: 3.257 ± 0.473
4.32ValGlu: 4.32 ± 0.523
3.611ValPhe: 3.611 ± 0.501
4.957ValGly: 4.957 ± 0.724
1.062ValHis: 1.062 ± 0.296
3.328ValIle: 3.328 ± 0.514
3.257ValLys: 3.257 ± 0.426
5.807ValLeu: 5.807 ± 0.636
0.991ValMet: 0.991 ± 0.275
2.62ValAsn: 2.62 ± 0.397
4.39ValPro: 4.39 ± 0.617
3.045ValGln: 3.045 ± 0.506
3.541ValArg: 3.541 ± 0.459
4.107ValSer: 4.107 ± 0.488
5.098ValThr: 5.098 ± 0.757
5.382ValVal: 5.382 ± 0.704
1.416ValTrp: 1.416 ± 0.303
1.841ValTyr: 1.841 ± 0.255
0.0ValXaa: 0.0 ± 0.0
Trp
1.629TrpAla: 1.629 ± 0.36
0.212TrpCys: 0.212 ± 0.121
1.487TrpAsp: 1.487 ± 0.331
1.062TrpGlu: 1.062 ± 0.303
0.566TrpPhe: 0.566 ± 0.173
1.275TrpGly: 1.275 ± 0.317
0.425TrpHis: 0.425 ± 0.149
0.921TrpIle: 0.921 ± 0.265
1.062TrpLys: 1.062 ± 0.285
1.699TrpLeu: 1.699 ± 0.33
0.496TrpMet: 0.496 ± 0.202
0.566TrpAsn: 0.566 ± 0.211
0.921TrpPro: 0.921 ± 0.266
0.425TrpGln: 0.425 ± 0.158
0.991TrpArg: 0.991 ± 0.26
1.204TrpSer: 1.204 ± 0.267
1.204TrpThr: 1.204 ± 0.324
1.133TrpVal: 1.133 ± 0.258
0.566TrpTrp: 0.566 ± 0.244
0.566TrpTyr: 0.566 ± 0.235
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.266TyrAla: 2.266 ± 0.336
0.071TyrCys: 0.071 ± 0.066
1.487TyrAsp: 1.487 ± 0.277
1.77TyrGlu: 1.77 ± 0.361
1.629TyrPhe: 1.629 ± 0.344
2.62TyrGly: 2.62 ± 0.39
0.921TyrHis: 0.921 ± 0.254
1.487TyrIle: 1.487 ± 0.416
1.487TyrLys: 1.487 ± 0.356
3.116TyrLeu: 3.116 ± 0.479
0.566TyrMet: 0.566 ± 0.185
1.629TyrAsn: 1.629 ± 0.44
1.77TyrPro: 1.77 ± 0.407
1.699TyrGln: 1.699 ± 0.358
1.912TyrArg: 1.912 ± 0.337
1.841TyrSer: 1.841 ± 0.347
1.629TyrThr: 1.629 ± 0.35
1.841TyrVal: 1.841 ± 0.426
0.425TyrTrp: 0.425 ± 0.162
0.425TyrTyr: 0.425 ± 0.203
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (14123 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski