Amino acid dipeptide frequency for Burkholderia phage JG068

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
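The calculation described above can be illustrated with a minimal Python sketch. It is not the code used to build this page: the function names, the counting of overlapping dipeptides within each protein, and the mapping of every non-standard letter to X (Xaa) are assumptions made for illustration. The comparison against the uniform expectation of 2.5‰ follows the reasoning in the paragraph above; the exact thresholds used for the red and blue marking are not stated on the page.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard residues
UNIFORM_PER_MILLE = 1000.0 / 400        # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are counted as overlapping pairs inside each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(per_mille):
    """Compare a per-mille frequency with the uniform expectation of 2.5 per mille."""
    if per_mille > UNIFORM_PER_MILLE:
        return "over"
    if per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

With the values from the table, classify(18.009) gives "over" for AlaAla and classify(0.843) gives "under" for AlaCys.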

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order (first residue as row heading, second residue within each row): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.009 ± 1.892
AlaCys: 0.843 ± 0.305
AlaAsp: 7.28 ± 1.207
AlaGlu: 6.897 ± 0.616
AlaPhe: 3.219 ± 0.498
AlaGly: 8.89 ± 1.065
AlaHis: 1.839 ± 0.347
AlaIle: 4.138 ± 0.764
AlaLys: 5.441 ± 0.748
AlaLeu: 8.736 ± 1.004
AlaMet: 3.985 ± 0.532
AlaAsn: 4.828 ± 1.048
AlaPro: 4.062 ± 0.733
AlaGln: 5.901 ± 0.874
AlaArg: 8.353 ± 0.937
AlaSer: 6.744 ± 0.95
AlaThr: 6.131 ± 0.93
AlaVal: 7.204 ± 0.853
AlaTrp: 1.916 ± 0.414
AlaTyr: 3.602 ± 0.581
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.92 ± 0.276
CysCys: 0.0 ± 0.0
CysAsp: 0.536 ± 0.217
CysGlu: 0.383 ± 0.178
CysPhe: 0.153 ± 0.103
CysGly: 0.536 ± 0.257
CysHis: 0.077 ± 0.07
CysIle: 0.383 ± 0.178
CysLys: 0.613 ± 0.266
CysLeu: 0.843 ± 0.329
CysMet: 0.153 ± 0.113
CysAsn: 0.23 ± 0.135
CysPro: 0.307 ± 0.141
CysGln: 0.23 ± 0.168
CysArg: 0.383 ± 0.158
CysSer: 0.69 ± 0.226
CysThr: 0.077 ± 0.074
CysVal: 0.536 ± 0.271
CysTrp: 0.0 ± 0.0
CysTyr: 0.153 ± 0.092
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.204 ± 0.85
AspCys: 0.46 ± 0.183
AspAsp: 4.138 ± 0.692
AspGlu: 3.525 ± 0.369
AspPhe: 1.992 ± 0.396
AspGly: 6.437 ± 0.683
AspHis: 1.073 ± 0.368
AspIle: 3.449 ± 0.447
AspLys: 3.985 ± 0.763
AspLeu: 4.598 ± 0.656
AspMet: 1.456 ± 0.23
AspAsn: 1.609 ± 0.367
AspPro: 2.682 ± 0.517
AspGln: 2.069 ± 0.571
AspArg: 2.759 ± 0.499
AspSer: 2.835 ± 0.37
AspThr: 4.215 ± 0.718
AspVal: 3.142 ± 0.649
AspTrp: 1.073 ± 0.256
AspTyr: 1.916 ± 0.257
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.748 ± 0.817
GluCys: 0.46 ± 0.18
GluAsp: 2.759 ± 0.554
GluGlu: 2.989 ± 0.643
GluPhe: 2.529 ± 0.388
GluGly: 3.755 ± 0.498
GluHis: 1.686 ± 0.344
GluIle: 1.686 ± 0.39
GluLys: 2.376 ± 0.461
GluLeu: 5.977 ± 0.799
GluMet: 2.222 ± 0.384
GluAsn: 2.146 ± 0.352
GluPro: 1.992 ± 0.45
GluGln: 4.138 ± 0.521
GluArg: 4.445 ± 0.669
GluSer: 3.142 ± 0.403
GluThr: 3.602 ± 0.518
GluVal: 2.912 ± 0.599
GluTrp: 1.609 ± 0.351
GluTyr: 1.303 ± 0.301
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.376 ± 0.503
PheCys: 0.383 ± 0.165
PheAsp: 2.835 ± 0.535
PheGlu: 1.839 ± 0.397
PhePhe: 1.226 ± 0.309
PheGly: 3.142 ± 0.5
PheHis: 0.69 ± 0.217
PheIle: 1.992 ± 0.372
PheLys: 2.069 ± 0.446
PheLeu: 2.835 ± 0.581
PheMet: 0.766 ± 0.247
PheAsn: 1.916 ± 0.32
PhePro: 1.15 ± 0.272
PheGln: 1.303 ± 0.328
PheArg: 1.992 ± 0.36
PheSer: 1.992 ± 0.4
PheThr: 2.682 ± 0.372
PheVal: 3.755 ± 0.488
PheTrp: 0.23 ± 0.134
PheTyr: 0.69 ± 0.247
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.047 ± 0.961
GlyCys: 0.613 ± 0.219
GlyAsp: 5.058 ± 0.509
GlyGlu: 3.908 ± 0.668
GlyPhe: 3.065 ± 0.474
GlyGly: 7.357 ± 0.991
GlyHis: 1.073 ± 0.267
GlyIle: 3.985 ± 0.519
GlyLys: 5.364 ± 0.689
GlyLeu: 5.594 ± 0.768
GlyMet: 2.835 ± 0.553
GlyAsn: 2.912 ± 0.45
GlyPro: 2.452 ± 0.462
GlyGln: 4.062 ± 0.443
GlyArg: 4.598 ± 0.69
GlySer: 4.828 ± 0.791
GlyThr: 4.521 ± 0.607
GlyVal: 6.361 ± 0.63
GlyTrp: 1.686 ± 0.383
GlyTyr: 2.606 ± 0.336
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.456 ± 0.263
HisCys: 0.307 ± 0.151
HisAsp: 0.996 ± 0.234
HisGlu: 0.766 ± 0.32
HisPhe: 0.996 ± 0.285
HisGly: 1.686 ± 0.376
HisHis: 0.307 ± 0.161
HisIle: 0.843 ± 0.275
HisLys: 1.303 ± 0.35
HisLeu: 1.303 ± 0.306
HisMet: 0.766 ± 0.217
HisAsn: 0.996 ± 0.322
HisPro: 0.92 ± 0.293
HisGln: 0.536 ± 0.209
HisArg: 1.916 ± 0.467
HisSer: 0.996 ± 0.243
HisThr: 1.15 ± 0.239
HisVal: 0.69 ± 0.192
HisTrp: 0.307 ± 0.149
HisTyr: 0.996 ± 0.269
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.748 ± 0.574
IleCys: 0.153 ± 0.111
IleAsp: 3.449 ± 0.387
IleGlu: 3.525 ± 0.691
IlePhe: 1.226 ± 0.33
IleGly: 3.219 ± 0.447
IleHis: 1.15 ± 0.3
IleIle: 1.839 ± 0.431
IleLys: 2.759 ± 0.443
IleLeu: 2.912 ± 0.6
IleMet: 1.379 ± 0.3
IleAsn: 2.452 ± 0.48
IlePro: 1.916 ± 0.332
IleGln: 2.069 ± 0.296
IleArg: 2.912 ± 0.467
IleSer: 2.606 ± 0.367
IleThr: 3.295 ± 0.553
IleVal: 2.606 ± 0.528
IleTrp: 0.613 ± 0.265
IleTyr: 0.766 ± 0.234
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.897 ± 0.97
LysCys: 0.307 ± 0.2
LysAsp: 2.835 ± 0.459
LysGlu: 3.449 ± 0.568
LysPhe: 2.452 ± 0.391
LysGly: 2.912 ± 0.335
LysHis: 1.763 ± 0.482
LysIle: 1.609 ± 0.397
LysLys: 1.763 ± 0.483
LysLeu: 5.134 ± 0.613
LysMet: 1.073 ± 0.224
LysAsn: 2.146 ± 0.344
LysPro: 2.759 ± 0.591
LysGln: 2.376 ± 0.397
LysArg: 3.372 ± 0.619
LysSer: 2.835 ± 0.393
LysThr: 2.835 ± 0.486
LysVal: 2.759 ± 0.638
LysTrp: 0.766 ± 0.192
LysTyr: 1.609 ± 0.366
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.349 ± 0.985
LeuCys: 0.69 ± 0.293
LeuAsp: 5.901 ± 0.644
LeuGlu: 4.598 ± 0.755
LeuPhe: 2.222 ± 0.507
LeuGly: 5.441 ± 0.883
LeuHis: 1.609 ± 0.39
LeuIle: 3.525 ± 0.751
LeuLys: 3.678 ± 0.78
LeuLeu: 5.671 ± 0.779
LeuMet: 1.916 ± 0.363
LeuAsn: 3.832 ± 0.641
LeuPro: 4.138 ± 0.567
LeuGln: 3.908 ± 0.716
LeuArg: 6.054 ± 0.74
LeuSer: 3.755 ± 0.43
LeuThr: 3.908 ± 0.603
LeuVal: 3.755 ± 0.407
LeuTrp: 1.073 ± 0.221
LeuTyr: 2.146 ± 0.481
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.372 ± 0.666
MetCys: 0.23 ± 0.125
MetAsp: 1.609 ± 0.337
MetGlu: 1.379 ± 0.389
MetPhe: 0.383 ± 0.159
MetGly: 1.916 ± 0.549
MetHis: 0.383 ± 0.186
MetIle: 1.226 ± 0.28
MetLys: 1.073 ± 0.337
MetLeu: 3.142 ± 0.339
MetMet: 0.996 ± 0.259
MetAsn: 1.226 ± 0.318
MetPro: 1.303 ± 0.316
MetGln: 1.226 ± 0.367
MetArg: 1.992 ± 0.401
MetSer: 2.069 ± 0.345
MetThr: 1.609 ± 0.349
MetVal: 1.15 ± 0.268
MetTrp: 0.23 ± 0.125
MetTyr: 0.843 ± 0.332
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.748 ± 0.99
AsnCys: 0.153 ± 0.119
AsnAsp: 2.299 ± 0.449
AsnGlu: 1.456 ± 0.358
AsnPhe: 1.763 ± 0.346
AsnGly: 3.755 ± 0.752
AsnHis: 0.536 ± 0.241
AsnIle: 2.989 ± 0.371
AsnLys: 2.529 ± 0.549
AsnLeu: 3.525 ± 0.553
AsnMet: 1.303 ± 0.304
AsnAsn: 2.069 ± 0.317
AsnPro: 2.146 ± 0.363
AsnGln: 1.686 ± 0.328
AsnArg: 1.686 ± 0.311
AsnSer: 2.146 ± 0.374
AsnThr: 2.759 ± 0.511
AsnVal: 1.686 ± 0.341
AsnTrp: 0.69 ± 0.285
AsnTyr: 1.073 ± 0.232
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.368 ± 0.815
ProCys: 0.23 ± 0.141
ProAsp: 2.452 ± 0.432
ProGlu: 3.832 ± 0.509
ProPhe: 1.609 ± 0.46
ProGly: 4.368 ± 0.551
ProHis: 0.69 ± 0.219
ProIle: 1.839 ± 0.548
ProLys: 2.222 ± 0.357
ProLeu: 2.452 ± 0.464
ProMet: 0.996 ± 0.32
ProAsn: 1.533 ± 0.301
ProPro: 1.456 ± 0.417
ProGln: 2.376 ± 0.396
ProArg: 1.456 ± 0.329
ProSer: 2.222 ± 0.575
ProThr: 3.065 ± 0.481
ProVal: 3.219 ± 0.446
ProTrp: 0.843 ± 0.246
ProTyr: 1.686 ± 0.336
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 8.276 ± 1.068
GlnCys: 0.613 ± 0.258
GlnAsp: 2.146 ± 0.358
GlnGlu: 1.916 ± 0.416
GlnPhe: 2.069 ± 0.27
GlnGly: 2.989 ± 0.41
GlnHis: 1.379 ± 0.29
GlnIle: 2.376 ± 0.603
GlnLys: 2.452 ± 0.509
GlnLeu: 3.525 ± 0.62
GlnMet: 1.15 ± 0.268
GlnAsn: 1.609 ± 0.456
GlnPro: 2.069 ± 0.452
GlnGln: 3.678 ± 0.871
GlnArg: 4.138 ± 0.684
GlnSer: 1.609 ± 0.398
GlnThr: 1.992 ± 0.3
GlnVal: 2.989 ± 0.56
GlnTrp: 0.766 ± 0.186
GlnTyr: 2.299 ± 0.371
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.82 ± 0.803
ArgCys: 0.536 ± 0.202
ArgAsp: 3.985 ± 0.683
ArgGlu: 4.445 ± 0.762
ArgPhe: 2.069 ± 0.457
ArgGly: 3.985 ± 0.511
ArgHis: 1.226 ± 0.285
ArgIle: 3.449 ± 0.707
ArgLys: 3.295 ± 0.523
ArgLeu: 3.985 ± 0.525
ArgMet: 1.15 ± 0.281
ArgAsn: 3.372 ± 0.616
ArgPro: 2.606 ± 0.358
ArgGln: 3.295 ± 0.58
ArgArg: 3.295 ± 0.51
ArgSer: 3.525 ± 0.448
ArgThr: 3.525 ± 0.414
ArgVal: 4.215 ± 0.663
ArgTrp: 1.15 ± 0.366
ArgTyr: 2.146 ± 0.413
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.284 ± 0.758
SerCys: 0.383 ± 0.179
SerAsp: 3.142 ± 0.413
SerGlu: 2.452 ± 0.44
SerPhe: 2.222 ± 0.49
SerGly: 5.134 ± 0.581
SerHis: 0.766 ± 0.244
SerIle: 3.908 ± 0.597
SerLys: 2.452 ± 0.365
SerLeu: 3.372 ± 0.517
SerMet: 1.609 ± 0.402
SerAsn: 2.682 ± 0.577
SerPro: 2.452 ± 0.441
SerGln: 2.529 ± 0.464
SerArg: 3.065 ± 0.529
SerSer: 3.065 ± 0.559
SerThr: 4.062 ± 1.033
SerVal: 3.065 ± 0.409
SerTrp: 0.613 ± 0.207
SerTyr: 1.839 ± 0.35
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.667 ± 1.57
ThrCys: 0.307 ± 0.145
ThrAsp: 2.606 ± 0.398
ThrGlu: 2.835 ± 0.542
ThrPhe: 2.069 ± 0.303
ThrGly: 6.82 ± 0.64
ThrHis: 1.379 ± 0.273
ThrIle: 2.606 ± 0.427
ThrLys: 3.219 ± 0.462
ThrLeu: 4.368 ± 0.783
ThrMet: 1.379 ± 0.302
ThrAsn: 2.376 ± 0.588
ThrPro: 3.832 ± 0.622
ThrGln: 3.219 ± 0.63
ThrArg: 2.452 ± 0.58
ThrSer: 2.835 ± 0.304
ThrThr: 3.602 ± 0.697
ThrVal: 3.525 ± 0.597
ThrTrp: 0.69 ± 0.219
ThrTyr: 1.609 ± 0.351
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.977 ± 0.672
ValCys: 0.383 ± 0.178
ValAsp: 3.832 ± 0.545
ValGlu: 4.521 ± 0.612
ValPhe: 2.069 ± 0.459
ValGly: 4.675 ± 0.621
ValHis: 1.073 ± 0.265
ValIle: 2.912 ± 0.469
ValLys: 3.219 ± 0.53
ValLeu: 4.598 ± 0.652
ValMet: 1.15 ± 0.253
ValAsn: 1.839 ± 0.477
ValPro: 2.912 ± 0.534
ValGln: 3.295 ± 0.469
ValArg: 4.062 ± 0.696
ValSer: 4.292 ± 0.698
ValThr: 3.295 ± 0.485
ValVal: 4.215 ± 0.648
ValTrp: 1.15 ± 0.285
ValTyr: 1.916 ± 0.333
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.303 ± 0.291
TrpCys: 0.077 ± 0.085
TrpAsp: 1.15 ± 0.258
TrpGlu: 0.766 ± 0.253
TrpPhe: 1.15 ± 0.327
TrpGly: 0.843 ± 0.339
TrpHis: 0.077 ± 0.07
TrpIle: 0.843 ± 0.304
TrpLys: 0.69 ± 0.227
TrpLeu: 1.839 ± 0.351
TrpMet: 0.46 ± 0.161
TrpAsn: 0.536 ± 0.208
TrpPro: 0.69 ± 0.256
TrpGln: 0.69 ± 0.187
TrpArg: 1.073 ± 0.253
TrpSer: 0.843 ± 0.308
TrpThr: 0.383 ± 0.183
TrpVal: 1.533 ± 0.291
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.613 ± 0.238
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.449 ± 0.414
TyrCys: 0.077 ± 0.064
TyrAsp: 1.609 ± 0.403
TyrGlu: 1.992 ± 0.386
TyrPhe: 1.379 ± 0.358
TyrGly: 3.065 ± 0.356
TyrHis: 0.46 ± 0.162
TyrIle: 0.996 ± 0.271
TyrLys: 1.15 ± 0.205
TyrLeu: 2.759 ± 0.514
TyrMet: 0.536 ± 0.175
TyrAsn: 1.686 ± 0.432
TyrPro: 1.073 ± 0.351
TyrGln: 1.456 ± 0.349
TyrArg: 2.146 ± 0.361
TyrSer: 2.069 ± 0.407
TyrThr: 1.686 ± 0.303
TyrVal: 2.069 ± 0.497
TyrTrp: 0.153 ± 0.121
TyrTyr: 0.613 ± 0.211
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (13050 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
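A protein-level bootstrap of this kind can be sketched in Python as follows. This is a sketch under stated assumptions, not the procedure behind the numbers above: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the standard deviation over the replicates, which the page does not state explicitly.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence (one-letter codes)."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter gives 0 for absent keys
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

A dipeptide that never occurs in any protein yields 0.0 in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.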

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski