Amino acid dipepetide frequency for Arthrobacter phage vB_ArS-ArV2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.511AlaAla: 18.511 ± 2.773
0.83AlaCys: 0.83 ± 0.305
7.139AlaAsp: 7.139 ± 0.843
8.633AlaGlu: 8.633 ± 1.169
4.565AlaPhe: 4.565 ± 0.728
10.708AlaGly: 10.708 ± 1.291
2.739AlaHis: 2.739 ± 0.511
4.565AlaIle: 4.565 ± 0.656
5.894AlaLys: 5.894 ± 0.7
9.38AlaLeu: 9.38 ± 0.902
4.565AlaMet: 4.565 ± 0.459
2.407AlaAsn: 2.407 ± 0.417
5.147AlaPro: 5.147 ± 0.753
4.565AlaGln: 4.565 ± 0.638
6.973AlaArg: 6.973 ± 1.007
6.89AlaSer: 6.89 ± 1.085
5.396AlaThr: 5.396 ± 0.492
7.886AlaVal: 7.886 ± 0.801
2.324AlaTrp: 2.324 ± 0.373
1.494AlaTyr: 1.494 ± 0.397
0.0AlaXaa: 0.0 ± 0.0
Cys
0.996CysAla: 0.996 ± 0.282
0.249CysCys: 0.249 ± 0.131
0.415CysAsp: 0.415 ± 0.2
0.83CysGlu: 0.83 ± 0.299
0.166CysPhe: 0.166 ± 0.125
1.577CysGly: 1.577 ± 0.496
0.166CysHis: 0.166 ± 0.116
0.415CysIle: 0.415 ± 0.204
0.332CysLys: 0.332 ± 0.185
0.332CysLeu: 0.332 ± 0.163
0.415CysMet: 0.415 ± 0.253
0.498CysAsn: 0.498 ± 0.211
0.581CysPro: 0.581 ± 0.232
0.332CysGln: 0.332 ± 0.169
1.245CysArg: 1.245 ± 0.373
0.747CysSer: 0.747 ± 0.247
0.83CysThr: 0.83 ± 0.279
0.166CysVal: 0.166 ± 0.123
0.498CysTrp: 0.498 ± 0.233
0.249CysTyr: 0.249 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
6.89AspAla: 6.89 ± 0.597
0.747AspCys: 0.747 ± 0.239
3.32AspAsp: 3.32 ± 0.583
2.988AspGlu: 2.988 ± 0.582
2.158AspPhe: 2.158 ± 0.396
6.558AspGly: 6.558 ± 0.68
1.245AspHis: 1.245 ± 0.306
2.656AspIle: 2.656 ± 0.412
1.992AspLys: 1.992 ± 0.505
6.309AspLeu: 6.309 ± 0.701
1.411AspMet: 1.411 ± 0.369
1.909AspAsn: 1.909 ± 0.326
3.818AspPro: 3.818 ± 0.6
1.577AspGln: 1.577 ± 0.369
3.154AspArg: 3.154 ± 0.566
2.739AspSer: 2.739 ± 0.468
2.407AspThr: 2.407 ± 0.408
4.316AspVal: 4.316 ± 0.62
1.079AspTrp: 1.079 ± 0.284
1.245AspTyr: 1.245 ± 0.303
0.0AspXaa: 0.0 ± 0.0
Glu
6.641GluAla: 6.641 ± 0.687
0.996GluCys: 0.996 ± 0.373
2.656GluAsp: 2.656 ± 0.5
3.237GluGlu: 3.237 ± 0.5
1.909GluPhe: 1.909 ± 0.416
4.316GluGly: 4.316 ± 0.626
0.913GluHis: 0.913 ± 0.322
3.154GluIle: 3.154 ± 0.559
2.075GluLys: 2.075 ± 0.412
4.648GluLeu: 4.648 ± 0.691
0.996GluMet: 0.996 ± 0.286
1.494GluAsn: 1.494 ± 0.333
2.988GluPro: 2.988 ± 0.561
2.905GluGln: 2.905 ± 0.649
3.984GluArg: 3.984 ± 0.646
3.486GluSer: 3.486 ± 0.528
3.901GluThr: 3.901 ± 0.447
4.731GluVal: 4.731 ± 0.707
1.577GluTrp: 1.577 ± 0.418
0.996GluTyr: 0.996 ± 0.254
0.0GluXaa: 0.0 ± 0.0
Phe
3.735PheAla: 3.735 ± 0.585
0.332PheCys: 0.332 ± 0.209
2.905PheAsp: 2.905 ± 0.654
1.909PheGlu: 1.909 ± 0.412
1.079PhePhe: 1.079 ± 0.342
3.071PheGly: 3.071 ± 0.446
0.747PheHis: 0.747 ± 0.225
1.328PheIle: 1.328 ± 0.317
1.079PheLys: 1.079 ± 0.287
2.822PheLeu: 2.822 ± 0.452
0.332PheMet: 0.332 ± 0.155
0.581PheAsn: 0.581 ± 0.228
1.328PhePro: 1.328 ± 0.332
1.079PheGln: 1.079 ± 0.263
1.411PheArg: 1.411 ± 0.493
1.909PheSer: 1.909 ± 0.412
3.071PheThr: 3.071 ± 0.597
1.743PheVal: 1.743 ± 0.408
0.747PheTrp: 0.747 ± 0.239
0.747PheTyr: 0.747 ± 0.344
0.0PheXaa: 0.0 ± 0.0
Gly
8.218GlyAla: 8.218 ± 1.212
0.664GlyCys: 0.664 ± 0.252
3.901GlyAsp: 3.901 ± 0.58
5.479GlyGlu: 5.479 ± 0.703
3.071GlyPhe: 3.071 ± 0.527
6.475GlyGly: 6.475 ± 1.008
2.158GlyHis: 2.158 ± 0.512
4.731GlyIle: 4.731 ± 0.841
4.399GlyLys: 4.399 ± 0.594
6.392GlyLeu: 6.392 ± 0.663
1.66GlyMet: 1.66 ± 0.268
3.237GlyAsn: 3.237 ± 0.562
4.233GlyPro: 4.233 ± 1.471
3.569GlyGln: 3.569 ± 0.425
5.313GlyArg: 5.313 ± 0.552
4.648GlySer: 4.648 ± 0.567
6.89GlyThr: 6.89 ± 0.935
5.894GlyVal: 5.894 ± 0.987
3.071GlyTrp: 3.071 ± 0.573
1.992GlyTyr: 1.992 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
1.66HisAla: 1.66 ± 0.466
0.581HisCys: 0.581 ± 0.226
1.494HisAsp: 1.494 ± 0.324
1.577HisGlu: 1.577 ± 0.438
0.664HisPhe: 0.664 ± 0.237
2.573HisGly: 2.573 ± 0.373
0.249HisHis: 0.249 ± 0.126
1.411HisIle: 1.411 ± 0.298
0.498HisLys: 0.498 ± 0.205
1.909HisLeu: 1.909 ± 0.415
0.332HisMet: 0.332 ± 0.158
0.747HisAsn: 0.747 ± 0.236
1.743HisPro: 1.743 ± 0.459
1.245HisGln: 1.245 ± 0.367
2.324HisArg: 2.324 ± 0.441
1.162HisSer: 1.162 ± 0.249
0.664HisThr: 0.664 ± 0.239
0.996HisVal: 0.996 ± 0.252
0.332HisTrp: 0.332 ± 0.147
0.913HisTyr: 0.913 ± 0.246
0.0HisXaa: 0.0 ± 0.0
Ile
4.399IleAla: 4.399 ± 0.584
0.415IleCys: 0.415 ± 0.165
3.984IleAsp: 3.984 ± 0.568
1.909IleGlu: 1.909 ± 0.44
0.747IlePhe: 0.747 ± 0.283
4.316IleGly: 4.316 ± 1.096
0.996IleHis: 0.996 ± 0.216
2.075IleIle: 2.075 ± 0.452
2.324IleLys: 2.324 ± 0.542
3.32IleLeu: 3.32 ± 0.438
1.162IleMet: 1.162 ± 0.334
1.494IleAsn: 1.494 ± 0.395
2.739IlePro: 2.739 ± 0.423
1.577IleGln: 1.577 ± 0.372
2.739IleArg: 2.739 ± 0.562
2.49IleSer: 2.49 ± 0.352
3.071IleThr: 3.071 ± 0.453
2.49IleVal: 2.49 ± 0.448
0.747IleTrp: 0.747 ± 0.241
1.079IleTyr: 1.079 ± 0.24
0.0IleXaa: 0.0 ± 0.0
Lys
6.06LysAla: 6.06 ± 0.87
0.664LysCys: 0.664 ± 0.254
2.407LysAsp: 2.407 ± 0.487
3.071LysGlu: 3.071 ± 0.529
0.664LysPhe: 0.664 ± 0.205
2.822LysGly: 2.822 ± 0.581
0.747LysHis: 0.747 ± 0.304
2.075LysIle: 2.075 ± 0.412
1.66LysLys: 1.66 ± 0.565
3.486LysLeu: 3.486 ± 0.587
1.245LysMet: 1.245 ± 0.346
0.913LysAsn: 0.913 ± 0.24
4.648LysPro: 4.648 ± 0.658
1.079LysGln: 1.079 ± 0.29
3.237LysArg: 3.237 ± 0.543
1.826LysSer: 1.826 ± 0.355
2.324LysThr: 2.324 ± 0.485
2.573LysVal: 2.573 ± 0.428
0.747LysTrp: 0.747 ± 0.28
0.747LysTyr: 0.747 ± 0.236
0.0LysXaa: 0.0 ± 0.0
Leu
8.882LeuAla: 8.882 ± 0.831
0.83LeuCys: 0.83 ± 0.273
4.731LeuAsp: 4.731 ± 0.797
3.486LeuGlu: 3.486 ± 0.634
1.992LeuPhe: 1.992 ± 0.317
4.814LeuGly: 4.814 ± 0.585
2.075LeuHis: 2.075 ± 0.375
2.905LeuIle: 2.905 ± 0.408
3.984LeuLys: 3.984 ± 0.779
5.728LeuLeu: 5.728 ± 0.802
1.162LeuMet: 1.162 ± 0.289
3.071LeuAsn: 3.071 ± 0.519
3.237LeuPro: 3.237 ± 0.565
2.739LeuGln: 2.739 ± 0.489
5.562LeuArg: 5.562 ± 0.899
6.06LeuSer: 6.06 ± 0.801
6.143LeuThr: 6.143 ± 0.624
4.897LeuVal: 4.897 ± 0.559
0.747LeuTrp: 0.747 ± 0.235
1.992LeuTyr: 1.992 ± 0.373
0.0LeuXaa: 0.0 ± 0.0
Met
3.818MetAla: 3.818 ± 0.576
0.083MetCys: 0.083 ± 0.085
1.079MetAsp: 1.079 ± 0.287
0.664MetGlu: 0.664 ± 0.196
0.332MetPhe: 0.332 ± 0.156
2.739MetGly: 2.739 ± 0.469
0.581MetHis: 0.581 ± 0.25
0.996MetIle: 0.996 ± 0.293
0.581MetLys: 0.581 ± 0.224
1.826MetLeu: 1.826 ± 0.434
0.166MetMet: 0.166 ± 0.119
0.83MetAsn: 0.83 ± 0.233
1.577MetPro: 1.577 ± 0.33
0.415MetGln: 0.415 ± 0.223
1.743MetArg: 1.743 ± 0.321
1.66MetSer: 1.66 ± 0.394
1.245MetThr: 1.245 ± 0.278
1.162MetVal: 1.162 ± 0.296
0.498MetTrp: 0.498 ± 0.197
0.166MetTyr: 0.166 ± 0.152
0.0MetXaa: 0.0 ± 0.0
Asn
4.482AsnAla: 4.482 ± 0.616
0.415AsnCys: 0.415 ± 0.165
1.411AsnAsp: 1.411 ± 0.282
1.162AsnGlu: 1.162 ± 0.272
0.415AsnPhe: 0.415 ± 0.257
3.237AsnGly: 3.237 ± 0.592
0.332AsnHis: 0.332 ± 0.153
1.245AsnIle: 1.245 ± 0.342
0.996AsnLys: 0.996 ± 0.302
1.494AsnLeu: 1.494 ± 0.362
0.332AsnMet: 0.332 ± 0.151
1.162AsnAsn: 1.162 ± 0.346
2.739AsnPro: 2.739 ± 0.467
0.83AsnGln: 0.83 ± 0.247
1.909AsnArg: 1.909 ± 0.414
1.743AsnSer: 1.743 ± 0.42
1.743AsnThr: 1.743 ± 0.484
1.743AsnVal: 1.743 ± 0.455
0.747AsnTrp: 0.747 ± 0.211
0.664AsnTyr: 0.664 ± 0.209
0.0AsnXaa: 0.0 ± 0.0
Pro
7.305ProAla: 7.305 ± 0.806
0.747ProCys: 0.747 ± 0.433
4.399ProAsp: 4.399 ± 0.465
2.988ProGlu: 2.988 ± 0.55
1.66ProPhe: 1.66 ± 0.357
4.565ProGly: 4.565 ± 0.663
2.158ProHis: 2.158 ± 0.465
2.656ProIle: 2.656 ± 0.421
1.992ProLys: 1.992 ± 0.348
3.486ProLeu: 3.486 ± 0.557
1.328ProMet: 1.328 ± 0.305
1.411ProAsn: 1.411 ± 0.403
3.237ProPro: 3.237 ± 0.81
1.909ProGln: 1.909 ± 0.536
3.237ProArg: 3.237 ± 0.587
2.822ProSer: 2.822 ± 0.533
4.067ProThr: 4.067 ± 0.463
5.313ProVal: 5.313 ± 0.542
1.162ProTrp: 1.162 ± 0.296
1.909ProTyr: 1.909 ± 0.45
0.0ProXaa: 0.0 ± 0.0
Gln
4.399GlnAla: 4.399 ± 0.699
0.581GlnCys: 0.581 ± 0.234
1.826GlnAsp: 1.826 ± 0.385
1.826GlnGlu: 1.826 ± 0.369
1.328GlnPhe: 1.328 ± 0.292
3.901GlnGly: 3.901 ± 1.102
0.747GlnHis: 0.747 ± 0.25
1.411GlnIle: 1.411 ± 0.298
1.743GlnLys: 1.743 ± 0.3
2.075GlnLeu: 2.075 ± 0.375
0.996GlnMet: 0.996 ± 0.299
0.747GlnAsn: 0.747 ± 0.238
3.071GlnPro: 3.071 ± 0.564
1.66GlnGln: 1.66 ± 0.443
2.241GlnArg: 2.241 ± 0.514
1.245GlnSer: 1.245 ± 0.302
2.158GlnThr: 2.158 ± 0.357
2.407GlnVal: 2.407 ± 0.426
0.664GlnTrp: 0.664 ± 0.17
1.162GlnTyr: 1.162 ± 0.311
0.0GlnXaa: 0.0 ± 0.0
Arg
7.969ArgAla: 7.969 ± 0.753
0.83ArgCys: 0.83 ± 0.286
3.901ArgAsp: 3.901 ± 0.584
5.147ArgGlu: 5.147 ± 0.745
2.158ArgPhe: 2.158 ± 0.374
3.901ArgGly: 3.901 ± 0.722
1.992ArgHis: 1.992 ± 0.452
2.407ArgIle: 2.407 ± 0.549
3.984ArgLys: 3.984 ± 0.651
4.482ArgLeu: 4.482 ± 0.741
2.075ArgMet: 2.075 ± 0.359
1.577ArgAsn: 1.577 ± 0.312
3.32ArgPro: 3.32 ± 0.643
2.075ArgGln: 2.075 ± 0.396
6.641ArgArg: 6.641 ± 0.836
3.32ArgSer: 3.32 ± 0.547
4.648ArgThr: 4.648 ± 0.677
5.147ArgVal: 5.147 ± 1.015
1.162ArgTrp: 1.162 ± 0.312
1.079ArgTyr: 1.079 ± 0.299
0.0ArgXaa: 0.0 ± 0.0
Ser
6.807SerAla: 6.807 ± 0.882
0.581SerCys: 0.581 ± 0.215
2.988SerAsp: 2.988 ± 0.476
3.32SerGlu: 3.32 ± 0.656
1.909SerPhe: 1.909 ± 0.44
5.977SerGly: 5.977 ± 0.578
1.328SerHis: 1.328 ± 0.351
1.743SerIle: 1.743 ± 0.428
2.822SerLys: 2.822 ± 0.557
4.15SerLeu: 4.15 ± 0.537
1.162SerMet: 1.162 ± 0.246
1.743SerAsn: 1.743 ± 0.36
3.32SerPro: 3.32 ± 0.519
1.826SerGln: 1.826 ± 0.366
4.399SerArg: 4.399 ± 0.823
4.399SerSer: 4.399 ± 0.642
3.818SerThr: 3.818 ± 0.497
3.486SerVal: 3.486 ± 0.502
1.245SerTrp: 1.245 ± 0.356
1.577SerTyr: 1.577 ± 0.3
0.0SerXaa: 0.0 ± 0.0
Thr
7.305ThrAla: 7.305 ± 0.77
0.581ThrCys: 0.581 ± 0.259
3.071ThrAsp: 3.071 ± 0.594
3.818ThrGlu: 3.818 ± 0.605
2.656ThrPhe: 2.656 ± 0.442
5.479ThrGly: 5.479 ± 0.626
1.245ThrHis: 1.245 ± 0.291
3.071ThrIle: 3.071 ± 0.585
2.822ThrLys: 2.822 ± 0.555
4.98ThrLeu: 4.98 ± 0.488
1.079ThrMet: 1.079 ± 0.213
1.66ThrAsn: 1.66 ± 0.486
3.901ThrPro: 3.901 ± 0.539
2.324ThrGln: 2.324 ± 0.512
3.486ThrArg: 3.486 ± 0.486
3.901ThrSer: 3.901 ± 0.611
4.482ThrThr: 4.482 ± 0.809
5.313ThrVal: 5.313 ± 0.678
1.494ThrTrp: 1.494 ± 0.3
1.743ThrTyr: 1.743 ± 0.365
0.0ThrXaa: 0.0 ± 0.0
Val
7.139ValAla: 7.139 ± 0.779
0.498ValCys: 0.498 ± 0.223
3.984ValAsp: 3.984 ± 0.68
3.071ValGlu: 3.071 ± 0.582
3.403ValPhe: 3.403 ± 0.604
5.562ValGly: 5.562 ± 0.743
1.411ValHis: 1.411 ± 0.435
3.569ValIle: 3.569 ± 0.715
2.905ValLys: 2.905 ± 0.596
4.233ValLeu: 4.233 ± 0.572
0.913ValMet: 0.913 ± 0.272
2.324ValAsn: 2.324 ± 0.484
3.652ValPro: 3.652 ± 0.559
2.49ValGln: 2.49 ± 0.435
5.147ValArg: 5.147 ± 0.569
5.064ValSer: 5.064 ± 0.674
5.064ValThr: 5.064 ± 0.606
5.064ValVal: 5.064 ± 0.717
1.411ValTrp: 1.411 ± 0.314
2.075ValTyr: 2.075 ± 0.393
0.0ValXaa: 0.0 ± 0.0
Trp
1.909TrpAla: 1.909 ± 0.418
0.083TrpCys: 0.083 ± 0.1
1.66TrpAsp: 1.66 ± 0.41
1.245TrpGlu: 1.245 ± 0.334
0.664TrpPhe: 0.664 ± 0.267
1.245TrpGly: 1.245 ± 0.324
0.747TrpHis: 0.747 ± 0.254
0.913TrpIle: 0.913 ± 0.241
0.664TrpLys: 0.664 ± 0.194
2.075TrpLeu: 2.075 ± 0.513
0.249TrpMet: 0.249 ± 0.171
0.498TrpAsn: 0.498 ± 0.225
1.328TrpPro: 1.328 ± 0.353
1.245TrpGln: 1.245 ± 0.326
1.411TrpArg: 1.411 ± 0.386
0.83TrpSer: 0.83 ± 0.291
1.162TrpThr: 1.162 ± 0.35
2.158TrpVal: 2.158 ± 0.562
0.332TrpTrp: 0.332 ± 0.164
0.415TrpTyr: 0.415 ± 0.141
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.486TyrAla: 3.486 ± 0.602
0.332TyrCys: 0.332 ± 0.164
1.245TyrAsp: 1.245 ± 0.296
1.245TyrGlu: 1.245 ± 0.309
0.415TyrPhe: 0.415 ± 0.132
2.158TyrGly: 2.158 ± 0.441
0.498TyrHis: 0.498 ± 0.218
0.747TyrIle: 0.747 ± 0.202
0.249TyrLys: 0.249 ± 0.123
1.909TyrLeu: 1.909 ± 0.344
0.498TyrMet: 0.498 ± 0.199
0.498TyrAsn: 0.498 ± 0.202
1.743TyrPro: 1.743 ± 0.349
0.747TyrGln: 0.747 ± 0.267
1.66TyrArg: 1.66 ± 0.393
1.66TyrSer: 1.66 ± 0.44
1.328TyrThr: 1.328 ± 0.423
1.577TyrVal: 1.577 ± 0.382
0.249TyrTrp: 0.249 ± 0.123
0.664TyrTyr: 0.664 ± 0.274
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (12048 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski