Amino acid dipepetide frequency for Brucella phage Tb

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.421AlaAla: 9.421 ± 1.154
0.766AlaCys: 0.766 ± 0.244
6.281AlaAsp: 6.281 ± 0.685
5.132AlaGlu: 5.132 ± 0.876
3.37AlaPhe: 3.37 ± 0.458
7.047AlaGly: 7.047 ± 1.191
1.379AlaHis: 1.379 ± 0.324
5.132AlaIle: 5.132 ± 0.787
5.132AlaLys: 5.132 ± 0.657
9.268AlaLeu: 9.268 ± 1.01
1.991AlaMet: 1.991 ± 0.329
4.136AlaAsn: 4.136 ± 0.635
3.447AlaPro: 3.447 ± 0.691
5.591AlaGln: 5.591 ± 0.994
5.132AlaArg: 5.132 ± 0.717
7.353AlaSer: 7.353 ± 0.837
5.132AlaThr: 5.132 ± 0.874
5.668AlaVal: 5.668 ± 0.559
1.762AlaTrp: 1.762 ± 0.316
1.915AlaTyr: 1.915 ± 0.432
0.0AlaXaa: 0.0 ± 0.0
Cys
0.536CysAla: 0.536 ± 0.244
0.077CysCys: 0.077 ± 0.076
0.689CysAsp: 0.689 ± 0.234
0.996CysGlu: 0.996 ± 0.321
0.383CysPhe: 0.383 ± 0.135
1.072CysGly: 1.072 ± 0.37
0.077CysHis: 0.077 ± 0.076
0.843CysIle: 0.843 ± 0.251
0.306CysLys: 0.306 ± 0.185
0.689CysLeu: 0.689 ± 0.22
0.383CysMet: 0.383 ± 0.193
0.383CysAsn: 0.383 ± 0.192
0.996CysPro: 0.996 ± 0.28
0.383CysGln: 0.383 ± 0.206
0.996CysArg: 0.996 ± 0.232
0.613CysSer: 0.613 ± 0.26
0.46CysThr: 0.46 ± 0.188
0.919CysVal: 0.919 ± 0.277
0.077CysTrp: 0.077 ± 0.078
0.306CysTyr: 0.306 ± 0.134
0.0CysXaa: 0.0 ± 0.0
Asp
5.132AspAla: 5.132 ± 0.601
0.843AspCys: 0.843 ± 0.292
3.753AspAsp: 3.753 ± 0.562
4.059AspGlu: 4.059 ± 0.704
2.604AspPhe: 2.604 ± 0.348
4.902AspGly: 4.902 ± 0.593
1.685AspHis: 1.685 ± 0.455
4.902AspIle: 4.902 ± 0.622
2.757AspLys: 2.757 ± 0.573
4.059AspLeu: 4.059 ± 0.602
1.685AspMet: 1.685 ± 0.306
2.374AspAsn: 2.374 ± 0.349
2.834AspPro: 2.834 ± 0.439
2.298AspGln: 2.298 ± 0.391
3.523AspArg: 3.523 ± 0.617
2.834AspSer: 2.834 ± 0.532
2.451AspThr: 2.451 ± 0.46
2.911AspVal: 2.911 ± 0.508
0.689AspTrp: 0.689 ± 0.278
2.757AspTyr: 2.757 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
6.664GluAla: 6.664 ± 0.872
0.536GluCys: 0.536 ± 0.191
3.447GluAsp: 3.447 ± 0.548
3.447GluGlu: 3.447 ± 0.749
1.991GluPhe: 1.991 ± 0.352
3.983GluGly: 3.983 ± 0.668
1.532GluHis: 1.532 ± 0.348
3.906GluIle: 3.906 ± 0.679
3.523GluLys: 3.523 ± 0.594
6.51GluLeu: 6.51 ± 0.891
2.068GluMet: 2.068 ± 0.382
2.681GluAsn: 2.681 ± 0.418
2.145GluPro: 2.145 ± 0.496
3.064GluGln: 3.064 ± 0.732
4.289GluArg: 4.289 ± 0.487
3.217GluSer: 3.217 ± 0.436
2.834GluThr: 2.834 ± 0.477
5.668GluVal: 5.668 ± 0.734
1.149GluTrp: 1.149 ± 0.386
2.068GluTyr: 2.068 ± 0.392
0.0GluXaa: 0.0 ± 0.0
Phe
3.447PheAla: 3.447 ± 0.45
1.072PheCys: 1.072 ± 0.316
1.838PheAsp: 1.838 ± 0.286
1.455PheGlu: 1.455 ± 0.309
1.225PhePhe: 1.225 ± 0.314
2.911PheGly: 2.911 ± 0.458
0.766PheHis: 0.766 ± 0.229
1.455PheIle: 1.455 ± 0.403
1.532PheLys: 1.532 ± 0.342
1.762PheLeu: 1.762 ± 0.337
1.149PheMet: 1.149 ± 0.283
1.762PheAsn: 1.762 ± 0.333
1.302PhePro: 1.302 ± 0.229
1.379PheGln: 1.379 ± 0.319
1.532PheArg: 1.532 ± 0.397
2.911PheSer: 2.911 ± 0.501
2.528PheThr: 2.528 ± 0.458
1.915PheVal: 1.915 ± 0.443
0.306PheTrp: 0.306 ± 0.145
0.919PheTyr: 0.919 ± 0.296
0.0PheXaa: 0.0 ± 0.0
Gly
6.97GlyAla: 6.97 ± 1.048
0.536GlyCys: 0.536 ± 0.207
4.979GlyAsp: 4.979 ± 0.829
3.983GlyGlu: 3.983 ± 0.655
3.294GlyPhe: 3.294 ± 0.441
8.655GlyGly: 8.655 ± 1.592
0.613GlyHis: 0.613 ± 0.241
3.6GlyIle: 3.6 ± 0.524
5.285GlyLys: 5.285 ± 0.83
6.664GlyLeu: 6.664 ± 0.729
2.221GlyMet: 2.221 ± 0.404
4.672GlyAsn: 4.672 ± 0.633
2.451GlyPro: 2.451 ± 0.414
2.681GlyGln: 2.681 ± 0.458
4.136GlyArg: 4.136 ± 0.617
6.051GlySer: 6.051 ± 0.869
5.055GlyThr: 5.055 ± 0.871
5.974GlyVal: 5.974 ± 0.907
1.532GlyTrp: 1.532 ± 0.44
2.757GlyTyr: 2.757 ± 0.553
0.0GlyXaa: 0.0 ± 0.0
His
1.225HisAla: 1.225 ± 0.383
0.153HisCys: 0.153 ± 0.093
0.766HisAsp: 0.766 ± 0.199
0.843HisGlu: 0.843 ± 0.25
0.689HisPhe: 0.689 ± 0.282
0.996HisGly: 0.996 ± 0.254
0.536HisHis: 0.536 ± 0.19
1.379HisIle: 1.379 ± 0.365
0.996HisLys: 0.996 ± 0.3
1.379HisLeu: 1.379 ± 0.392
0.153HisMet: 0.153 ± 0.091
0.919HisAsn: 0.919 ± 0.309
1.072HisPro: 1.072 ± 0.243
0.613HisGln: 0.613 ± 0.195
1.225HisArg: 1.225 ± 0.365
0.689HisSer: 0.689 ± 0.254
0.536HisThr: 0.536 ± 0.198
1.149HisVal: 1.149 ± 0.299
0.23HisTrp: 0.23 ± 0.119
0.613HisTyr: 0.613 ± 0.258
0.0HisXaa: 0.0 ± 0.0
Ile
5.438IleAla: 5.438 ± 0.782
0.843IleCys: 0.843 ± 0.256
4.289IleAsp: 4.289 ± 0.804
4.825IleGlu: 4.825 ± 0.681
1.608IlePhe: 1.608 ± 0.422
4.442IleGly: 4.442 ± 0.614
0.46IleHis: 0.46 ± 0.193
3.6IleIle: 3.6 ± 0.406
3.064IleLys: 3.064 ± 0.499
4.366IleLeu: 4.366 ± 0.524
1.149IleMet: 1.149 ± 0.346
2.604IleAsn: 2.604 ± 0.366
2.834IlePro: 2.834 ± 0.518
2.145IleGln: 2.145 ± 0.391
2.221IleArg: 2.221 ± 0.436
4.289IleSer: 4.289 ± 0.518
3.217IleThr: 3.217 ± 0.443
3.064IleVal: 3.064 ± 0.646
0.689IleTrp: 0.689 ± 0.248
1.838IleTyr: 1.838 ± 0.362
0.0IleXaa: 0.0 ± 0.0
Lys
5.744LysAla: 5.744 ± 0.634
0.843LysCys: 0.843 ± 0.263
3.294LysAsp: 3.294 ± 0.747
4.366LysGlu: 4.366 ± 0.667
1.455LysPhe: 1.455 ± 0.335
3.676LysGly: 3.676 ± 0.545
1.149LysHis: 1.149 ± 0.338
3.447LysIle: 3.447 ± 0.548
3.294LysLys: 3.294 ± 0.62
2.987LysLeu: 2.987 ± 0.47
1.685LysMet: 1.685 ± 0.393
1.838LysAsn: 1.838 ± 0.425
3.217LysPro: 3.217 ± 0.619
1.302LysGln: 1.302 ± 0.445
2.987LysArg: 2.987 ± 0.517
3.6LysSer: 3.6 ± 0.501
2.528LysThr: 2.528 ± 0.433
3.064LysVal: 3.064 ± 0.49
1.072LysTrp: 1.072 ± 0.344
1.838LysTyr: 1.838 ± 0.467
0.0LysXaa: 0.0 ± 0.0
Leu
7.2LeuAla: 7.2 ± 1.139
0.843LeuCys: 0.843 ± 0.314
3.83LeuAsp: 3.83 ± 0.597
4.672LeuGlu: 4.672 ± 0.827
2.221LeuPhe: 2.221 ± 0.455
5.515LeuGly: 5.515 ± 0.637
0.613LeuHis: 0.613 ± 0.237
4.519LeuIle: 4.519 ± 0.747
4.059LeuLys: 4.059 ± 0.719
3.447LeuLeu: 3.447 ± 0.581
1.532LeuMet: 1.532 ± 0.356
4.136LeuAsn: 4.136 ± 0.513
3.906LeuPro: 3.906 ± 0.725
3.6LeuGln: 3.6 ± 0.819
4.213LeuArg: 4.213 ± 0.696
5.668LeuSer: 5.668 ± 0.596
4.825LeuThr: 4.825 ± 0.785
4.825LeuVal: 4.825 ± 0.616
0.996LeuTrp: 0.996 ± 0.326
1.762LeuTyr: 1.762 ± 0.328
0.0LeuXaa: 0.0 ± 0.0
Met
2.604MetAla: 2.604 ± 0.36
0.383MetCys: 0.383 ± 0.138
1.225MetAsp: 1.225 ± 0.315
1.225MetGlu: 1.225 ± 0.387
0.46MetPhe: 0.46 ± 0.178
2.604MetGly: 2.604 ± 0.396
0.383MetHis: 0.383 ± 0.179
0.843MetIle: 0.843 ± 0.293
1.685MetLys: 1.685 ± 0.427
2.068MetLeu: 2.068 ± 0.317
0.996MetMet: 0.996 ± 0.349
0.843MetAsn: 0.843 ± 0.279
1.072MetPro: 1.072 ± 0.254
1.302MetGln: 1.302 ± 0.335
1.762MetArg: 1.762 ± 0.282
2.374MetSer: 2.374 ± 0.377
1.991MetThr: 1.991 ± 0.393
1.379MetVal: 1.379 ± 0.361
0.613MetTrp: 0.613 ± 0.249
0.843MetTyr: 0.843 ± 0.262
0.0MetXaa: 0.0 ± 0.0
Asn
5.285AsnAla: 5.285 ± 0.749
0.153AsnCys: 0.153 ± 0.101
2.451AsnAsp: 2.451 ± 0.505
2.068AsnGlu: 2.068 ± 0.3
1.302AsnPhe: 1.302 ± 0.294
4.213AsnGly: 4.213 ± 0.75
0.919AsnHis: 0.919 ± 0.299
2.834AsnIle: 2.834 ± 0.504
1.532AsnLys: 1.532 ± 0.327
3.14AsnLeu: 3.14 ± 0.412
1.685AsnMet: 1.685 ± 0.36
2.757AsnAsn: 2.757 ± 0.761
3.37AsnPro: 3.37 ± 0.602
1.991AsnGln: 1.991 ± 0.539
3.064AsnArg: 3.064 ± 0.468
2.221AsnSer: 2.221 ± 0.48
2.987AsnThr: 2.987 ± 0.57
2.374AsnVal: 2.374 ± 0.446
1.302AsnTrp: 1.302 ± 0.333
1.532AsnTyr: 1.532 ± 0.249
0.0AsnXaa: 0.0 ± 0.0
Pro
4.213ProAla: 4.213 ± 0.73
0.613ProCys: 0.613 ± 0.177
3.37ProAsp: 3.37 ± 0.387
4.366ProGlu: 4.366 ± 0.646
2.604ProPhe: 2.604 ± 0.433
3.064ProGly: 3.064 ± 0.557
0.766ProHis: 0.766 ± 0.179
2.145ProIle: 2.145 ± 0.475
1.991ProLys: 1.991 ± 0.38
2.374ProLeu: 2.374 ± 0.393
1.149ProMet: 1.149 ± 0.279
2.221ProAsn: 2.221 ± 0.361
1.762ProPro: 1.762 ± 0.395
2.145ProGln: 2.145 ± 0.498
1.608ProArg: 1.608 ± 0.386
3.217ProSer: 3.217 ± 0.618
3.064ProThr: 3.064 ± 0.435
4.059ProVal: 4.059 ± 0.435
0.383ProTrp: 0.383 ± 0.26
1.838ProTyr: 1.838 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
5.438GlnAla: 5.438 ± 0.877
0.383GlnCys: 0.383 ± 0.166
2.451GlnAsp: 2.451 ± 0.407
2.987GlnGlu: 2.987 ± 0.631
1.302GlnPhe: 1.302 ± 0.311
3.447GlnGly: 3.447 ± 0.651
0.613GlnHis: 0.613 ± 0.258
2.068GlnIle: 2.068 ± 0.418
1.762GlnLys: 1.762 ± 0.37
3.447GlnLeu: 3.447 ± 0.614
1.302GlnMet: 1.302 ± 0.352
1.915GlnAsn: 1.915 ± 0.332
2.145GlnPro: 2.145 ± 0.523
3.217GlnGln: 3.217 ± 0.956
1.991GlnArg: 1.991 ± 0.507
3.447GlnSer: 3.447 ± 0.644
2.604GlnThr: 2.604 ± 0.518
2.221GlnVal: 2.221 ± 0.511
0.383GlnTrp: 0.383 ± 0.182
0.689GlnTyr: 0.689 ± 0.226
0.0GlnXaa: 0.0 ± 0.0
Arg
4.213ArgAla: 4.213 ± 0.474
0.689ArgCys: 0.689 ± 0.182
3.064ArgAsp: 3.064 ± 0.416
4.979ArgGlu: 4.979 ± 0.611
0.919ArgPhe: 0.919 ± 0.251
3.6ArgGly: 3.6 ± 0.468
0.536ArgHis: 0.536 ± 0.182
3.37ArgIle: 3.37 ± 0.575
3.906ArgLys: 3.906 ± 0.606
4.213ArgLeu: 4.213 ± 0.505
1.302ArgMet: 1.302 ± 0.293
2.987ArgAsn: 2.987 ± 0.44
2.298ArgPro: 2.298 ± 0.442
2.681ArgGln: 2.681 ± 0.6
3.217ArgArg: 3.217 ± 0.495
3.064ArgSer: 3.064 ± 0.444
2.528ArgThr: 2.528 ± 0.489
3.983ArgVal: 3.983 ± 0.595
0.843ArgTrp: 0.843 ± 0.22
1.838ArgTyr: 1.838 ± 0.386
0.0ArgXaa: 0.0 ± 0.0
Ser
6.74SerAla: 6.74 ± 0.777
0.306SerCys: 0.306 ± 0.137
3.064SerAsp: 3.064 ± 0.559
4.442SerGlu: 4.442 ± 0.709
2.298SerPhe: 2.298 ± 0.459
6.97SerGly: 6.97 ± 0.836
1.149SerHis: 1.149 ± 0.303
3.294SerIle: 3.294 ± 0.574
3.37SerLys: 3.37 ± 0.559
5.744SerLeu: 5.744 ± 0.629
2.068SerMet: 2.068 ± 0.412
3.294SerAsn: 3.294 ± 0.479
3.37SerPro: 3.37 ± 0.903
3.37SerGln: 3.37 ± 0.619
3.6SerArg: 3.6 ± 0.556
5.438SerSer: 5.438 ± 0.682
3.14SerThr: 3.14 ± 0.719
4.979SerVal: 4.979 ± 0.659
0.613SerTrp: 0.613 ± 0.204
2.298SerTyr: 2.298 ± 0.434
0.0SerXaa: 0.0 ± 0.0
Thr
5.208ThrAla: 5.208 ± 0.764
0.383ThrCys: 0.383 ± 0.251
2.528ThrAsp: 2.528 ± 0.482
2.987ThrGlu: 2.987 ± 0.501
1.379ThrPhe: 1.379 ± 0.322
6.817ThrGly: 6.817 ± 1.26
0.996ThrHis: 0.996 ± 0.32
3.447ThrIle: 3.447 ± 0.639
3.37ThrLys: 3.37 ± 0.465
3.906ThrLeu: 3.906 ± 0.528
1.149ThrMet: 1.149 ± 0.311
2.145ThrAsn: 2.145 ± 0.506
4.366ThrPro: 4.366 ± 0.661
1.685ThrGln: 1.685 ± 0.347
2.068ThrArg: 2.068 ± 0.417
3.6ThrSer: 3.6 ± 0.64
3.37ThrThr: 3.37 ± 0.583
3.983ThrVal: 3.983 ± 0.652
0.919ThrTrp: 0.919 ± 0.315
1.762ThrTyr: 1.762 ± 0.378
0.0ThrXaa: 0.0 ± 0.0
Val
4.749ValAla: 4.749 ± 0.72
0.843ValCys: 0.843 ± 0.25
4.596ValAsp: 4.596 ± 0.591
4.672ValGlu: 4.672 ± 0.791
2.221ValPhe: 2.221 ± 0.534
5.208ValGly: 5.208 ± 0.765
1.072ValHis: 1.072 ± 0.337
3.6ValIle: 3.6 ± 0.483
3.523ValLys: 3.523 ± 0.544
3.83ValLeu: 3.83 ± 0.503
1.685ValMet: 1.685 ± 0.399
3.064ValAsn: 3.064 ± 0.558
2.528ValPro: 2.528 ± 0.431
1.991ValGln: 1.991 ± 0.442
3.064ValArg: 3.064 ± 0.575
5.898ValSer: 5.898 ± 0.721
4.596ValThr: 4.596 ± 0.704
3.6ValVal: 3.6 ± 0.685
1.225ValTrp: 1.225 ± 0.427
2.604ValTyr: 2.604 ± 0.501
0.0ValXaa: 0.0 ± 0.0
Trp
1.608TrpAla: 1.608 ± 0.33
0.153TrpCys: 0.153 ± 0.102
1.225TrpAsp: 1.225 ± 0.351
1.225TrpGlu: 1.225 ± 0.374
0.613TrpPhe: 0.613 ± 0.182
0.843TrpGly: 0.843 ± 0.364
0.46TrpHis: 0.46 ± 0.164
0.843TrpIle: 0.843 ± 0.292
0.996TrpLys: 0.996 ± 0.264
0.613TrpLeu: 0.613 ± 0.221
0.46TrpMet: 0.46 ± 0.254
0.613TrpAsn: 0.613 ± 0.267
0.613TrpPro: 0.613 ± 0.206
0.919TrpGln: 0.919 ± 0.223
0.919TrpArg: 0.919 ± 0.318
1.455TrpSer: 1.455 ± 0.323
0.689TrpThr: 0.689 ± 0.231
0.919TrpVal: 0.919 ± 0.28
0.153TrpTrp: 0.153 ± 0.111
0.23TrpTyr: 0.23 ± 0.134
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.217TyrAla: 3.217 ± 0.456
0.766TyrCys: 0.766 ± 0.274
1.915TyrAsp: 1.915 ± 0.397
1.991TyrGlu: 1.991 ± 0.436
1.225TyrPhe: 1.225 ± 0.35
2.145TyrGly: 2.145 ± 0.36
0.46TyrHis: 0.46 ± 0.227
1.762TyrIle: 1.762 ± 0.415
1.302TyrLys: 1.302 ± 0.381
1.915TyrLeu: 1.915 ± 0.433
0.766TyrMet: 0.766 ± 0.22
1.838TyrAsn: 1.838 ± 0.357
1.225TyrPro: 1.225 ± 0.303
1.455TyrGln: 1.455 ± 0.33
2.757TyrArg: 2.757 ± 0.405
1.608TyrSer: 1.608 ± 0.351
1.532TyrThr: 1.532 ± 0.332
1.915TyrVal: 1.915 ± 0.408
0.613TyrTrp: 0.613 ± 0.269
0.843TyrTyr: 0.843 ± 0.247
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (13057 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski