Amino acid dipepetide frequency for Bacillus virus Glittering

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.162AlaAla: 2.162 ± 0.753
0.676AlaCys: 0.676 ± 0.312
3.514AlaAsp: 3.514 ± 0.574
3.919AlaGlu: 3.919 ± 0.488
3.243AlaPhe: 3.243 ± 0.409
4.459AlaGly: 4.459 ± 0.621
0.878AlaHis: 0.878 ± 0.265
4.73AlaIle: 4.73 ± 0.85
4.662AlaLys: 4.662 ± 0.582
5.203AlaLeu: 5.203 ± 0.658
2.23AlaMet: 2.23 ± 0.511
3.311AlaAsn: 3.311 ± 0.463
2.568AlaPro: 2.568 ± 0.597
2.297AlaGln: 2.297 ± 0.41
3.108AlaArg: 3.108 ± 0.507
3.649AlaSer: 3.649 ± 0.52
3.446AlaThr: 3.446 ± 0.581
4.324AlaVal: 4.324 ± 0.554
1.081AlaTrp: 1.081 ± 0.283
3.108AlaTyr: 3.108 ± 0.364
0.0AlaXaa: 0.0 ± 0.0
Cys
0.405CysAla: 0.405 ± 0.198
0.203CysCys: 0.203 ± 0.1
0.811CysAsp: 0.811 ± 0.264
0.608CysGlu: 0.608 ± 0.21
0.608CysPhe: 0.608 ± 0.194
0.676CysGly: 0.676 ± 0.204
0.405CysHis: 0.405 ± 0.224
0.473CysIle: 0.473 ± 0.195
1.081CysLys: 1.081 ± 0.308
0.743CysLeu: 0.743 ± 0.244
0.068CysMet: 0.068 ± 0.067
0.27CysAsn: 0.27 ± 0.175
0.27CysPro: 0.27 ± 0.135
0.338CysGln: 0.338 ± 0.137
0.27CysArg: 0.27 ± 0.128
0.473CysSer: 0.473 ± 0.174
0.878CysThr: 0.878 ± 0.256
0.27CysVal: 0.27 ± 0.136
0.135CysTrp: 0.135 ± 0.093
0.338CysTyr: 0.338 ± 0.144
0.0CysXaa: 0.0 ± 0.0
Asp
3.919AspAla: 3.919 ± 0.515
0.743AspCys: 0.743 ± 0.3
3.446AspAsp: 3.446 ± 0.493
4.257AspGlu: 4.257 ± 0.57
3.108AspPhe: 3.108 ± 0.514
4.122AspGly: 4.122 ± 0.523
1.419AspHis: 1.419 ± 0.322
5.0AspIle: 5.0 ± 0.557
4.932AspLys: 4.932 ± 0.658
4.932AspLeu: 4.932 ± 0.537
1.486AspMet: 1.486 ± 0.32
2.432AspAsn: 2.432 ± 0.456
3.176AspPro: 3.176 ± 0.42
2.162AspGln: 2.162 ± 0.364
3.581AspArg: 3.581 ± 0.475
3.041AspSer: 3.041 ± 0.345
2.973AspThr: 2.973 ± 0.498
4.054AspVal: 4.054 ± 0.431
1.014AspTrp: 1.014 ± 0.26
2.973AspTyr: 2.973 ± 0.391
0.0AspXaa: 0.0 ± 0.0
Glu
5.608GluAla: 5.608 ± 0.709
0.473GluCys: 0.473 ± 0.181
4.189GluAsp: 4.189 ± 0.515
9.054GluGlu: 9.054 ± 1.066
3.851GluPhe: 3.851 ± 0.63
4.932GluGly: 4.932 ± 0.567
1.824GluHis: 1.824 ± 0.391
4.324GluIle: 4.324 ± 0.516
5.541GluLys: 5.541 ± 0.681
8.176GluLeu: 8.176 ± 0.829
2.23GluMet: 2.23 ± 0.383
3.311GluAsn: 3.311 ± 0.489
1.689GluPro: 1.689 ± 0.439
3.851GluGln: 3.851 ± 0.534
3.514GluArg: 3.514 ± 0.534
4.122GluSer: 4.122 ± 0.579
3.784GluThr: 3.784 ± 0.509
4.932GluVal: 4.932 ± 0.583
0.743GluTrp: 0.743 ± 0.197
3.108GluTyr: 3.108 ± 0.519
0.0GluXaa: 0.0 ± 0.0
Phe
2.5PheAla: 2.5 ± 0.537
0.338PheCys: 0.338 ± 0.136
3.716PheAsp: 3.716 ± 0.497
3.716PheGlu: 3.716 ± 0.531
1.486PhePhe: 1.486 ± 0.373
3.041PheGly: 3.041 ± 0.409
0.946PheHis: 0.946 ± 0.287
3.176PheIle: 3.176 ± 0.484
3.176PheLys: 3.176 ± 0.501
3.716PheLeu: 3.716 ± 0.489
1.689PheMet: 1.689 ± 0.379
1.892PheAsn: 1.892 ± 0.331
1.419PhePro: 1.419 ± 0.316
1.284PheGln: 1.284 ± 0.287
1.622PheArg: 1.622 ± 0.324
2.365PheSer: 2.365 ± 0.385
3.919PheThr: 3.919 ± 0.59
2.77PheVal: 2.77 ± 0.357
0.27PheTrp: 0.27 ± 0.13
1.959PheTyr: 1.959 ± 0.409
0.0PheXaa: 0.0 ± 0.0
Gly
5.068GlyAla: 5.068 ± 0.641
0.608GlyCys: 0.608 ± 0.192
5.0GlyAsp: 5.0 ± 0.622
4.797GlyGlu: 4.797 ± 0.515
3.446GlyPhe: 3.446 ± 0.43
4.662GlyGly: 4.662 ± 0.677
1.014GlyHis: 1.014 ± 0.308
4.595GlyIle: 4.595 ± 0.502
5.27GlyLys: 5.27 ± 0.654
5.068GlyLeu: 5.068 ± 0.581
2.027GlyMet: 2.027 ± 0.463
3.649GlyAsn: 3.649 ± 1.002
0.135GlyPro: 0.135 ± 0.115
2.432GlyGln: 2.432 ± 0.453
3.108GlyArg: 3.108 ± 0.448
4.797GlySer: 4.797 ± 0.594
3.311GlyThr: 3.311 ± 0.538
4.932GlyVal: 4.932 ± 0.795
0.946GlyTrp: 0.946 ± 0.224
2.365GlyTyr: 2.365 ± 0.428
0.0GlyXaa: 0.0 ± 0.0
His
1.014HisAla: 1.014 ± 0.252
0.203HisCys: 0.203 ± 0.174
0.743HisAsp: 0.743 ± 0.259
1.284HisGlu: 1.284 ± 0.326
1.419HisPhe: 1.419 ± 0.312
1.892HisGly: 1.892 ± 0.452
0.405HisHis: 0.405 ± 0.141
1.892HisIle: 1.892 ± 0.413
1.419HisLys: 1.419 ± 0.27
1.622HisLeu: 1.622 ± 0.427
0.743HisMet: 0.743 ± 0.172
0.946HisAsn: 0.946 ± 0.215
1.014HisPro: 1.014 ± 0.228
0.878HisGln: 0.878 ± 0.298
0.811HisArg: 0.811 ± 0.231
1.824HisSer: 1.824 ± 0.414
1.014HisThr: 1.014 ± 0.321
0.811HisVal: 0.811 ± 0.269
0.338HisTrp: 0.338 ± 0.152
1.081HisTyr: 1.081 ± 0.334
0.0HisXaa: 0.0 ± 0.0
Ile
4.797IleAla: 4.797 ± 0.576
0.676IleCys: 0.676 ± 0.256
5.068IleAsp: 5.068 ± 0.595
5.135IleGlu: 5.135 ± 0.546
2.297IlePhe: 2.297 ± 0.369
4.459IleGly: 4.459 ± 0.553
1.959IleHis: 1.959 ± 0.277
4.324IleIle: 4.324 ± 0.587
6.284IleLys: 6.284 ± 0.712
4.797IleLeu: 4.797 ± 0.552
2.297IleMet: 2.297 ± 0.456
3.378IleAsn: 3.378 ± 0.513
2.568IlePro: 2.568 ± 0.405
2.568IleGln: 2.568 ± 0.359
3.176IleArg: 3.176 ± 0.421
3.716IleSer: 3.716 ± 0.502
4.122IleThr: 4.122 ± 0.482
4.054IleVal: 4.054 ± 0.632
0.405IleTrp: 0.405 ± 0.125
2.23IleTyr: 2.23 ± 0.444
0.0IleXaa: 0.0 ± 0.0
Lys
4.527LysAla: 4.527 ± 0.633
0.608LysCys: 0.608 ± 0.25
5.27LysAsp: 5.27 ± 0.667
7.095LysGlu: 7.095 ± 0.82
3.514LysPhe: 3.514 ± 0.563
4.595LysGly: 4.595 ± 0.794
2.635LysHis: 2.635 ± 0.47
3.176LysIle: 3.176 ± 0.454
7.027LysLys: 7.027 ± 1.003
5.676LysLeu: 5.676 ± 0.651
2.365LysMet: 2.365 ± 0.459
3.581LysAsn: 3.581 ± 0.38
3.041LysPro: 3.041 ± 0.392
3.851LysGln: 3.851 ± 0.569
4.595LysArg: 4.595 ± 0.73
4.189LysSer: 4.189 ± 0.58
4.459LysThr: 4.459 ± 0.572
5.541LysVal: 5.541 ± 0.702
0.541LysTrp: 0.541 ± 0.189
3.041LysTyr: 3.041 ± 0.466
0.0LysXaa: 0.0 ± 0.0
Leu
4.797LeuAla: 4.797 ± 0.693
0.743LeuCys: 0.743 ± 0.231
4.73LeuAsp: 4.73 ± 0.596
6.689LeuGlu: 6.689 ± 0.857
2.5LeuPhe: 2.5 ± 0.358
5.878LeuGly: 5.878 ± 0.655
1.486LeuHis: 1.486 ± 0.25
5.068LeuIle: 5.068 ± 0.669
7.297LeuLys: 7.297 ± 0.823
5.608LeuLeu: 5.608 ± 0.689
1.892LeuMet: 1.892 ± 0.334
4.054LeuAsn: 4.054 ± 0.532
2.838LeuPro: 2.838 ± 0.502
2.365LeuGln: 2.365 ± 0.446
3.378LeuArg: 3.378 ± 0.385
5.338LeuSer: 5.338 ± 0.614
5.473LeuThr: 5.473 ± 0.583
4.932LeuVal: 4.932 ± 0.551
0.608LeuTrp: 0.608 ± 0.178
2.838LeuTyr: 2.838 ± 0.535
0.0LeuXaa: 0.0 ± 0.0
Met
2.23MetAla: 2.23 ± 0.336
0.203MetCys: 0.203 ± 0.127
1.824MetAsp: 1.824 ± 0.307
1.351MetGlu: 1.351 ± 0.317
1.351MetPhe: 1.351 ± 0.344
1.554MetGly: 1.554 ± 0.418
0.473MetHis: 0.473 ± 0.16
1.959MetIle: 1.959 ± 0.36
2.635MetLys: 2.635 ± 0.37
1.892MetLeu: 1.892 ± 0.347
0.743MetMet: 0.743 ± 0.262
1.284MetAsn: 1.284 ± 0.278
0.743MetPro: 0.743 ± 0.223
0.811MetGln: 0.811 ± 0.254
0.946MetArg: 0.946 ± 0.27
3.108MetSer: 3.108 ± 0.448
2.095MetThr: 2.095 ± 0.357
1.216MetVal: 1.216 ± 0.26
0.203MetTrp: 0.203 ± 0.11
0.946MetTyr: 0.946 ± 0.3
0.0MetXaa: 0.0 ± 0.0
Asn
2.568AsnAla: 2.568 ± 0.504
0.676AsnCys: 0.676 ± 0.27
2.5AsnAsp: 2.5 ± 0.374
3.851AsnGlu: 3.851 ± 0.463
2.162AsnPhe: 2.162 ± 0.311
3.986AsnGly: 3.986 ± 0.537
1.014AsnHis: 1.014 ± 0.245
3.378AsnIle: 3.378 ± 0.531
4.054AsnLys: 4.054 ± 0.451
3.581AsnLeu: 3.581 ± 0.486
1.757AsnMet: 1.757 ± 0.322
2.905AsnAsn: 2.905 ± 0.491
1.892AsnPro: 1.892 ± 0.314
1.622AsnGln: 1.622 ± 0.356
2.027AsnArg: 2.027 ± 0.384
2.77AsnSer: 2.77 ± 0.424
3.108AsnThr: 3.108 ± 0.718
3.649AsnVal: 3.649 ± 0.714
0.541AsnTrp: 0.541 ± 0.173
1.622AsnTyr: 1.622 ± 0.298
0.0AsnXaa: 0.0 ± 0.0
Pro
2.432ProAla: 2.432 ± 0.412
0.203ProCys: 0.203 ± 0.12
2.635ProAsp: 2.635 ± 0.501
3.108ProGlu: 3.108 ± 0.453
1.824ProPhe: 1.824 ± 0.371
0.0ProGly: 0.0 ± 0.0
0.878ProHis: 0.878 ± 0.286
2.432ProIle: 2.432 ± 0.475
2.297ProLys: 2.297 ± 0.429
3.446ProLeu: 3.446 ± 0.666
1.149ProMet: 1.149 ± 0.278
1.419ProAsn: 1.419 ± 0.272
1.014ProPro: 1.014 ± 0.334
0.946ProGln: 0.946 ± 0.239
1.486ProArg: 1.486 ± 0.317
2.297ProSer: 2.297 ± 0.532
1.622ProThr: 1.622 ± 0.296
1.824ProVal: 1.824 ± 0.332
0.203ProTrp: 0.203 ± 0.117
0.676ProTyr: 0.676 ± 0.226
0.0ProXaa: 0.0 ± 0.0
Gln
2.5GlnAla: 2.5 ± 0.426
0.338GlnCys: 0.338 ± 0.17
2.365GlnAsp: 2.365 ± 0.432
3.716GlnGlu: 3.716 ± 0.401
1.757GlnPhe: 1.757 ± 0.323
2.095GlnGly: 2.095 ± 0.404
0.473GlnHis: 0.473 ± 0.151
2.568GlnIle: 2.568 ± 0.464
2.973GlnLys: 2.973 ± 0.473
3.311GlnLeu: 3.311 ± 0.359
0.878GlnMet: 0.878 ± 0.222
1.216GlnAsn: 1.216 ± 0.395
1.284GlnPro: 1.284 ± 0.268
0.946GlnGln: 0.946 ± 0.319
1.757GlnArg: 1.757 ± 0.364
2.297GlnSer: 2.297 ± 0.449
1.959GlnThr: 1.959 ± 0.364
2.568GlnVal: 2.568 ± 0.462
0.608GlnTrp: 0.608 ± 0.222
1.284GlnTyr: 1.284 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
3.378ArgAla: 3.378 ± 0.562
0.27ArgCys: 0.27 ± 0.143
2.77ArgAsp: 2.77 ± 0.363
3.716ArgGlu: 3.716 ± 0.557
1.959ArgPhe: 1.959 ± 0.362
2.838ArgGly: 2.838 ± 0.44
0.878ArgHis: 0.878 ± 0.244
3.446ArgIle: 3.446 ± 0.464
3.311ArgLys: 3.311 ± 0.483
3.378ArgLeu: 3.378 ± 0.553
0.743ArgMet: 0.743 ± 0.229
2.162ArgAsn: 2.162 ± 0.419
1.149ArgPro: 1.149 ± 0.33
2.027ArgGln: 2.027 ± 0.325
2.905ArgArg: 2.905 ± 0.501
2.5ArgSer: 2.5 ± 0.387
2.365ArgThr: 2.365 ± 0.57
3.041ArgVal: 3.041 ± 0.473
0.608ArgTrp: 0.608 ± 0.179
1.757ArgTyr: 1.757 ± 0.367
0.0ArgXaa: 0.0 ± 0.0
Ser
3.514SerAla: 3.514 ± 0.615
0.676SerCys: 0.676 ± 0.271
3.243SerAsp: 3.243 ± 0.641
4.527SerGlu: 4.527 ± 0.527
2.905SerPhe: 2.905 ± 0.463
5.203SerGly: 5.203 ± 0.745
1.486SerHis: 1.486 ± 0.329
5.068SerIle: 5.068 ± 0.655
5.338SerLys: 5.338 ± 0.668
4.73SerLeu: 4.73 ± 0.646
1.351SerMet: 1.351 ± 0.278
4.054SerAsn: 4.054 ± 0.647
1.689SerPro: 1.689 ± 0.394
2.703SerGln: 2.703 ± 0.379
2.162SerArg: 2.162 ± 0.283
4.595SerSer: 4.595 ± 0.744
3.176SerThr: 3.176 ± 0.672
3.581SerVal: 3.581 ± 0.473
1.014SerTrp: 1.014 ± 0.289
1.622SerTyr: 1.622 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
4.122ThrAla: 4.122 ± 0.685
0.338ThrCys: 0.338 ± 0.145
3.176ThrAsp: 3.176 ± 0.519
4.797ThrGlu: 4.797 ± 0.601
3.041ThrPhe: 3.041 ± 0.405
4.797ThrGly: 4.797 ± 0.716
0.946ThrHis: 0.946 ± 0.253
3.986ThrIle: 3.986 ± 0.494
3.378ThrLys: 3.378 ± 0.526
4.189ThrLeu: 4.189 ± 0.645
0.946ThrMet: 0.946 ± 0.295
3.243ThrAsn: 3.243 ± 0.577
2.838ThrPro: 2.838 ± 0.577
1.689ThrGln: 1.689 ± 0.383
2.23ThrArg: 2.23 ± 0.386
4.054ThrSer: 4.054 ± 0.938
2.973ThrThr: 2.973 ± 0.467
4.122ThrVal: 4.122 ± 0.659
0.878ThrTrp: 0.878 ± 0.299
2.838ThrTyr: 2.838 ± 0.495
0.0ThrXaa: 0.0 ± 0.0
Val
4.527ValAla: 4.527 ± 0.541
0.878ValCys: 0.878 ± 0.241
4.054ValAsp: 4.054 ± 0.468
3.784ValGlu: 3.784 ± 0.543
2.703ValPhe: 2.703 ± 0.457
3.716ValGly: 3.716 ± 0.517
1.216ValHis: 1.216 ± 0.293
5.405ValIle: 5.405 ± 0.704
5.068ValLys: 5.068 ± 0.565
4.662ValLeu: 4.662 ± 0.659
1.757ValMet: 1.757 ± 0.281
3.311ValAsn: 3.311 ± 0.387
1.892ValPro: 1.892 ± 0.276
2.703ValGln: 2.703 ± 0.377
2.5ValArg: 2.5 ± 0.363
3.581ValSer: 3.581 ± 0.545
4.392ValThr: 4.392 ± 0.668
4.189ValVal: 4.189 ± 0.487
0.473ValTrp: 0.473 ± 0.197
3.176ValTyr: 3.176 ± 0.364
0.0ValXaa: 0.0 ± 0.0
Trp
0.405TrpAla: 0.405 ± 0.133
0.27TrpCys: 0.27 ± 0.123
0.743TrpAsp: 0.743 ± 0.243
0.676TrpGlu: 0.676 ± 0.239
0.405TrpPhe: 0.405 ± 0.178
0.676TrpGly: 0.676 ± 0.22
0.338TrpHis: 0.338 ± 0.15
0.676TrpIle: 0.676 ± 0.169
0.878TrpLys: 0.878 ± 0.244
0.946TrpLeu: 0.946 ± 0.242
0.405TrpMet: 0.405 ± 0.172
0.878TrpAsn: 0.878 ± 0.277
0.0TrpPro: 0.0 ± 0.0
0.203TrpGln: 0.203 ± 0.112
0.473TrpArg: 0.473 ± 0.232
1.081TrpSer: 1.081 ± 0.341
0.608TrpThr: 0.608 ± 0.235
0.743TrpVal: 0.743 ± 0.257
0.203TrpTrp: 0.203 ± 0.122
0.541TrpTyr: 0.541 ± 0.17
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.095TyrAla: 2.095 ± 0.371
0.203TyrCys: 0.203 ± 0.127
3.041TyrAsp: 3.041 ± 0.393
3.176TyrGlu: 3.176 ± 0.432
1.419TyrPhe: 1.419 ± 0.361
3.649TyrGly: 3.649 ± 0.442
0.676TyrHis: 0.676 ± 0.218
2.635TyrIle: 2.635 ± 0.465
2.703TyrLys: 2.703 ± 0.592
2.838TyrLeu: 2.838 ± 0.441
0.743TyrMet: 0.743 ± 0.262
2.365TyrAsn: 2.365 ± 0.538
0.676TyrPro: 0.676 ± 0.255
1.216TyrGln: 1.216 ± 0.248
1.554TyrArg: 1.554 ± 0.483
2.838TyrSer: 2.838 ± 0.379
2.905TyrThr: 2.905 ± 0.435
2.432TyrVal: 2.432 ± 0.443
0.338TyrTrp: 0.338 ± 0.143
1.284TyrTyr: 1.284 ± 0.267
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (14801 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski