Amino acid dipepetide frequency for Pyrobaculum spherical virus (isolate United States/Yellowstone) (PSV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.408AlaAla: 6.408 ± 0.88
0.356AlaCys: 0.356 ± 0.19
4.035AlaAsp: 4.035 ± 0.718
3.679AlaGlu: 3.679 ± 0.839
3.679AlaPhe: 3.679 ± 0.801
5.933AlaGly: 5.933 ± 0.928
1.78AlaHis: 1.78 ± 0.426
7.832AlaIle: 7.832 ± 0.901
3.916AlaLys: 3.916 ± 0.765
11.867AlaLeu: 11.867 ± 1.278
3.085AlaMet: 3.085 ± 0.539
1.305AlaAsn: 1.305 ± 0.395
2.967AlaPro: 2.967 ± 0.483
2.136AlaGln: 2.136 ± 0.523
4.509AlaArg: 4.509 ± 0.949
4.272AlaSer: 4.272 ± 0.731
4.628AlaThr: 4.628 ± 0.845
10.087AlaVal: 10.087 ± 1.166
1.543AlaTrp: 1.543 ± 0.521
3.916AlaTyr: 3.916 ± 0.617
0.0AlaXaa: 0.0 ± 0.0
Cys
0.831CysAla: 0.831 ± 0.334
0.0CysCys: 0.0 ± 0.0
0.237CysAsp: 0.237 ± 0.145
0.831CysGlu: 0.831 ± 0.285
0.0CysPhe: 0.0 ± 0.0
0.949CysGly: 0.949 ± 0.34
0.0CysHis: 0.0 ± 0.0
0.475CysIle: 0.475 ± 0.207
0.712CysLys: 0.712 ± 0.26
0.831CysLeu: 0.831 ± 0.357
0.237CysMet: 0.237 ± 0.164
0.712CysAsn: 0.712 ± 0.314
0.475CysPro: 0.475 ± 0.247
0.237CysGln: 0.237 ± 0.185
0.949CysArg: 0.949 ± 0.346
0.237CysSer: 0.237 ± 0.16
0.475CysThr: 0.475 ± 0.226
1.424CysVal: 1.424 ± 0.42
0.475CysTrp: 0.475 ± 0.227
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.611AspAla: 2.611 ± 0.504
0.712AspCys: 0.712 ± 0.247
1.899AspAsp: 1.899 ± 0.658
1.543AspGlu: 1.543 ± 0.355
2.017AspPhe: 2.017 ± 0.478
3.323AspGly: 3.323 ± 0.633
0.475AspHis: 0.475 ± 0.274
3.085AspIle: 3.085 ± 0.606
2.611AspLys: 2.611 ± 0.521
3.204AspLeu: 3.204 ± 0.539
1.78AspMet: 1.78 ± 0.361
1.305AspAsn: 1.305 ± 0.426
2.255AspPro: 2.255 ± 0.378
0.831AspGln: 0.831 ± 0.328
2.729AspArg: 2.729 ± 0.583
1.187AspSer: 1.187 ± 0.367
1.899AspThr: 1.899 ± 0.442
4.984AspVal: 4.984 ± 0.794
0.712AspTrp: 0.712 ± 0.363
2.373AspTyr: 2.373 ± 0.434
0.0AspXaa: 0.0 ± 0.0
Glu
5.696GluAla: 5.696 ± 0.999
0.831GluCys: 0.831 ± 0.361
1.78GluAsp: 1.78 ± 0.47
4.865GluGlu: 4.865 ± 1.084
2.255GluPhe: 2.255 ± 0.461
3.56GluGly: 3.56 ± 0.838
0.356GluHis: 0.356 ± 0.168
3.323GluIle: 3.323 ± 0.673
3.56GluLys: 3.56 ± 0.677
3.916GluLeu: 3.916 ± 0.649
0.712GluMet: 0.712 ± 0.278
1.543GluAsn: 1.543 ± 0.532
2.373GluPro: 2.373 ± 0.492
0.237GluGln: 0.237 ± 0.171
2.373GluArg: 2.373 ± 0.703
2.255GluSer: 2.255 ± 0.482
2.848GluThr: 2.848 ± 0.754
4.628GluVal: 4.628 ± 0.818
0.356GluTrp: 0.356 ± 0.205
2.729GluTyr: 2.729 ± 0.62
0.0GluXaa: 0.0 ± 0.0
Phe
3.085PheAla: 3.085 ± 0.679
0.475PheCys: 0.475 ± 0.225
1.305PheAsp: 1.305 ± 0.4
1.899PheGlu: 1.899 ± 0.466
2.017PhePhe: 2.017 ± 0.558
4.153PheGly: 4.153 ± 0.627
0.712PheHis: 0.712 ± 0.272
3.085PheIle: 3.085 ± 0.6
2.729PheLys: 2.729 ± 0.627
3.323PheLeu: 3.323 ± 0.685
1.305PheMet: 1.305 ± 0.408
1.78PheAsn: 1.78 ± 0.399
0.949PhePro: 0.949 ± 0.298
0.949PheGln: 0.949 ± 0.281
1.068PheArg: 1.068 ± 0.308
2.611PheSer: 2.611 ± 0.647
3.085PheThr: 3.085 ± 0.552
4.272PheVal: 4.272 ± 0.807
0.475PheTrp: 0.475 ± 0.301
1.899PheTyr: 1.899 ± 0.49
0.0PheXaa: 0.0 ± 0.0
Gly
6.408GlyAla: 6.408 ± 1.078
0.949GlyCys: 0.949 ± 0.293
2.848GlyAsp: 2.848 ± 0.518
2.967GlyGlu: 2.967 ± 0.639
3.679GlyPhe: 3.679 ± 0.691
6.645GlyGly: 6.645 ± 1.153
0.949GlyHis: 0.949 ± 0.392
4.272GlyIle: 4.272 ± 0.826
3.204GlyLys: 3.204 ± 0.487
8.663GlyLeu: 8.663 ± 1.441
1.543GlyMet: 1.543 ± 0.418
2.611GlyAsn: 2.611 ± 0.524
2.255GlyPro: 2.255 ± 0.531
1.187GlyGln: 1.187 ± 0.374
3.323GlyArg: 3.323 ± 0.686
3.441GlySer: 3.441 ± 0.768
3.204GlyThr: 3.204 ± 0.703
7.357GlyVal: 7.357 ± 1.001
1.78GlyTrp: 1.78 ± 0.513
4.628GlyTyr: 4.628 ± 0.852
0.119GlyXaa: 0.119 ± 0.14
His
1.543HisAla: 1.543 ± 0.427
0.0HisCys: 0.0 ± 0.0
1.424HisAsp: 1.424 ± 0.366
0.119HisGlu: 0.119 ± 0.107
0.712HisPhe: 0.712 ± 0.314
1.068HisGly: 1.068 ± 0.326
0.119HisHis: 0.119 ± 0.12
1.899HisIle: 1.899 ± 0.505
0.712HisLys: 0.712 ± 0.31
1.068HisLeu: 1.068 ± 0.315
0.356HisMet: 0.356 ± 0.225
0.712HisAsn: 0.712 ± 0.331
0.949HisPro: 0.949 ± 0.304
0.237HisGln: 0.237 ± 0.151
0.475HisArg: 0.475 ± 0.205
0.593HisSer: 0.593 ± 0.288
0.949HisThr: 0.949 ± 0.348
2.492HisVal: 2.492 ± 0.574
0.119HisTrp: 0.119 ± 0.12
1.068HisTyr: 1.068 ± 0.333
0.0HisXaa: 0.0 ± 0.0
Ile
5.933IleAla: 5.933 ± 1.009
0.712IleCys: 0.712 ± 0.249
3.323IleAsp: 3.323 ± 0.641
3.204IleGlu: 3.204 ± 0.62
3.679IlePhe: 3.679 ± 0.69
4.984IleGly: 4.984 ± 0.869
0.712IleHis: 0.712 ± 0.333
4.984IleIle: 4.984 ± 0.733
2.967IleLys: 2.967 ± 0.538
6.527IleLeu: 6.527 ± 0.943
2.255IleMet: 2.255 ± 0.474
2.255IleAsn: 2.255 ± 0.509
3.56IlePro: 3.56 ± 0.631
1.78IleGln: 1.78 ± 0.546
3.441IleArg: 3.441 ± 0.704
3.204IleSer: 3.204 ± 0.446
4.035IleThr: 4.035 ± 0.757
5.221IleVal: 5.221 ± 0.806
0.712IleTrp: 0.712 ± 0.261
3.916IleTyr: 3.916 ± 0.781
0.0IleXaa: 0.0 ± 0.0
Lys
5.221LysAla: 5.221 ± 0.894
0.949LysCys: 0.949 ± 0.459
2.611LysAsp: 2.611 ± 0.523
3.323LysGlu: 3.323 ± 0.617
2.255LysPhe: 2.255 ± 0.413
2.729LysGly: 2.729 ± 0.574
1.068LysHis: 1.068 ± 0.383
2.373LysIle: 2.373 ± 0.509
2.848LysLys: 2.848 ± 0.678
3.56LysLeu: 3.56 ± 0.741
2.492LysMet: 2.492 ± 0.699
1.424LysAsn: 1.424 ± 0.381
2.373LysPro: 2.373 ± 0.523
0.712LysGln: 0.712 ± 0.272
2.492LysArg: 2.492 ± 0.543
2.729LysSer: 2.729 ± 0.627
3.323LysThr: 3.323 ± 0.53
5.815LysVal: 5.815 ± 0.858
1.068LysTrp: 1.068 ± 0.35
2.967LysTyr: 2.967 ± 0.649
0.0LysXaa: 0.0 ± 0.0
Leu
10.68LeuAla: 10.68 ± 1.324
0.831LeuCys: 0.831 ± 0.326
3.56LeuAsp: 3.56 ± 0.62
4.509LeuGlu: 4.509 ± 0.781
4.391LeuPhe: 4.391 ± 0.928
8.781LeuGly: 8.781 ± 1.271
0.831LeuHis: 0.831 ± 0.28
7.12LeuIle: 7.12 ± 0.845
6.764LeuLys: 6.764 ± 0.96
9.493LeuLeu: 9.493 ± 1.436
2.729LeuMet: 2.729 ± 0.549
2.729LeuAsn: 2.729 ± 0.461
3.323LeuPro: 3.323 ± 0.685
2.373LeuGln: 2.373 ± 0.564
5.459LeuArg: 5.459 ± 0.831
7.595LeuSer: 7.595 ± 0.958
3.916LeuThr: 3.916 ± 0.613
9.019LeuVal: 9.019 ± 1.19
1.187LeuTrp: 1.187 ± 0.38
6.408LeuTyr: 6.408 ± 0.762
0.0LeuXaa: 0.0 ± 0.0
Met
4.153MetAla: 4.153 ± 0.615
0.356MetCys: 0.356 ± 0.19
0.593MetAsp: 0.593 ± 0.237
0.949MetGlu: 0.949 ± 0.371
1.543MetPhe: 1.543 ± 0.383
1.899MetGly: 1.899 ± 0.47
0.831MetHis: 0.831 ± 0.342
1.187MetIle: 1.187 ± 0.425
0.712MetLys: 0.712 ± 0.266
3.56MetLeu: 3.56 ± 0.8
0.475MetMet: 0.475 ± 0.254
0.712MetAsn: 0.712 ± 0.283
2.136MetPro: 2.136 ± 0.431
0.237MetGln: 0.237 ± 0.156
2.492MetArg: 2.492 ± 0.473
1.899MetSer: 1.899 ± 0.434
0.949MetThr: 0.949 ± 0.298
3.679MetVal: 3.679 ± 0.582
0.237MetTrp: 0.237 ± 0.163
0.593MetTyr: 0.593 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
3.323AsnAla: 3.323 ± 0.498
0.119AsnCys: 0.119 ± 0.121
1.424AsnAsp: 1.424 ± 0.373
1.424AsnGlu: 1.424 ± 0.403
1.424AsnPhe: 1.424 ± 0.412
3.323AsnGly: 3.323 ± 0.654
0.593AsnHis: 0.593 ± 0.228
2.373AsnIle: 2.373 ± 0.604
2.492AsnLys: 2.492 ± 0.689
2.611AsnLeu: 2.611 ± 0.461
1.068AsnMet: 1.068 ± 0.252
1.187AsnAsn: 1.187 ± 0.344
2.255AsnPro: 2.255 ± 0.56
0.475AsnGln: 0.475 ± 0.217
1.661AsnArg: 1.661 ± 0.35
1.305AsnSer: 1.305 ± 0.414
1.78AsnThr: 1.78 ± 0.448
3.441AsnVal: 3.441 ± 0.673
0.475AsnTrp: 0.475 ± 0.236
1.543AsnTyr: 1.543 ± 0.473
0.0AsnXaa: 0.0 ± 0.0
Pro
2.017ProAla: 2.017 ± 0.484
0.475ProCys: 0.475 ± 0.263
1.424ProAsp: 1.424 ± 0.467
2.967ProGlu: 2.967 ± 0.7
1.899ProPhe: 1.899 ± 0.386
1.424ProGly: 1.424 ± 0.383
0.593ProHis: 0.593 ± 0.241
2.729ProIle: 2.729 ± 0.487
2.729ProLys: 2.729 ± 0.67
5.34ProLeu: 5.34 ± 0.887
1.661ProMet: 1.661 ± 0.341
1.661ProAsn: 1.661 ± 0.393
2.373ProPro: 2.373 ± 0.764
1.424ProGln: 1.424 ± 0.401
2.967ProArg: 2.967 ± 0.684
2.729ProSer: 2.729 ± 0.658
2.729ProThr: 2.729 ± 0.608
3.085ProVal: 3.085 ± 0.633
0.119ProTrp: 0.119 ± 0.107
1.899ProTyr: 1.899 ± 0.454
0.0ProXaa: 0.0 ± 0.0
Gln
2.017GlnAla: 2.017 ± 0.541
0.237GlnCys: 0.237 ± 0.157
0.712GlnAsp: 0.712 ± 0.297
1.543GlnGlu: 1.543 ± 0.362
0.593GlnPhe: 0.593 ± 0.247
1.187GlnGly: 1.187 ± 0.468
0.119GlnHis: 0.119 ± 0.101
2.492GlnIle: 2.492 ± 0.484
0.949GlnLys: 0.949 ± 0.352
2.255GlnLeu: 2.255 ± 0.662
0.356GlnMet: 0.356 ± 0.183
0.593GlnAsn: 0.593 ± 0.232
0.949GlnPro: 0.949 ± 0.318
1.068GlnGln: 1.068 ± 0.454
1.187GlnArg: 1.187 ± 0.359
0.712GlnSer: 0.712 ± 0.289
1.305GlnThr: 1.305 ± 0.338
2.255GlnVal: 2.255 ± 0.514
0.356GlnTrp: 0.356 ± 0.205
1.305GlnTyr: 1.305 ± 0.52
0.0GlnXaa: 0.0 ± 0.0
Arg
3.679ArgAla: 3.679 ± 0.626
0.356ArgCys: 0.356 ± 0.217
2.017ArgAsp: 2.017 ± 0.387
3.441ArgGlu: 3.441 ± 0.621
1.305ArgPhe: 1.305 ± 0.401
3.679ArgGly: 3.679 ± 0.666
2.136ArgHis: 2.136 ± 0.466
3.679ArgIle: 3.679 ± 0.598
2.492ArgLys: 2.492 ± 0.512
7.001ArgLeu: 7.001 ± 1.054
1.543ArgMet: 1.543 ± 0.546
1.899ArgAsn: 1.899 ± 0.538
1.068ArgPro: 1.068 ± 0.37
1.305ArgGln: 1.305 ± 0.311
4.984ArgArg: 4.984 ± 0.879
2.848ArgSer: 2.848 ± 0.499
2.492ArgThr: 2.492 ± 0.625
4.628ArgVal: 4.628 ± 0.804
0.831ArgTrp: 0.831 ± 0.345
1.899ArgTyr: 1.899 ± 0.458
0.0ArgXaa: 0.0 ± 0.0
Ser
3.797SerAla: 3.797 ± 0.684
0.356SerCys: 0.356 ± 0.184
2.611SerAsp: 2.611 ± 0.646
2.967SerGlu: 2.967 ± 0.676
1.661SerPhe: 1.661 ± 0.371
4.153SerGly: 4.153 ± 0.771
0.949SerHis: 0.949 ± 0.32
3.679SerIle: 3.679 ± 0.841
2.136SerLys: 2.136 ± 0.515
4.865SerLeu: 4.865 ± 0.719
2.255SerMet: 2.255 ± 0.477
1.899SerAsn: 1.899 ± 0.433
2.611SerPro: 2.611 ± 0.44
1.424SerGln: 1.424 ± 0.332
1.899SerArg: 1.899 ± 0.4
2.967SerSer: 2.967 ± 0.741
4.035SerThr: 4.035 ± 0.772
4.272SerVal: 4.272 ± 0.74
1.187SerTrp: 1.187 ± 0.345
2.848SerTyr: 2.848 ± 0.493
0.0SerXaa: 0.0 ± 0.0
Thr
5.221ThrAla: 5.221 ± 0.693
0.593ThrCys: 0.593 ± 0.242
1.78ThrAsp: 1.78 ± 0.433
2.017ThrGlu: 2.017 ± 0.46
1.187ThrPhe: 1.187 ± 0.36
2.967ThrGly: 2.967 ± 0.708
1.305ThrHis: 1.305 ± 0.391
4.035ThrIle: 4.035 ± 0.584
3.679ThrLys: 3.679 ± 0.637
5.577ThrLeu: 5.577 ± 0.906
1.543ThrMet: 1.543 ± 0.43
2.729ThrAsn: 2.729 ± 0.558
3.441ThrPro: 3.441 ± 0.596
2.136ThrGln: 2.136 ± 0.572
1.661ThrArg: 1.661 ± 0.481
3.679ThrSer: 3.679 ± 0.828
5.221ThrThr: 5.221 ± 1.073
4.984ThrVal: 4.984 ± 0.816
0.593ThrTrp: 0.593 ± 0.303
2.373ThrTyr: 2.373 ± 0.412
0.0ThrXaa: 0.0 ± 0.0
Val
9.019ValAla: 9.019 ± 0.963
0.949ValCys: 0.949 ± 0.378
5.34ValAsp: 5.34 ± 0.666
4.035ValGlu: 4.035 ± 0.62
3.797ValPhe: 3.797 ± 0.642
6.645ValGly: 6.645 ± 0.964
1.424ValHis: 1.424 ± 0.475
5.459ValIle: 5.459 ± 0.907
4.984ValLys: 4.984 ± 0.741
9.493ValLeu: 9.493 ± 1.151
2.017ValMet: 2.017 ± 0.465
4.509ValAsn: 4.509 ± 0.927
3.797ValPro: 3.797 ± 0.575
1.78ValGln: 1.78 ± 0.516
5.34ValArg: 5.34 ± 0.908
5.696ValSer: 5.696 ± 0.903
6.527ValThr: 6.527 ± 0.817
10.561ValVal: 10.561 ± 1.502
2.729ValTrp: 2.729 ± 0.58
4.984ValTyr: 4.984 ± 0.854
0.356ValXaa: 0.356 ± 0.22
Trp
2.136TrpAla: 2.136 ± 0.449
0.356TrpCys: 0.356 ± 0.208
0.475TrpAsp: 0.475 ± 0.225
0.712TrpGlu: 0.712 ± 0.362
0.831TrpPhe: 0.831 ± 0.296
1.661TrpGly: 1.661 ± 0.491
0.475TrpHis: 0.475 ± 0.333
0.356TrpIle: 0.356 ± 0.164
0.475TrpLys: 0.475 ± 0.239
2.017TrpLeu: 2.017 ± 0.499
0.593TrpMet: 0.593 ± 0.366
0.119TrpAsn: 0.119 ± 0.122
0.237TrpPro: 0.237 ± 0.172
0.356TrpGln: 0.356 ± 0.26
1.187TrpArg: 1.187 ± 0.379
0.831TrpSer: 0.831 ± 0.25
0.475TrpThr: 0.475 ± 0.261
1.543TrpVal: 1.543 ± 0.451
0.475TrpTrp: 0.475 ± 0.286
0.831TrpTyr: 0.831 ± 0.271
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.153TyrAla: 4.153 ± 0.812
0.475TyrCys: 0.475 ± 0.212
2.017TyrAsp: 2.017 ± 0.403
3.204TyrGlu: 3.204 ± 0.626
2.255TyrPhe: 2.255 ± 0.527
2.848TyrGly: 2.848 ± 0.726
1.187TyrHis: 1.187 ± 0.379
2.848TyrIle: 2.848 ± 0.641
1.661TyrLys: 1.661 ± 0.572
6.527TyrLeu: 6.527 ± 1.225
1.068TyrMet: 1.068 ± 0.284
2.611TyrAsn: 2.611 ± 0.6
1.899TyrPro: 1.899 ± 0.397
1.187TyrGln: 1.187 ± 0.532
3.085TyrArg: 3.085 ± 0.583
1.78TyrSer: 1.78 ± 0.344
2.848TyrThr: 2.848 ± 0.488
5.815TyrVal: 5.815 ± 0.836
0.712TyrTrp: 0.712 ± 0.272
4.272TyrTyr: 4.272 ± 0.711
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.119XaaAsp: 0.119 ± 0.126
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.119XaaGly: 0.119 ± 0.113
0.0XaaHis: 0.0 ± 0.0
0.119XaaIle: 0.119 ± 0.118
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.119XaaGln: 0.119 ± 0.14
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (8428 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski