Amino acid dipeptide frequency for Hyperthermophilic Archaeal Virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
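The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build the table: the function names `dipeptide_permille` and `representation` are hypothetical, dipeptides are assumed to be overlapping residue pairs within each protein, and frequencies are normalized by the total number of pairs, which may differ from the normalization used by the database.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"      # 20 standard residues (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400       # uniform expectation: 1/400 = 2.5 per mille


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are overlapping pairs inside each protein; pairs are
    never formed across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        # Anything outside the 20 standard residues is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def representation(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

For example, `representation(10.465)` labels the AlaVal value from the table as overrepresented, while `representation(0.156)` labels the CysMet value as underrepresented.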

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
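As a small worked example of the unit conversion, the following Python sketch converts a per mille value to a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical and exist only for illustration.

```python
UNIFORM_PERMILLE = 1000.0 / 400   # 1/400 of all dipeptides, expressed in per mille


def permille_to_fraction(value):
    """Per mille -> fraction: multiply by 10^-3."""
    return value * 1e-3


def permille_to_percent(value):
    """Per mille -> percent: divide by 10."""
    return value / 10.0


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PERMILLE


ala_val = 10.465                           # AlaVal value taken from the table
fraction = permille_to_fraction(ala_val)   # about 0.010465
percent = permille_to_percent(ala_val)     # about 1.0465 %
fold = fold_vs_uniform(ala_val)            # about 4.186 times the uniform expectation
```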

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.217 ± 0.967
AlaCys: 0.781 ± 0.361
AlaAsp: 4.061 ± 1.024
AlaGlu: 5.779 ± 1.219
AlaPhe: 3.593 ± 0.607
AlaGly: 3.28 ± 0.703
AlaHis: 0.469 ± 0.263
AlaIle: 5.623 ± 1.203
AlaLys: 4.061 ± 0.871
AlaLeu: 7.185 ± 0.857
AlaMet: 1.25 ± 0.37
AlaAsn: 2.187 ± 0.57
AlaPro: 2.343 ± 0.689
AlaGln: 1.093 ± 0.466
AlaArg: 2.968 ± 0.581
AlaSer: 6.092 ± 1.385
AlaThr: 2.968 ± 0.969
AlaVal: 10.465 ± 1.62
AlaTrp: 1.718 ± 0.572
AlaTyr: 4.998 ± 0.937
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.25 ± 0.443
CysCys: 0.312 ± 0.203
CysAsp: 0.469 ± 0.251
CysGlu: 1.25 ± 0.372
CysPhe: 0.781 ± 0.348
CysGly: 1.25 ± 0.413
CysHis: 0.0 ± 0.0
CysIle: 2.031 ± 0.585
CysLys: 0.312 ± 0.197
CysLeu: 1.874 ± 0.599
CysMet: 0.156 ± 0.157
CysAsn: 0.625 ± 0.292
CysPro: 0.625 ± 0.295
CysGln: 0.781 ± 0.352
CysArg: 1.562 ± 0.519
CysSer: 0.625 ± 0.385
CysThr: 1.718 ± 0.49
CysVal: 1.874 ± 0.549
CysTrp: 0.0 ± 0.0
CysTyr: 0.937 ± 0.401
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.031 ± 0.581
AspCys: 0.469 ± 0.236
AspAsp: 1.718 ± 0.494
AspGlu: 2.499 ± 0.719
AspPhe: 2.187 ± 0.517
AspGly: 3.124 ± 0.832
AspHis: 0.312 ± 0.202
AspIle: 4.374 ± 1.098
AspLys: 3.436 ± 0.729
AspLeu: 3.749 ± 0.91
AspMet: 1.25 ± 0.443
AspAsn: 1.406 ± 0.446
AspPro: 2.812 ± 0.873
AspGln: 1.562 ± 0.39
AspArg: 2.187 ± 0.609
AspSer: 4.53 ± 1.371
AspThr: 1.874 ± 0.549
AspVal: 5.311 ± 1.197
AspTrp: 0.469 ± 0.256
AspTyr: 5.311 ± 1.857
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.124 ± 0.625
GluCys: 1.718 ± 0.636
GluAsp: 2.812 ± 0.6
GluGlu: 5.623 ± 1.598
GluPhe: 2.499 ± 0.594
GluGly: 3.905 ± 1.025
GluHis: 0.781 ± 0.334
GluIle: 3.749 ± 1.073
GluLys: 4.998 ± 1.377
GluLeu: 6.56 ± 1.002
GluMet: 1.093 ± 0.397
GluAsn: 2.031 ± 0.584
GluPro: 2.499 ± 0.774
GluGln: 1.874 ± 0.528
GluArg: 3.749 ± 1.039
GluSer: 2.187 ± 0.432
GluThr: 1.874 ± 0.489
GluVal: 5.467 ± 1.354
GluTrp: 0.469 ± 0.232
GluTyr: 3.905 ± 0.74
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.593 ± 0.776
PheCys: 0.625 ± 0.316
PheAsp: 2.031 ± 0.539
PheGlu: 2.812 ± 0.645
PhePhe: 3.28 ± 0.649
PheGly: 1.562 ± 0.432
PheHis: 0.469 ± 0.244
PheIle: 3.436 ± 0.809
PheLys: 1.874 ± 0.568
PheLeu: 3.749 ± 0.65
PheMet: 0.937 ± 0.342
PheAsn: 1.406 ± 0.453
PhePro: 1.874 ± 0.575
PheGln: 1.718 ± 0.655
PheArg: 3.436 ± 0.64
PheSer: 3.124 ± 0.871
PheThr: 2.187 ± 0.479
PheVal: 1.406 ± 0.473
PheTrp: 0.781 ± 0.353
PheTyr: 1.562 ± 0.523
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.812 ± 0.577
GlyCys: 0.781 ± 0.324
GlyAsp: 2.655 ± 0.711
GlyGlu: 2.968 ± 0.981
GlyPhe: 2.968 ± 0.759
GlyGly: 3.749 ± 0.897
GlyHis: 0.312 ± 0.213
GlyIle: 5.155 ± 0.88
GlyLys: 5.779 ± 1.267
GlyLeu: 3.436 ± 0.825
GlyMet: 1.874 ± 0.537
GlyAsn: 1.25 ± 0.491
GlyPro: 0.625 ± 0.306
GlyGln: 1.25 ± 0.517
GlyArg: 2.812 ± 0.714
GlySer: 3.749 ± 0.649
GlyThr: 4.061 ± 0.995
GlyVal: 2.968 ± 0.597
GlyTrp: 0.781 ± 0.418
GlyTyr: 2.812 ± 0.535
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.562 ± 0.451
HisCys: 0.156 ± 0.171
HisAsp: 0.469 ± 0.319
HisGlu: 0.625 ± 0.34
HisPhe: 0.625 ± 0.326
HisGly: 0.312 ± 0.203
HisHis: 0.156 ± 0.162
HisIle: 1.406 ± 0.523
HisLys: 1.25 ± 0.479
HisLeu: 0.781 ± 0.365
HisMet: 0.156 ± 0.157
HisAsn: 0.625 ± 0.283
HisPro: 0.312 ± 0.218
HisGln: 0.0 ± 0.0
HisArg: 0.781 ± 0.303
HisSer: 0.937 ± 0.318
HisThr: 0.469 ± 0.292
HisVal: 0.625 ± 0.29
HisTrp: 0.156 ± 0.143
HisTyr: 0.625 ± 0.297
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.029 ± 1.066
IleCys: 0.937 ± 0.314
IleAsp: 3.593 ± 0.782
IleGlu: 4.686 ± 0.986
IlePhe: 4.061 ± 0.705
IleGly: 3.905 ± 0.969
IleHis: 1.406 ± 0.485
IleIle: 5.467 ± 0.786
IleLys: 4.53 ± 1.003
IleLeu: 5.936 ± 1.051
IleMet: 2.187 ± 0.578
IleAsn: 2.031 ± 0.557
IlePro: 2.343 ± 0.604
IleGln: 1.562 ± 0.462
IleArg: 4.374 ± 0.88
IleSer: 5.467 ± 1.504
IleThr: 4.53 ± 1.013
IleVal: 5.779 ± 1.015
IleTrp: 1.562 ± 0.453
IleTyr: 3.436 ± 0.819
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.56 ± 0.951
LysCys: 0.937 ± 0.325
LysAsp: 3.436 ± 0.907
LysGlu: 4.061 ± 0.877
LysPhe: 1.25 ± 0.36
LysGly: 2.812 ± 0.481
LysHis: 0.781 ± 0.294
LysIle: 4.374 ± 0.926
LysLys: 4.53 ± 1.073
LysLeu: 5.936 ± 0.884
LysMet: 1.25 ± 0.361
LysAsn: 2.031 ± 0.56
LysPro: 1.562 ± 0.518
LysGln: 1.874 ± 0.534
LysArg: 2.655 ± 0.922
LysSer: 3.28 ± 0.76
LysThr: 2.968 ± 0.687
LysVal: 6.717 ± 1.122
LysTrp: 1.562 ± 0.492
LysTyr: 4.53 ± 0.947
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.717 ± 0.891
LeuCys: 1.25 ± 0.537
LeuAsp: 4.842 ± 1.0
LeuGlu: 5.467 ± 0.988
LeuPhe: 4.686 ± 0.874
LeuGly: 5.467 ± 0.892
LeuHis: 1.406 ± 0.449
LeuIle: 4.842 ± 0.854
LeuLys: 4.842 ± 1.005
LeuLeu: 6.248 ± 1.25
LeuMet: 2.812 ± 0.71
LeuAsn: 3.28 ± 0.739
LeuPro: 4.53 ± 0.715
LeuGln: 0.937 ± 0.374
LeuArg: 5.779 ± 0.641
LeuSer: 5.311 ± 0.844
LeuThr: 5.311 ± 0.861
LeuVal: 4.217 ± 0.81
LeuTrp: 1.093 ± 0.496
LeuTyr: 4.217 ± 1.113
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.25 ± 0.482
MetCys: 0.781 ± 0.293
MetAsp: 0.781 ± 0.345
MetGlu: 0.781 ± 0.258
MetPhe: 1.093 ± 0.443
MetGly: 1.093 ± 0.447
MetHis: 0.0 ± 0.0
MetIle: 1.406 ± 0.597
MetLys: 1.874 ± 0.469
MetLeu: 2.968 ± 0.685
MetMet: 0.937 ± 0.362
MetAsn: 1.25 ± 0.389
MetPro: 2.187 ± 0.531
MetGln: 0.469 ± 0.312
MetArg: 1.562 ± 0.545
MetSer: 2.031 ± 0.603
MetThr: 1.093 ± 0.406
MetVal: 1.874 ± 0.484
MetTrp: 0.312 ± 0.203
MetTyr: 1.25 ± 0.454
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.436 ± 0.814
AsnCys: 0.156 ± 0.149
AsnAsp: 1.406 ± 0.456
AsnGlu: 1.874 ± 0.598
AsnPhe: 0.625 ± 0.296
AsnGly: 2.187 ± 0.631
AsnHis: 0.0 ± 0.0
AsnIle: 3.28 ± 0.684
AsnLys: 1.25 ± 0.481
AsnLeu: 1.874 ± 0.457
AsnMet: 0.937 ± 0.422
AsnAsn: 0.781 ± 0.35
AsnPro: 2.343 ± 0.503
AsnGln: 1.093 ± 0.488
AsnArg: 2.187 ± 0.693
AsnSer: 1.874 ± 0.48
AsnThr: 3.124 ± 0.763
AsnVal: 2.812 ± 0.645
AsnTrp: 0.937 ± 0.346
AsnTyr: 1.562 ± 0.534
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.343 ± 0.647
ProCys: 0.781 ± 0.305
ProAsp: 3.593 ± 0.742
ProGlu: 3.593 ± 0.763
ProPhe: 2.187 ± 0.54
ProGly: 2.031 ± 0.543
ProHis: 0.469 ± 0.346
ProIle: 4.53 ± 0.768
ProLys: 1.718 ± 0.453
ProLeu: 3.593 ± 0.775
ProMet: 1.562 ± 0.576
ProAsn: 1.406 ± 0.417
ProPro: 3.436 ± 1.354
ProGln: 0.937 ± 0.482
ProArg: 1.562 ± 0.638
ProSer: 2.812 ± 0.542
ProThr: 1.718 ± 0.58
ProVal: 4.374 ± 0.882
ProTrp: 0.625 ± 0.298
ProTyr: 2.031 ± 0.529
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.874 ± 0.635
GlnCys: 1.25 ± 0.382
GlnAsp: 0.937 ± 0.413
GlnGlu: 0.625 ± 0.365
GlnPhe: 1.25 ± 0.395
GlnGly: 1.25 ± 0.485
GlnHis: 0.469 ± 0.265
GlnIle: 1.406 ± 0.485
GlnLys: 2.187 ± 0.605
GlnLeu: 2.031 ± 0.523
GlnMet: 0.937 ± 0.356
GlnAsn: 0.937 ± 0.3
GlnPro: 1.406 ± 0.509
GlnGln: 1.093 ± 0.436
GlnArg: 1.093 ± 0.496
GlnSer: 0.312 ± 0.197
GlnThr: 1.25 ± 0.346
GlnVal: 1.25 ± 0.411
GlnTrp: 0.625 ± 0.288
GlnTyr: 1.718 ± 0.571
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.124 ± 0.606
ArgCys: 1.562 ± 0.638
ArgAsp: 1.718 ± 0.503
ArgGlu: 3.436 ± 0.696
ArgPhe: 2.031 ± 0.665
ArgGly: 2.968 ± 0.764
ArgHis: 1.25 ± 0.408
ArgIle: 4.53 ± 0.849
ArgLys: 4.061 ± 1.0
ArgLeu: 4.998 ± 1.017
ArgMet: 1.406 ± 0.404
ArgAsn: 3.436 ± 0.811
ArgPro: 2.499 ± 0.638
ArgGln: 1.406 ± 0.394
ArgArg: 5.623 ± 1.429
ArgSer: 2.655 ± 1.14
ArgThr: 2.968 ± 0.666
ArgVal: 3.593 ± 0.667
ArgTrp: 1.562 ± 0.504
ArgTyr: 4.686 ± 0.841
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.717 ± 1.674
SerCys: 1.093 ± 0.455
SerAsp: 3.749 ± 1.614
SerGlu: 2.655 ± 0.636
SerPhe: 2.031 ± 0.484
SerGly: 4.686 ± 0.935
SerHis: 0.937 ± 0.355
SerIle: 5.155 ± 1.179
SerLys: 2.343 ± 0.684
SerLeu: 5.936 ± 1.567
SerMet: 1.562 ± 0.539
SerAsn: 2.655 ± 0.658
SerPro: 3.28 ± 0.577
SerGln: 1.25 ± 0.407
SerArg: 4.53 ± 1.28
SerSer: 7.81 ± 2.842
SerThr: 3.749 ± 0.679
SerVal: 5.467 ± 1.308
SerTrp: 0.781 ± 0.357
SerTyr: 3.124 ± 1.011
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.28 ± 0.807
ThrCys: 1.406 ± 0.351
ThrAsp: 2.499 ± 0.585
ThrGlu: 2.968 ± 0.774
ThrPhe: 1.093 ± 0.379
ThrGly: 1.093 ± 0.401
ThrHis: 0.469 ± 0.255
ThrIle: 4.686 ± 0.834
ThrLys: 3.28 ± 0.804
ThrLeu: 4.217 ± 0.829
ThrMet: 0.781 ± 0.306
ThrAsn: 1.093 ± 0.362
ThrPro: 3.436 ± 0.849
ThrGln: 0.937 ± 0.337
ThrArg: 3.124 ± 0.964
ThrSer: 5.467 ± 1.315
ThrThr: 3.436 ± 0.902
ThrVal: 5.311 ± 0.958
ThrTrp: 0.469 ± 0.241
ThrTyr: 2.187 ± 0.555
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.185 ± 1.258
ValCys: 1.406 ± 0.468
ValAsp: 5.623 ± 0.821
ValGlu: 5.155 ± 1.148
ValPhe: 3.124 ± 0.614
ValGly: 4.061 ± 0.979
ValHis: 1.562 ± 0.528
ValIle: 3.124 ± 0.735
ValLys: 7.966 ± 1.18
ValLeu: 7.185 ± 1.123
ValMet: 2.031 ± 0.685
ValAsn: 2.499 ± 0.827
ValPro: 4.217 ± 0.864
ValGln: 2.343 ± 0.622
ValArg: 4.374 ± 0.813
ValSer: 6.56 ± 2.201
ValThr: 2.655 ± 0.475
ValVal: 8.435 ± 1.086
ValTrp: 1.093 ± 0.357
ValTyr: 4.686 ± 0.913
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.25 ± 0.523
TrpCys: 0.781 ± 0.319
TrpAsp: 0.781 ± 0.35
TrpGlu: 0.781 ± 0.419
TrpPhe: 0.937 ± 0.312
TrpGly: 0.937 ± 0.511
TrpHis: 0.156 ± 0.153
TrpIle: 1.25 ± 0.364
TrpLys: 0.937 ± 0.416
TrpLeu: 1.25 ± 0.446
TrpMet: 0.781 ± 0.331
TrpAsn: 0.937 ± 0.385
TrpPro: 0.469 ± 0.256
TrpGln: 0.781 ± 0.366
TrpArg: 1.25 ± 0.431
TrpSer: 0.469 ± 0.326
TrpThr: 0.312 ± 0.226
TrpVal: 0.937 ± 0.37
TrpTrp: 0.469 ± 0.262
TrpTyr: 0.625 ± 0.315
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.936 ± 0.972
TyrCys: 1.25 ± 0.418
TyrAsp: 3.436 ± 0.792
TyrGlu: 3.28 ± 0.701
TyrPhe: 1.562 ± 0.575
TyrGly: 3.124 ± 0.581
TyrHis: 0.625 ± 0.303
TyrIle: 4.998 ± 0.674
TyrLys: 1.874 ± 0.597
TyrLeu: 3.749 ± 0.668
TyrMet: 0.937 ± 0.378
TyrAsn: 1.874 ± 0.723
TyrPro: 2.499 ± 0.672
TyrGln: 0.937 ± 0.396
TyrArg: 3.905 ± 0.643
TyrSer: 4.217 ± 0.955
TyrThr: 2.968 ± 0.796
TyrVal: 6.404 ± 0.818
TyrTrp: 0.625 ± 0.29
TyrTyr: 3.28 ± 1.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (6403 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
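A protein-level bootstrap of this kind could look roughly like the following Python sketch. It is not the database's own code: the function name `bootstrap_dipeptide_error` is hypothetical, and it assumes that each replicate resamples whole proteins with replacement, that the frequency is recomputed per replicate, and that the reported error is the standard deviation across replicates.

```python
import random
import statistics
from collections import Counter


def bootstrap_dipeptide_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: whole proteins are resampled with replacement, so the spread
    reflects protein-to-protein variation. n_boot must be at least 2.
    """
    rng = random.Random(seed)
    # Count overlapping residue pairs once per protein, then resample the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in sequences
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Under these assumptions, a dipeptide that never occurs in any protein gives a mean and an error of exactly zero, consistent with the `0.0 ± 0.0` entries in the table.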

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski