Amino acid dipepetide frequency for Halovirus HSTV-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.864AlaAla: 9.864 ± 1.099
0.789AlaCys: 0.789 ± 0.331
11.245AlaAsp: 11.245 ± 0.832
9.173AlaGlu: 9.173 ± 1.095
2.861AlaPhe: 2.861 ± 0.478
7.201AlaGly: 7.201 ± 0.922
1.085AlaHis: 1.085 ± 0.265
4.537AlaIle: 4.537 ± 0.687
2.663AlaLys: 2.663 ± 0.469
6.313AlaLeu: 6.313 ± 0.995
1.775AlaMet: 1.775 ± 0.352
3.354AlaAsn: 3.354 ± 0.56
2.959AlaPro: 2.959 ± 0.469
2.663AlaGln: 2.663 ± 0.461
4.241AlaArg: 4.241 ± 0.652
3.847AlaSer: 3.847 ± 0.834
7.792AlaThr: 7.792 ± 1.009
8.582AlaVal: 8.582 ± 1.123
1.677AlaTrp: 1.677 ± 0.415
1.973AlaTyr: 1.973 ± 0.438
0.0AlaXaa: 0.0 ± 0.0
Cys
0.395CysAla: 0.395 ± 0.195
0.0CysCys: 0.0 ± 0.0
1.48CysAsp: 1.48 ± 0.46
1.085CysGlu: 1.085 ± 0.459
0.0CysPhe: 0.0 ± 0.0
0.789CysGly: 0.789 ± 0.246
0.099CysHis: 0.099 ± 0.094
0.395CysIle: 0.395 ± 0.222
0.296CysLys: 0.296 ± 0.146
0.296CysLeu: 0.296 ± 0.149
0.592CysMet: 0.592 ± 0.265
0.296CysAsn: 0.296 ± 0.166
0.493CysPro: 0.493 ± 0.249
0.395CysGln: 0.395 ± 0.258
0.395CysArg: 0.395 ± 0.161
0.099CysSer: 0.099 ± 0.087
0.395CysThr: 0.395 ± 0.171
1.381CysVal: 1.381 ± 0.495
0.296CysTrp: 0.296 ± 0.182
0.197CysTyr: 0.197 ± 0.111
0.0CysXaa: 0.0 ± 0.0
Asp
11.738AspAla: 11.738 ± 0.902
0.888AspCys: 0.888 ± 0.323
7.003AspAsp: 7.003 ± 0.937
7.792AspGlu: 7.792 ± 1.014
2.565AspPhe: 2.565 ± 0.6
9.864AspGly: 9.864 ± 1.01
1.874AspHis: 1.874 ± 0.317
4.735AspIle: 4.735 ± 0.604
1.973AspLys: 1.973 ± 0.496
6.609AspLeu: 6.609 ± 0.79
2.367AspMet: 2.367 ± 0.406
2.762AspAsn: 2.762 ± 0.477
4.241AspPro: 4.241 ± 0.577
0.986AspGln: 0.986 ± 0.308
7.398AspArg: 7.398 ± 1.058
5.129AspSer: 5.129 ± 0.812
7.792AspThr: 7.792 ± 1.323
8.088AspVal: 8.088 ± 1.054
1.381AspTrp: 1.381 ± 0.362
2.959AspTyr: 2.959 ± 0.465
0.0AspXaa: 0.0 ± 0.0
Glu
7.694GluAla: 7.694 ± 1.05
0.986GluCys: 0.986 ± 0.382
6.214GluAsp: 6.214 ± 0.849
4.932GluGlu: 4.932 ± 0.596
3.255GluPhe: 3.255 ± 0.606
4.537GluGly: 4.537 ± 0.896
2.367GluHis: 2.367 ± 0.426
3.551GluIle: 3.551 ± 0.554
2.565GluLys: 2.565 ± 0.595
8.384GluLeu: 8.384 ± 1.081
2.466GluMet: 2.466 ± 0.494
2.663GluAsn: 2.663 ± 0.545
2.762GluPro: 2.762 ± 0.54
4.439GluGln: 4.439 ± 0.802
6.017GluArg: 6.017 ± 0.685
4.735GluSer: 4.735 ± 0.677
6.214GluThr: 6.214 ± 0.908
6.017GluVal: 6.017 ± 0.789
1.184GluTrp: 1.184 ± 0.355
2.269GluTyr: 2.269 ± 0.668
0.0GluXaa: 0.0 ± 0.0
Phe
2.269PheAla: 2.269 ± 0.472
0.197PheCys: 0.197 ± 0.145
3.946PheAsp: 3.946 ± 0.525
2.959PheGlu: 2.959 ± 0.529
0.888PhePhe: 0.888 ± 0.3
2.17PheGly: 2.17 ± 0.474
0.493PheHis: 0.493 ± 0.223
1.085PheIle: 1.085 ± 0.519
1.085PheLys: 1.085 ± 0.358
1.184PheLeu: 1.184 ± 0.517
0.493PheMet: 0.493 ± 0.214
1.282PheAsn: 1.282 ± 0.395
0.888PhePro: 0.888 ± 0.442
0.789PheGln: 0.789 ± 0.232
1.677PheArg: 1.677 ± 0.423
1.578PheSer: 1.578 ± 0.325
1.48PheThr: 1.48 ± 0.313
1.874PheVal: 1.874 ± 0.449
0.197PheTrp: 0.197 ± 0.144
0.197PheTyr: 0.197 ± 0.136
0.0PheXaa: 0.0 ± 0.0
Gly
6.51GlyAla: 6.51 ± 0.914
0.888GlyCys: 0.888 ± 0.271
8.187GlyAsp: 8.187 ± 1.05
7.595GlyGlu: 7.595 ± 0.977
2.466GlyPhe: 2.466 ± 0.499
7.694GlyGly: 7.694 ± 1.754
0.986GlyHis: 0.986 ± 0.315
2.959GlyIle: 2.959 ± 0.657
1.973GlyLys: 1.973 ± 0.443
3.946GlyLeu: 3.946 ± 0.551
1.775GlyMet: 1.775 ± 0.438
4.241GlyAsn: 4.241 ± 0.75
2.861GlyPro: 2.861 ± 0.488
2.071GlyGln: 2.071 ± 0.38
3.156GlyArg: 3.156 ± 0.655
6.412GlySer: 6.412 ± 1.288
6.313GlyThr: 6.313 ± 0.851
4.833GlyVal: 4.833 ± 0.67
1.381GlyTrp: 1.381 ± 0.403
1.677GlyTyr: 1.677 ± 0.447
0.0GlyXaa: 0.0 ± 0.0
His
1.184HisAla: 1.184 ± 0.307
0.099HisCys: 0.099 ± 0.093
1.48HisAsp: 1.48 ± 0.318
0.986HisGlu: 0.986 ± 0.272
0.395HisPhe: 0.395 ± 0.184
1.775HisGly: 1.775 ± 0.419
0.592HisHis: 0.592 ± 0.243
0.69HisIle: 0.69 ± 0.21
0.789HisLys: 0.789 ± 0.268
0.888HisLeu: 0.888 ± 0.312
0.395HisMet: 0.395 ± 0.198
0.592HisAsn: 0.592 ± 0.302
1.282HisPro: 1.282 ± 0.286
0.493HisGln: 0.493 ± 0.209
1.48HisArg: 1.48 ± 0.433
1.381HisSer: 1.381 ± 0.332
1.381HisThr: 1.381 ± 0.478
1.381HisVal: 1.381 ± 0.336
0.197HisTrp: 0.197 ± 0.123
0.789HisTyr: 0.789 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
4.34IleAla: 4.34 ± 0.565
0.099IleCys: 0.099 ± 0.09
4.143IleAsp: 4.143 ± 0.648
3.65IleGlu: 3.65 ± 0.72
0.888IlePhe: 0.888 ± 0.311
4.044IleGly: 4.044 ± 0.829
0.986IleHis: 0.986 ± 0.276
2.17IleIle: 2.17 ± 0.564
1.184IleLys: 1.184 ± 0.326
2.071IleLeu: 2.071 ± 0.411
0.395IleMet: 0.395 ± 0.188
2.269IleAsn: 2.269 ± 0.481
2.269IlePro: 2.269 ± 0.461
2.17IleGln: 2.17 ± 0.531
3.847IleArg: 3.847 ± 0.656
3.058IleSer: 3.058 ± 0.805
3.156IleThr: 3.156 ± 0.753
2.367IleVal: 2.367 ± 0.513
0.493IleTrp: 0.493 ± 0.203
0.986IleTyr: 0.986 ± 0.353
0.0IleXaa: 0.0 ± 0.0
Lys
2.565LysAla: 2.565 ± 0.53
0.296LysCys: 0.296 ± 0.16
2.663LysAsp: 2.663 ± 0.417
2.466LysGlu: 2.466 ± 0.63
0.592LysPhe: 0.592 ± 0.233
1.973LysGly: 1.973 ± 0.358
0.493LysHis: 0.493 ± 0.22
1.48LysIle: 1.48 ± 0.349
0.197LysLys: 0.197 ± 0.127
2.269LysLeu: 2.269 ± 0.457
0.197LysMet: 0.197 ± 0.166
1.184LysAsn: 1.184 ± 0.371
1.677LysPro: 1.677 ± 0.339
1.282LysGln: 1.282 ± 0.443
1.578LysArg: 1.578 ± 0.355
1.874LysSer: 1.874 ± 0.611
1.973LysThr: 1.973 ± 0.432
1.578LysVal: 1.578 ± 0.414
1.282LysTrp: 1.282 ± 0.338
0.592LysTyr: 0.592 ± 0.265
0.0LysXaa: 0.0 ± 0.0
Leu
6.806LeuAla: 6.806 ± 0.837
0.592LeuCys: 0.592 ± 0.255
5.82LeuAsp: 5.82 ± 0.813
4.932LeuGlu: 4.932 ± 0.74
0.888LeuPhe: 0.888 ± 0.332
4.143LeuGly: 4.143 ± 0.656
1.282LeuHis: 1.282 ± 0.37
2.367LeuIle: 2.367 ± 0.521
1.48LeuLys: 1.48 ± 0.477
5.129LeuLeu: 5.129 ± 0.697
1.184LeuMet: 1.184 ± 0.304
2.663LeuAsn: 2.663 ± 0.591
2.959LeuPro: 2.959 ± 0.637
4.044LeuGln: 4.044 ± 0.616
4.439LeuArg: 4.439 ± 0.579
6.412LeuSer: 6.412 ± 0.721
4.833LeuThr: 4.833 ± 0.71
4.143LeuVal: 4.143 ± 0.632
1.381LeuTrp: 1.381 ± 0.368
1.381LeuTyr: 1.381 ± 0.341
0.0LeuXaa: 0.0 ± 0.0
Met
2.269MetAla: 2.269 ± 0.388
0.197MetCys: 0.197 ± 0.148
1.677MetAsp: 1.677 ± 0.425
1.184MetGlu: 1.184 ± 0.378
0.099MetPhe: 0.099 ± 0.093
0.888MetGly: 0.888 ± 0.325
0.493MetHis: 0.493 ± 0.226
0.592MetIle: 0.592 ± 0.284
0.592MetLys: 0.592 ± 0.218
1.677MetLeu: 1.677 ± 0.525
0.592MetMet: 0.592 ± 0.232
0.69MetAsn: 0.69 ± 0.247
1.775MetPro: 1.775 ± 0.371
0.888MetGln: 0.888 ± 0.324
0.592MetArg: 0.592 ± 0.27
2.663MetSer: 2.663 ± 0.461
2.466MetThr: 2.466 ± 0.541
1.282MetVal: 1.282 ± 0.402
0.395MetTrp: 0.395 ± 0.204
0.197MetTyr: 0.197 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
3.156AsnAla: 3.156 ± 0.526
0.296AsnCys: 0.296 ± 0.12
3.255AsnAsp: 3.255 ± 0.477
2.861AsnGlu: 2.861 ± 0.516
0.986AsnPhe: 0.986 ± 0.282
2.663AsnGly: 2.663 ± 0.462
0.493AsnHis: 0.493 ± 0.207
2.071AsnIle: 2.071 ± 0.615
0.69AsnLys: 0.69 ± 0.274
1.874AsnLeu: 1.874 ± 0.421
0.888AsnMet: 0.888 ± 0.231
1.874AsnAsn: 1.874 ± 0.749
2.762AsnPro: 2.762 ± 0.359
1.085AsnGln: 1.085 ± 0.358
2.269AsnArg: 2.269 ± 0.321
2.269AsnSer: 2.269 ± 0.49
2.466AsnThr: 2.466 ± 0.544
2.466AsnVal: 2.466 ± 0.631
0.592AsnTrp: 0.592 ± 0.251
1.381AsnTyr: 1.381 ± 0.471
0.0AsnXaa: 0.0 ± 0.0
Pro
3.452ProAla: 3.452 ± 0.614
0.888ProCys: 0.888 ± 0.258
4.833ProAsp: 4.833 ± 0.74
3.748ProGlu: 3.748 ± 0.612
1.282ProPhe: 1.282 ± 0.323
4.044ProGly: 4.044 ± 0.679
0.986ProHis: 0.986 ± 0.299
1.381ProIle: 1.381 ± 0.449
0.888ProLys: 0.888 ± 0.263
2.861ProLeu: 2.861 ± 0.595
0.888ProMet: 0.888 ± 0.332
0.986ProAsn: 0.986 ± 0.344
1.973ProPro: 1.973 ± 0.384
2.071ProGln: 2.071 ± 0.394
1.973ProArg: 1.973 ± 0.321
2.861ProSer: 2.861 ± 0.457
3.748ProThr: 3.748 ± 0.474
3.058ProVal: 3.058 ± 0.425
0.69ProTrp: 0.69 ± 0.236
1.184ProTyr: 1.184 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
3.452GlnAla: 3.452 ± 0.707
0.0GlnCys: 0.0 ± 0.0
3.255GlnAsp: 3.255 ± 0.716
3.946GlnGlu: 3.946 ± 0.617
1.282GlnPhe: 1.282 ± 0.41
1.48GlnGly: 1.48 ± 0.373
0.296GlnHis: 0.296 ± 0.167
1.973GlnIle: 1.973 ± 0.509
1.184GlnLys: 1.184 ± 0.331
2.071GlnLeu: 2.071 ± 0.491
0.69GlnMet: 0.69 ± 0.253
1.775GlnAsn: 1.775 ± 0.469
1.184GlnPro: 1.184 ± 0.279
2.269GlnGln: 2.269 ± 0.485
3.156GlnArg: 3.156 ± 0.648
2.367GlnSer: 2.367 ± 0.502
1.48GlnThr: 1.48 ± 0.326
2.17GlnVal: 2.17 ± 0.433
0.69GlnTrp: 0.69 ± 0.275
1.381GlnTyr: 1.381 ± 0.284
0.0GlnXaa: 0.0 ± 0.0
Arg
6.116ArgAla: 6.116 ± 0.776
0.888ArgCys: 0.888 ± 0.262
4.537ArgAsp: 4.537 ± 0.549
5.721ArgGlu: 5.721 ± 0.964
1.775ArgPhe: 1.775 ± 0.38
3.65ArgGly: 3.65 ± 0.729
1.184ArgHis: 1.184 ± 0.393
3.748ArgIle: 3.748 ± 0.699
1.973ArgLys: 1.973 ± 0.412
4.932ArgLeu: 4.932 ± 0.559
1.282ArgMet: 1.282 ± 0.391
2.071ArgAsn: 2.071 ± 0.608
1.874ArgPro: 1.874 ± 0.392
3.058ArgGln: 3.058 ± 0.573
4.34ArgArg: 4.34 ± 0.828
2.861ArgSer: 2.861 ± 0.641
3.255ArgThr: 3.255 ± 0.679
5.031ArgVal: 5.031 ± 0.77
1.184ArgTrp: 1.184 ± 0.436
1.282ArgTyr: 1.282 ± 0.348
0.0ArgXaa: 0.0 ± 0.0
Ser
5.129SerAla: 5.129 ± 0.506
0.69SerCys: 0.69 ± 0.259
6.806SerAsp: 6.806 ± 1.01
4.932SerGlu: 4.932 ± 0.76
2.071SerPhe: 2.071 ± 0.573
7.003SerGly: 7.003 ± 0.914
1.578SerHis: 1.578 ± 0.487
3.452SerIle: 3.452 ± 0.591
2.269SerLys: 2.269 ± 0.552
4.143SerLeu: 4.143 ± 0.701
0.789SerMet: 0.789 ± 0.22
1.578SerAsn: 1.578 ± 0.467
3.058SerPro: 3.058 ± 0.55
1.282SerGln: 1.282 ± 0.35
4.044SerArg: 4.044 ± 0.608
3.156SerSer: 3.156 ± 0.746
3.156SerThr: 3.156 ± 0.667
4.932SerVal: 4.932 ± 0.771
1.282SerTrp: 1.282 ± 0.27
1.48SerTyr: 1.48 ± 0.372
0.0SerXaa: 0.0 ± 0.0
Thr
6.412ThrAla: 6.412 ± 0.837
0.395ThrCys: 0.395 ± 0.18
8.779ThrAsp: 8.779 ± 1.192
6.214ThrGlu: 6.214 ± 0.67
2.17ThrPhe: 2.17 ± 0.423
4.735ThrGly: 4.735 ± 0.788
0.986ThrHis: 0.986 ± 0.343
2.959ThrIle: 2.959 ± 0.441
1.775ThrLys: 1.775 ± 0.392
6.609ThrLeu: 6.609 ± 0.958
1.677ThrMet: 1.677 ± 0.455
1.578ThrAsn: 1.578 ± 0.352
3.354ThrPro: 3.354 ± 0.51
1.578ThrGln: 1.578 ± 0.462
3.551ThrArg: 3.551 ± 0.655
3.551ThrSer: 3.551 ± 0.695
6.116ThrThr: 6.116 ± 0.788
7.102ThrVal: 7.102 ± 0.931
1.085ThrTrp: 1.085 ± 0.318
2.367ThrTyr: 2.367 ± 0.563
0.0ThrXaa: 0.0 ± 0.0
Val
6.707ValAla: 6.707 ± 1.184
0.69ValCys: 0.69 ± 0.262
6.905ValAsp: 6.905 ± 0.901
6.412ValGlu: 6.412 ± 1.011
1.775ValPhe: 1.775 ± 0.378
5.622ValGly: 5.622 ± 0.898
1.184ValHis: 1.184 ± 0.323
2.663ValIle: 2.663 ± 0.58
3.058ValLys: 3.058 ± 0.508
3.847ValLeu: 3.847 ± 0.584
1.578ValMet: 1.578 ± 0.37
2.663ValAsn: 2.663 ± 0.548
3.452ValPro: 3.452 ± 0.679
3.058ValGln: 3.058 ± 0.456
4.241ValArg: 4.241 ± 0.663
6.214ValSer: 6.214 ± 0.854
5.622ValThr: 5.622 ± 1.085
5.228ValVal: 5.228 ± 0.843
0.888ValTrp: 0.888 ± 0.308
2.466ValTyr: 2.466 ± 0.547
0.0ValXaa: 0.0 ± 0.0
Trp
1.48TrpAla: 1.48 ± 0.437
0.099TrpCys: 0.099 ± 0.099
1.973TrpAsp: 1.973 ± 0.432
1.085TrpGlu: 1.085 ± 0.352
0.197TrpPhe: 0.197 ± 0.136
0.986TrpGly: 0.986 ± 0.278
0.493TrpHis: 0.493 ± 0.219
0.69TrpIle: 0.69 ± 0.228
0.592TrpLys: 0.592 ± 0.242
1.184TrpLeu: 1.184 ± 0.33
0.493TrpMet: 0.493 ± 0.208
0.592TrpAsn: 0.592 ± 0.221
0.789TrpPro: 0.789 ± 0.282
0.69TrpGln: 0.69 ± 0.241
1.184TrpArg: 1.184 ± 0.407
0.888TrpSer: 0.888 ± 0.267
1.973TrpThr: 1.973 ± 0.384
0.789TrpVal: 0.789 ± 0.313
0.296TrpTrp: 0.296 ± 0.189
0.493TrpTyr: 0.493 ± 0.25
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.058TyrAla: 3.058 ± 0.514
0.395TyrCys: 0.395 ± 0.206
3.946TyrAsp: 3.946 ± 0.793
1.677TyrGlu: 1.677 ± 0.454
0.296TyrPhe: 0.296 ± 0.215
2.663TyrGly: 2.663 ± 0.421
0.296TyrHis: 0.296 ± 0.142
1.085TyrIle: 1.085 ± 0.395
0.986TyrLys: 0.986 ± 0.254
0.592TyrLeu: 0.592 ± 0.191
0.592TyrMet: 0.592 ± 0.215
1.282TyrAsn: 1.282 ± 0.314
1.184TyrPro: 1.184 ± 0.312
0.789TyrGln: 0.789 ± 0.328
1.184TyrArg: 1.184 ± 0.358
1.48TyrSer: 1.48 ± 0.375
1.381TyrThr: 1.381 ± 0.364
1.973TyrVal: 1.973 ± 0.51
0.395TyrTrp: 0.395 ± 0.179
1.48TyrTyr: 1.48 ± 0.457
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (10139 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski