Amino acid dipepetide frequency for Hainan hebius popei torovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.933AlaAla: 1.933 ± 0.628
0.967AlaCys: 0.967 ± 0.253
1.074AlaAsp: 1.074 ± 0.383
0.537AlaGlu: 0.537 ± 0.316
1.718AlaPhe: 1.718 ± 0.517
1.181AlaGly: 1.181 ± 0.397
0.537AlaHis: 0.537 ± 0.75
2.041AlaIle: 2.041 ± 0.589
2.363AlaLys: 2.363 ± 0.703
3.329AlaLeu: 3.329 ± 0.613
0.967AlaMet: 0.967 ± 0.795
1.396AlaAsn: 1.396 ± 0.426
1.074AlaPro: 1.074 ± 0.865
1.504AlaGln: 1.504 ± 0.582
0.967AlaArg: 0.967 ± 0.377
2.578AlaSer: 2.578 ± 0.926
1.933AlaThr: 1.933 ± 0.777
3.437AlaVal: 3.437 ± 1.554
0.215AlaTrp: 0.215 ± 0.111
2.363AlaTyr: 2.363 ± 0.525
0.0AlaXaa: 0.0 ± 0.0
Cys
0.644CysAla: 0.644 ± 0.209
1.933CysCys: 1.933 ± 0.62
3.222CysAsp: 3.222 ± 0.74
1.826CysGlu: 1.826 ± 0.727
1.504CysPhe: 1.504 ± 0.566
1.718CysGly: 1.718 ± 0.678
0.537CysHis: 0.537 ± 0.277
2.685CysIle: 2.685 ± 0.779
2.792CysLys: 2.792 ± 0.946
2.47CysLeu: 2.47 ± 0.992
1.074CysMet: 1.074 ± 0.59
3.759CysAsn: 3.759 ± 0.836
1.074CysPro: 1.074 ± 0.607
0.537CysGln: 0.537 ± 0.277
0.537CysArg: 0.537 ± 0.833
3.652CysSer: 3.652 ± 0.713
0.859CysThr: 0.859 ± 0.276
4.189CysVal: 4.189 ± 0.768
0.322CysTrp: 0.322 ± 0.166
2.685CysTyr: 2.685 ± 0.769
0.0CysXaa: 0.0 ± 0.0
Asp
1.826AspAla: 1.826 ± 1.348
3.652AspCys: 3.652 ± 1.176
3.974AspAsp: 3.974 ± 1.03
3.007AspGlu: 3.007 ± 1.302
3.866AspPhe: 3.866 ± 1.065
3.437AspGly: 3.437 ± 0.62
0.752AspHis: 0.752 ± 0.387
4.296AspIle: 4.296 ± 1.084
5.477AspLys: 5.477 ± 1.206
6.014AspLeu: 6.014 ± 1.401
1.504AspMet: 1.504 ± 0.477
4.833AspAsn: 4.833 ± 1.751
1.074AspPro: 1.074 ± 0.365
0.537AspGln: 0.537 ± 0.192
1.826AspArg: 1.826 ± 0.605
4.833AspSer: 4.833 ± 1.45
0.967AspThr: 0.967 ± 0.452
9.236AspVal: 9.236 ± 1.004
0.537AspTrp: 0.537 ± 0.697
5.907AspTyr: 5.907 ± 0.637
0.0AspXaa: 0.0 ± 0.0
Glu
1.826GluAla: 1.826 ± 0.356
1.181GluCys: 1.181 ± 0.508
2.148GluAsp: 2.148 ± 0.329
2.792GluGlu: 2.792 ± 0.975
2.363GluPhe: 2.363 ± 0.788
2.792GluGly: 2.792 ± 0.74
0.752GluHis: 0.752 ± 0.332
2.47GluIle: 2.47 ± 0.789
2.9GluLys: 2.9 ± 0.454
3.544GluLeu: 3.544 ± 1.145
0.43GluMet: 0.43 ± 0.189
2.041GluAsn: 2.041 ± 0.652
0.537GluPro: 0.537 ± 0.277
0.644GluGln: 0.644 ± 0.209
1.396GluArg: 1.396 ± 0.536
3.222GluSer: 3.222 ± 0.669
1.504GluThr: 1.504 ± 0.353
5.585GluVal: 5.585 ± 1.5
0.322GluTrp: 0.322 ± 0.338
2.363GluTyr: 2.363 ± 1.202
0.107GluXaa: 0.107 ± 0.055
Phe
1.289PheAla: 1.289 ± 0.493
1.826PheCys: 1.826 ± 0.563
4.511PheAsp: 4.511 ± 0.923
2.363PheGlu: 2.363 ± 1.202
2.041PhePhe: 2.041 ± 0.906
4.511PheGly: 4.511 ± 2.0
0.322PheHis: 0.322 ± 0.166
3.437PheIle: 3.437 ± 1.175
5.048PheLys: 5.048 ± 0.71
3.866PheLeu: 3.866 ± 1.553
1.826PheMet: 1.826 ± 0.878
6.122PheAsn: 6.122 ± 0.842
1.504PhePro: 1.504 ± 0.764
1.611PheGln: 1.611 ± 1.012
2.363PheArg: 2.363 ± 1.342
5.37PheSer: 5.37 ± 1.792
2.255PheThr: 2.255 ± 0.52
6.337PheVal: 6.337 ± 1.212
0.537PheTrp: 0.537 ± 0.412
3.759PheTyr: 3.759 ± 0.982
0.0PheXaa: 0.0 ± 0.0
Gly
0.644GlyAla: 0.644 ± 0.411
3.007GlyCys: 3.007 ± 0.877
4.403GlyAsp: 4.403 ± 0.564
2.363GlyGlu: 2.363 ± 0.941
3.007GlyPhe: 3.007 ± 0.754
3.437GlyGly: 3.437 ± 0.838
1.289GlyHis: 1.289 ± 0.493
4.081GlyIle: 4.081 ± 0.925
4.296GlyLys: 4.296 ± 1.028
4.726GlyLeu: 4.726 ± 0.784
0.644GlyMet: 0.644 ± 0.332
4.081GlyAsn: 4.081 ± 0.726
1.074GlyPro: 1.074 ± 0.553
0.967GlyGln: 0.967 ± 0.39
2.792GlyArg: 2.792 ± 1.404
4.833GlySer: 4.833 ± 1.471
1.396GlyThr: 1.396 ± 0.719
7.625GlyVal: 7.625 ± 0.95
0.644GlyTrp: 0.644 ± 0.982
3.759GlyTyr: 3.759 ± 1.478
0.0GlyXaa: 0.0 ± 0.0
His
0.537HisAla: 0.537 ± 0.354
0.107HisCys: 0.107 ± 0.055
0.967HisAsp: 0.967 ± 0.498
0.859HisGlu: 0.859 ± 0.353
1.396HisPhe: 1.396 ± 0.514
1.074HisGly: 1.074 ± 0.477
0.322HisHis: 0.322 ± 0.166
0.43HisIle: 0.43 ± 0.221
0.967HisLys: 0.967 ± 0.805
1.289HisLeu: 1.289 ± 0.418
0.43HisMet: 0.43 ± 0.424
1.181HisAsn: 1.181 ± 0.53
0.107HisPro: 0.107 ± 0.055
0.644HisGln: 0.644 ± 0.319
0.322HisArg: 0.322 ± 0.166
0.752HisSer: 0.752 ± 0.387
0.752HisThr: 0.752 ± 0.418
1.396HisVal: 1.396 ± 0.833
0.215HisTrp: 0.215 ± 0.111
0.967HisTyr: 0.967 ± 0.412
0.0HisXaa: 0.0 ± 0.0
Ile
1.933IleAla: 1.933 ± 0.587
1.289IleCys: 1.289 ± 0.493
3.866IleAsp: 3.866 ± 0.759
3.115IleGlu: 3.115 ± 0.754
4.189IlePhe: 4.189 ± 0.929
3.974IleGly: 3.974 ± 1.079
0.752IleHis: 0.752 ± 0.387
3.222IleIle: 3.222 ± 2.926
4.726IleLys: 4.726 ± 1.609
4.94IleLeu: 4.94 ± 1.773
1.181IleMet: 1.181 ± 0.693
3.866IleAsn: 3.866 ± 1.847
2.255IlePro: 2.255 ± 0.809
1.181IleGln: 1.181 ± 1.418
2.041IleArg: 2.041 ± 0.524
3.115IleSer: 3.115 ± 1.352
1.396IleThr: 1.396 ± 1.019
7.625IleVal: 7.625 ± 1.658
0.322IleTrp: 0.322 ± 0.435
3.115IleTyr: 3.115 ± 0.573
0.0IleXaa: 0.0 ± 0.0
Lys
4.511LysAla: 4.511 ± 0.677
3.115LysCys: 3.115 ± 0.618
3.544LysAsp: 3.544 ± 0.863
2.041LysGlu: 2.041 ± 0.422
5.585LysPhe: 5.585 ± 0.76
2.792LysGly: 2.792 ± 0.913
1.181LysHis: 1.181 ± 0.413
3.007LysIle: 3.007 ± 1.078
2.792LysLys: 2.792 ± 0.868
8.27LysLeu: 8.27 ± 2.527
1.396LysMet: 1.396 ± 0.77
3.974LysAsn: 3.974 ± 0.557
2.9LysPro: 2.9 ± 0.876
1.826LysGln: 1.826 ± 0.547
2.9LysArg: 2.9 ± 0.595
4.081LysSer: 4.081 ± 0.875
2.685LysThr: 2.685 ± 0.662
6.659LysVal: 6.659 ± 1.234
0.537LysTrp: 0.537 ± 0.572
5.048LysTyr: 5.048 ± 0.789
0.0LysXaa: 0.0 ± 0.0
Leu
4.081LeuAla: 4.081 ± 0.631
3.974LeuCys: 3.974 ± 0.595
6.551LeuAsp: 6.551 ± 0.968
3.544LeuGlu: 3.544 ± 0.688
6.659LeuPhe: 6.659 ± 1.493
4.726LeuGly: 4.726 ± 1.343
1.504LeuHis: 1.504 ± 0.507
5.155LeuIle: 5.155 ± 4.24
7.411LeuLys: 7.411 ± 1.512
9.129LeuLeu: 9.129 ± 3.333
2.685LeuMet: 2.685 ± 2.737
4.833LeuAsn: 4.833 ± 0.937
2.148LeuPro: 2.148 ± 0.847
3.007LeuGln: 3.007 ± 1.146
3.115LeuArg: 3.115 ± 1.096
5.692LeuSer: 5.692 ± 0.989
3.974LeuThr: 3.974 ± 0.979
9.988LeuVal: 9.988 ± 6.969
1.074LeuTrp: 1.074 ± 0.383
5.263LeuTyr: 5.263 ± 0.866
0.0LeuXaa: 0.0 ± 0.0
Met
0.859MetAla: 0.859 ± 0.621
0.967MetCys: 0.967 ± 0.657
0.644MetAsp: 0.644 ± 0.209
0.43MetGlu: 0.43 ± 0.42
2.47MetPhe: 2.47 ± 0.794
1.396MetGly: 1.396 ± 0.414
0.215MetHis: 0.215 ± 0.111
1.074MetIle: 1.074 ± 0.252
0.644MetLys: 0.644 ± 0.637
3.437MetLeu: 3.437 ± 3.163
0.752MetMet: 0.752 ± 0.387
0.537MetAsn: 0.537 ± 0.277
0.967MetPro: 0.967 ± 0.319
0.752MetGln: 0.752 ± 0.635
0.644MetArg: 0.644 ± 0.637
1.289MetSer: 1.289 ± 1.196
1.181MetThr: 1.181 ± 0.388
2.363MetVal: 2.363 ± 1.227
0.0MetTrp: 0.0 ± 0.0
1.718MetTyr: 1.718 ± 0.705
0.0MetXaa: 0.0 ± 0.0
Asn
1.826AsnAla: 1.826 ± 0.841
2.363AsnCys: 2.363 ± 0.735
3.115AsnAsp: 3.115 ± 1.048
3.115AsnGlu: 3.115 ± 0.44
5.692AsnPhe: 5.692 ± 1.085
4.189AsnGly: 4.189 ± 1.076
0.644AsnHis: 0.644 ± 0.405
5.585AsnIle: 5.585 ± 1.69
4.511AsnLys: 4.511 ± 0.981
5.155AsnLeu: 5.155 ± 1.316
0.967AsnMet: 0.967 ± 0.498
5.477AsnAsn: 5.477 ± 1.228
0.967AsnPro: 0.967 ± 0.537
1.074AsnGln: 1.074 ± 0.252
2.578AsnArg: 2.578 ± 0.795
5.263AsnSer: 5.263 ± 0.875
1.611AsnThr: 1.611 ± 1.029
9.236AsnVal: 9.236 ± 1.961
0.537AsnTrp: 0.537 ± 0.277
4.403AsnTyr: 4.403 ± 1.654
0.0AsnXaa: 0.0 ± 0.0
Pro
0.752ProAla: 0.752 ± 0.418
0.43ProCys: 0.43 ± 0.221
1.074ProAsp: 1.074 ± 0.589
0.752ProGlu: 0.752 ± 0.454
1.181ProPhe: 1.181 ± 0.703
0.752ProGly: 0.752 ± 0.387
0.644ProHis: 0.644 ± 0.676
1.504ProIle: 1.504 ± 0.443
2.578ProLys: 2.578 ± 1.15
2.148ProLeu: 2.148 ± 0.65
0.644ProMet: 0.644 ± 0.411
1.611ProAsn: 1.611 ± 0.4
0.967ProPro: 0.967 ± 0.519
0.752ProGln: 0.752 ± 0.934
0.43ProArg: 0.43 ± 0.453
2.792ProSer: 2.792 ± 0.589
0.752ProThr: 0.752 ± 0.388
2.9ProVal: 2.9 ± 0.516
0.322ProTrp: 0.322 ± 0.202
1.933ProTyr: 1.933 ± 0.505
0.0ProXaa: 0.0 ± 0.0
Gln
1.289GlnAla: 1.289 ± 0.664
1.074GlnCys: 1.074 ± 0.589
1.074GlnAsp: 1.074 ± 0.476
1.933GlnGlu: 1.933 ± 0.505
2.363GlnPhe: 2.363 ± 0.827
1.611GlnGly: 1.611 ± 0.575
0.322GlnHis: 0.322 ± 0.659
1.074GlnIle: 1.074 ± 1.387
0.43GlnLys: 0.43 ± 0.221
2.792GlnLeu: 2.792 ± 1.178
0.537GlnMet: 0.537 ± 0.277
1.074GlnAsn: 1.074 ± 0.553
0.752GlnPro: 0.752 ± 0.418
0.644GlnGln: 0.644 ± 0.332
1.074GlnArg: 1.074 ± 0.252
1.181GlnSer: 1.181 ± 0.609
1.074GlnThr: 1.074 ± 0.252
2.685GlnVal: 2.685 ± 0.817
0.322GlnTrp: 0.322 ± 0.166
1.504GlnTyr: 1.504 ± 1.109
0.0GlnXaa: 0.0 ± 0.0
Arg
1.181ArgAla: 1.181 ± 0.693
1.826ArgCys: 1.826 ± 1.027
1.504ArgAsp: 1.504 ± 0.795
0.859ArgGlu: 0.859 ± 0.378
1.718ArgPhe: 1.718 ± 1.302
2.255ArgGly: 2.255 ± 0.391
0.859ArgHis: 0.859 ± 0.467
2.041ArgIle: 2.041 ± 0.657
1.718ArgLys: 1.718 ± 0.333
4.833ArgLeu: 4.833 ± 2.762
0.322ArgMet: 0.322 ± 0.202
2.041ArgAsn: 2.041 ± 0.611
1.718ArgPro: 1.718 ± 0.672
1.611ArgGln: 1.611 ± 0.483
2.148ArgArg: 2.148 ± 2.602
2.255ArgSer: 2.255 ± 1.151
0.967ArgThr: 0.967 ± 0.452
2.685ArgVal: 2.685 ± 0.982
0.322ArgTrp: 0.322 ± 0.202
1.826ArgTyr: 1.826 ± 0.467
0.0ArgXaa: 0.0 ± 0.0
Ser
1.933SerAla: 1.933 ± 0.996
2.578SerCys: 2.578 ± 0.572
6.337SerAsp: 6.337 ± 1.159
2.041SerGlu: 2.041 ± 0.801
4.833SerPhe: 4.833 ± 0.971
5.048SerGly: 5.048 ± 1.356
1.074SerHis: 1.074 ± 0.511
3.329SerIle: 3.329 ± 0.85
4.726SerLys: 4.726 ± 0.931
7.733SerLeu: 7.733 ± 1.191
1.396SerMet: 1.396 ± 0.426
5.155SerAsn: 5.155 ± 1.148
1.074SerPro: 1.074 ± 1.195
1.718SerGln: 1.718 ± 0.672
2.792SerArg: 2.792 ± 1.189
6.659SerSer: 6.659 ± 1.154
2.148SerThr: 2.148 ± 1.688
7.411SerVal: 7.411 ± 1.11
0.644SerTrp: 0.644 ± 0.209
3.759SerTyr: 3.759 ± 0.836
0.0SerXaa: 0.0 ± 0.0
Thr
1.289ThrAla: 1.289 ± 0.574
0.967ThrCys: 0.967 ± 0.498
2.041ThrAsp: 2.041 ± 0.832
1.933ThrGlu: 1.933 ± 0.638
1.504ThrPhe: 1.504 ± 0.664
2.255ThrGly: 2.255 ± 0.935
0.43ThrHis: 0.43 ± 0.221
2.255ThrIle: 2.255 ± 1.248
2.255ThrLys: 2.255 ± 0.475
2.685ThrLeu: 2.685 ± 1.563
0.859ThrMet: 0.859 ± 0.276
2.363ThrAsn: 2.363 ± 0.836
1.181ThrPro: 1.181 ± 0.451
0.752ThrGln: 0.752 ± 0.491
1.504ThrArg: 1.504 ± 0.911
1.611ThrSer: 1.611 ± 0.619
1.718ThrThr: 1.718 ± 0.333
2.685ThrVal: 2.685 ± 0.769
0.43ThrTrp: 0.43 ± 0.604
2.47ThrTyr: 2.47 ± 0.534
0.0ThrXaa: 0.0 ± 0.0
Val
2.578ValAla: 2.578 ± 0.413
3.544ValCys: 3.544 ± 0.733
11.277ValAsp: 11.277 ± 1.617
5.048ValGlu: 5.048 ± 1.119
5.263ValPhe: 5.263 ± 2.276
6.551ValGly: 6.551 ± 2.198
1.504ValHis: 1.504 ± 0.446
6.229ValIle: 6.229 ± 3.282
7.84ValLys: 7.84 ± 1.859
12.244ValLeu: 12.244 ± 3.1
2.47ValMet: 2.47 ± 2.942
7.948ValAsn: 7.948 ± 1.17
2.148ValPro: 2.148 ± 0.329
3.437ValGln: 3.437 ± 0.948
3.115ValArg: 3.115 ± 0.666
7.196ValSer: 7.196 ± 1.815
3.866ValThr: 3.866 ± 1.079
12.351ValVal: 12.351 ± 2.972
0.644ValTrp: 0.644 ± 0.332
6.122ValTyr: 6.122 ± 0.465
0.0ValXaa: 0.0 ± 0.0
Trp
0.215TrpAla: 0.215 ± 0.229
0.644TrpCys: 0.644 ± 0.332
0.537TrpAsp: 0.537 ± 0.192
0.107TrpGlu: 0.107 ± 0.055
0.322TrpPhe: 0.322 ± 0.166
0.322TrpGly: 0.322 ± 0.435
0.107TrpHis: 0.107 ± 0.055
0.537TrpIle: 0.537 ± 0.277
0.215TrpLys: 0.215 ± 0.111
1.289TrpLeu: 1.289 ± 0.418
0.215TrpMet: 0.215 ± 0.111
0.215TrpAsn: 0.215 ± 0.229
0.43TrpPro: 0.43 ± 0.42
0.537TrpGln: 0.537 ± 0.316
0.644TrpArg: 0.644 ± 0.535
0.967TrpSer: 0.967 ± 0.839
0.537TrpThr: 0.537 ± 0.89
0.322TrpVal: 0.322 ± 0.751
0.107TrpTrp: 0.107 ± 0.055
0.43TrpTyr: 0.43 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.074TyrAla: 1.074 ± 0.632
2.578TyrCys: 2.578 ± 0.61
5.692TyrAsp: 5.692 ± 1.793
2.255TyrGlu: 2.255 ± 0.475
3.222TyrPhe: 3.222 ± 1.014
5.048TyrGly: 5.048 ± 1.741
0.967TyrHis: 0.967 ± 0.607
3.652TyrIle: 3.652 ± 0.758
5.048TyrLys: 5.048 ± 1.446
4.618TyrLeu: 4.618 ± 0.714
1.933TyrMet: 1.933 ± 0.573
5.8TyrAsn: 5.8 ± 1.761
0.967TyrPro: 0.967 ± 0.452
1.074TyrGln: 1.074 ± 0.365
1.504TyrArg: 1.504 ± 0.764
5.048TyrSer: 5.048 ± 1.059
1.611TyrThr: 1.611 ± 0.318
6.659TyrVal: 6.659 ± 1.838
0.644TyrTrp: 0.644 ± 0.543
5.048TyrTyr: 5.048 ± 1.317
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.107XaaLys: 0.107 ± 0.055
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (9312 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski