Amino acid dipepetide frequency for Hubei virga-like virus 16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.84AlaAla: 4.84 ± 1.291
0.726AlaCys: 0.726 ± 0.396
3.146AlaAsp: 3.146 ± 0.792
2.662AlaGlu: 2.662 ± 1.147
2.662AlaPhe: 2.662 ± 1.934
3.872AlaGly: 3.872 ± 1.266
0.968AlaHis: 0.968 ± 0.436
3.63AlaIle: 3.63 ± 1.369
2.178AlaLys: 2.178 ± 0.798
6.776AlaLeu: 6.776 ± 1.388
1.694AlaMet: 1.694 ± 0.655
2.662AlaAsn: 2.662 ± 0.896
1.694AlaPro: 1.694 ± 0.89
1.694AlaGln: 1.694 ± 0.994
3.146AlaArg: 3.146 ± 1.324
5.324AlaSer: 5.324 ± 2.272
2.662AlaThr: 2.662 ± 1.091
6.05AlaVal: 6.05 ± 1.485
0.0AlaTrp: 0.0 ± 0.0
2.904AlaTyr: 2.904 ± 0.891
0.0AlaXaa: 0.0 ± 0.0
Cys
1.452CysAla: 1.452 ± 0.767
0.242CysCys: 0.242 ± 0.132
1.21CysAsp: 1.21 ± 0.661
1.694CysGlu: 1.694 ± 0.925
0.726CysPhe: 0.726 ± 0.437
0.968CysGly: 0.968 ± 0.529
0.242CysHis: 0.242 ± 0.132
0.968CysIle: 0.968 ± 1.333
1.21CysLys: 1.21 ± 0.661
1.452CysLeu: 1.452 ± 1.543
0.484CysMet: 0.484 ± 0.264
0.484CysAsn: 0.484 ± 0.264
1.452CysPro: 1.452 ± 0.541
0.968CysGln: 0.968 ± 0.436
0.968CysArg: 0.968 ± 0.436
1.21CysSer: 1.21 ± 0.473
0.484CysThr: 0.484 ± 0.446
1.936CysVal: 1.936 ± 0.732
0.484CysTrp: 0.484 ± 0.446
1.452CysTyr: 1.452 ± 0.558
0.0CysXaa: 0.0 ± 0.0
Asp
2.42AspAla: 2.42 ± 1.049
0.968AspCys: 0.968 ± 0.529
3.388AspAsp: 3.388 ± 1.048
2.42AspGlu: 2.42 ± 0.495
3.872AspPhe: 3.872 ± 1.187
3.872AspGly: 3.872 ± 1.007
1.21AspHis: 1.21 ± 0.661
1.936AspIle: 1.936 ± 1.318
2.42AspLys: 2.42 ± 1.019
6.534AspLeu: 6.534 ± 2.198
0.726AspMet: 0.726 ± 0.871
1.452AspAsn: 1.452 ± 0.638
2.904AspPro: 2.904 ± 1.253
0.242AspGln: 0.242 ± 0.132
3.63AspArg: 3.63 ± 0.931
4.598AspSer: 4.598 ± 2.155
3.63AspThr: 3.63 ± 1.475
5.808AspVal: 5.808 ± 2.701
0.726AspTrp: 0.726 ± 0.396
3.63AspTyr: 3.63 ± 1.388
0.0AspXaa: 0.0 ± 0.0
Glu
2.662GluAla: 2.662 ± 1.077
0.726GluCys: 0.726 ± 0.396
2.178GluAsp: 2.178 ± 0.852
3.146GluGlu: 3.146 ± 1.167
3.146GluPhe: 3.146 ± 2.527
1.936GluGly: 1.936 ± 0.504
1.452GluHis: 1.452 ± 0.539
2.662GluIle: 2.662 ± 0.789
3.146GluLys: 3.146 ± 1.079
7.26GluLeu: 7.26 ± 1.99
1.452GluMet: 1.452 ± 0.793
2.662GluAsn: 2.662 ± 0.895
1.21GluPro: 1.21 ± 0.473
1.694GluGln: 1.694 ± 0.502
4.598GluArg: 4.598 ± 0.846
4.598GluSer: 4.598 ± 1.196
2.904GluThr: 2.904 ± 0.898
4.114GluVal: 4.114 ± 2.247
0.242GluTrp: 0.242 ± 0.132
2.662GluTyr: 2.662 ± 1.147
0.0GluXaa: 0.0 ± 0.0
Phe
4.84PheAla: 4.84 ± 0.664
1.452PheCys: 1.452 ± 1.036
4.114PheAsp: 4.114 ± 1.319
2.662PheGlu: 2.662 ± 0.516
1.694PhePhe: 1.694 ± 1.228
3.146PheGly: 3.146 ± 1.253
1.694PheHis: 1.694 ± 0.63
2.42PheIle: 2.42 ± 2.727
3.63PheLys: 3.63 ± 1.654
6.776PheLeu: 6.776 ± 6.945
0.726PheMet: 0.726 ± 0.437
2.904PheAsn: 2.904 ± 1.911
2.42PhePro: 2.42 ± 0.634
1.694PheGln: 1.694 ± 0.727
2.42PheArg: 2.42 ± 0.719
3.388PheSer: 3.388 ± 2.961
3.146PheThr: 3.146 ± 0.963
6.05PheVal: 6.05 ± 2.57
0.968PheTrp: 0.968 ± 0.436
1.694PheTyr: 1.694 ± 0.502
0.0PheXaa: 0.0 ± 0.0
Gly
2.662GlyAla: 2.662 ± 0.996
1.452GlyCys: 1.452 ± 0.875
3.872GlyAsp: 3.872 ± 1.769
4.356GlyGlu: 4.356 ± 1.624
4.114GlyPhe: 4.114 ± 1.001
3.872GlyGly: 3.872 ± 0.796
0.726GlyHis: 0.726 ± 0.396
1.452GlyIle: 1.452 ± 1.581
3.63GlyLys: 3.63 ± 1.177
3.872GlyLeu: 3.872 ± 1.357
0.484GlyMet: 0.484 ± 0.264
1.936GlyAsn: 1.936 ± 1.057
1.936GlyPro: 1.936 ± 0.556
1.452GlyGln: 1.452 ± 0.961
2.662GlyArg: 2.662 ± 1.296
2.178GlySer: 2.178 ± 0.956
3.146GlyThr: 3.146 ± 0.795
6.534GlyVal: 6.534 ± 1.675
0.242GlyTrp: 0.242 ± 0.132
2.904GlyTyr: 2.904 ± 1.2
0.0GlyXaa: 0.0 ± 0.0
His
1.452HisAla: 1.452 ± 0.874
0.726HisCys: 0.726 ± 0.396
1.936HisAsp: 1.936 ± 0.88
1.452HisGlu: 1.452 ± 0.793
1.21HisPhe: 1.21 ± 0.54
0.726HisGly: 0.726 ± 0.396
0.484HisHis: 0.484 ± 0.264
1.21HisIle: 1.21 ± 0.473
0.726HisLys: 0.726 ± 0.411
1.936HisLeu: 1.936 ± 0.732
0.484HisMet: 0.484 ± 0.264
0.484HisAsn: 0.484 ± 0.446
0.726HisPro: 0.726 ± 0.437
0.484HisGln: 0.484 ± 0.477
1.694HisArg: 1.694 ± 1.207
1.21HisSer: 1.21 ± 0.464
0.726HisThr: 0.726 ± 0.396
1.936HisVal: 1.936 ± 1.057
0.0HisTrp: 0.0 ± 0.0
0.968HisTyr: 0.968 ± 0.418
0.0HisXaa: 0.0 ± 0.0
Ile
2.904IleAla: 2.904 ± 0.875
0.726IleCys: 0.726 ± 1.397
2.904IleAsp: 2.904 ± 1.263
3.63IleGlu: 3.63 ± 0.793
4.598IlePhe: 4.598 ± 2.072
3.146IleGly: 3.146 ± 0.586
0.726IleHis: 0.726 ± 0.618
1.936IleIle: 1.936 ± 0.835
1.694IleLys: 1.694 ± 0.63
3.388IleLeu: 3.388 ± 1.709
0.968IleMet: 0.968 ± 0.656
1.21IleAsn: 1.21 ± 1.503
2.42IlePro: 2.42 ± 0.958
0.968IleGln: 0.968 ± 1.356
2.662IleArg: 2.662 ± 1.008
3.63IleSer: 3.63 ± 2.335
3.146IleThr: 3.146 ± 1.632
6.534IleVal: 6.534 ± 1.769
0.242IleTrp: 0.242 ± 0.132
1.936IleTyr: 1.936 ± 0.556
0.0IleXaa: 0.0 ± 0.0
Lys
2.662LysAla: 2.662 ± 1.425
0.242LysCys: 0.242 ± 0.132
1.936LysAsp: 1.936 ± 1.057
4.114LysGlu: 4.114 ± 0.67
3.146LysPhe: 3.146 ± 0.386
3.63LysGly: 3.63 ± 1.222
1.452LysHis: 1.452 ± 0.539
2.42LysIle: 2.42 ± 1.322
3.146LysLys: 3.146 ± 1.223
5.324LysLeu: 5.324 ± 1.582
0.726LysMet: 0.726 ± 0.419
2.42LysAsn: 2.42 ± 1.525
4.598LysPro: 4.598 ± 1.565
1.21LysGln: 1.21 ± 0.661
4.114LysArg: 4.114 ± 0.783
2.42LysSer: 2.42 ± 0.766
1.936LysThr: 1.936 ± 1.207
3.63LysVal: 3.63 ± 0.774
0.242LysTrp: 0.242 ± 0.132
1.694LysTyr: 1.694 ± 0.627
0.0LysXaa: 0.0 ± 0.0
Leu
5.566LeuAla: 5.566 ± 1.072
1.936LeuCys: 1.936 ± 0.635
6.05LeuAsp: 6.05 ± 1.4
6.534LeuGlu: 6.534 ± 1.702
5.808LeuPhe: 5.808 ± 4.788
5.082LeuGly: 5.082 ± 2.011
1.452LeuHis: 1.452 ± 0.802
6.05LeuIle: 6.05 ± 5.523
5.808LeuLys: 5.808 ± 1.439
10.165LeuLeu: 10.165 ± 3.709
1.452LeuMet: 1.452 ± 0.587
2.662LeuAsn: 2.662 ± 1.09
5.808LeuPro: 5.808 ± 0.926
3.388LeuGln: 3.388 ± 2.457
7.26LeuArg: 7.26 ± 1.449
8.228LeuSer: 8.228 ± 2.625
7.744LeuThr: 7.744 ± 1.415
9.197LeuVal: 9.197 ± 2.037
0.242LeuTrp: 0.242 ± 0.132
3.388LeuTyr: 3.388 ± 1.222
0.0LeuXaa: 0.0 ± 0.0
Met
0.968MetAla: 0.968 ± 0.832
0.484MetCys: 0.484 ± 0.264
0.484MetAsp: 0.484 ± 0.264
0.484MetGlu: 0.484 ± 0.458
0.968MetPhe: 0.968 ± 0.596
0.726MetGly: 0.726 ± 0.437
0.484MetHis: 0.484 ± 0.264
0.484MetIle: 0.484 ± 0.264
1.452MetLys: 1.452 ± 0.793
2.178MetLeu: 2.178 ± 0.956
0.484MetMet: 0.484 ± 0.264
1.452MetAsn: 1.452 ± 0.93
0.484MetPro: 0.484 ± 0.446
0.968MetGln: 0.968 ± 0.455
0.484MetArg: 0.484 ± 0.264
1.452MetSer: 1.452 ± 1.536
1.936MetThr: 1.936 ± 1.057
2.904MetVal: 2.904 ± 1.365
0.0MetTrp: 0.0 ± 0.0
0.726MetTyr: 0.726 ± 0.396
0.0MetXaa: 0.0 ± 0.0
Asn
1.21AsnAla: 1.21 ± 0.763
0.726AsnCys: 0.726 ± 0.396
0.968AsnAsp: 0.968 ± 0.455
1.694AsnGlu: 1.694 ± 0.783
2.904AsnPhe: 2.904 ± 0.825
1.21AsnGly: 1.21 ± 0.661
0.968AsnHis: 0.968 ± 0.436
2.662AsnIle: 2.662 ± 1.034
1.694AsnLys: 1.694 ± 0.856
3.872AsnLeu: 3.872 ± 1.884
0.484AsnMet: 0.484 ± 0.458
0.726AsnAsn: 0.726 ± 0.396
1.936AsnPro: 1.936 ± 1.457
1.452AsnGln: 1.452 ± 0.831
2.662AsnArg: 2.662 ± 0.805
3.388AsnSer: 3.388 ± 1.442
3.388AsnThr: 3.388 ± 1.049
2.178AsnVal: 2.178 ± 0.985
0.242AsnTrp: 0.242 ± 0.513
2.178AsnTyr: 2.178 ± 1.312
0.0AsnXaa: 0.0 ± 0.0
Pro
3.146ProAla: 3.146 ± 1.153
0.726ProCys: 0.726 ± 0.437
3.63ProAsp: 3.63 ± 0.977
1.452ProGlu: 1.452 ± 0.541
0.968ProPhe: 0.968 ± 0.737
2.662ProGly: 2.662 ± 1.008
0.0ProHis: 0.0 ± 0.0
2.662ProIle: 2.662 ± 0.727
2.178ProLys: 2.178 ± 0.985
3.872ProLeu: 3.872 ± 0.828
0.968ProMet: 0.968 ± 0.384
0.726ProAsn: 0.726 ± 0.618
1.694ProPro: 1.694 ± 1.207
1.936ProGln: 1.936 ± 0.504
1.21ProArg: 1.21 ± 0.464
3.872ProSer: 3.872 ± 1.825
3.146ProThr: 3.146 ± 1.391
5.566ProVal: 5.566 ± 1.616
0.0ProTrp: 0.0 ± 0.0
3.388ProTyr: 3.388 ± 0.765
0.0ProXaa: 0.0 ± 0.0
Gln
1.21GlnAla: 1.21 ± 0.763
0.968GlnCys: 0.968 ± 0.418
0.484GlnAsp: 0.484 ± 0.264
1.694GlnGlu: 1.694 ± 1.943
1.452GlnPhe: 1.452 ± 0.802
1.21GlnGly: 1.21 ± 0.464
1.21GlnHis: 1.21 ± 0.473
2.42GlnIle: 2.42 ± 1.592
2.178GlnLys: 2.178 ± 0.509
4.114GlnLeu: 4.114 ± 0.881
0.484GlnMet: 0.484 ± 0.264
1.694GlnAsn: 1.694 ± 1.181
0.968GlnPro: 0.968 ± 0.613
1.452GlnGln: 1.452 ± 1.161
1.452GlnArg: 1.452 ± 0.539
0.968GlnSer: 0.968 ± 0.455
1.694GlnThr: 1.694 ± 0.944
2.178GlnVal: 2.178 ± 0.649
0.0GlnTrp: 0.0 ± 0.0
1.936GlnTyr: 1.936 ± 0.648
0.0GlnXaa: 0.0 ± 0.0
Arg
3.872ArgAla: 3.872 ± 2.354
0.968ArgCys: 0.968 ± 0.436
3.872ArgAsp: 3.872 ± 1.343
1.694ArgGlu: 1.694 ± 0.696
3.872ArgPhe: 3.872 ± 1.149
2.178ArgGly: 2.178 ± 0.842
1.21ArgHis: 1.21 ± 0.464
3.63ArgIle: 3.63 ± 0.745
1.936ArgLys: 1.936 ± 0.777
6.292ArgLeu: 6.292 ± 1.902
1.21ArgMet: 1.21 ± 0.464
3.388ArgAsn: 3.388 ± 0.752
1.694ArgPro: 1.694 ± 0.562
2.42ArgGln: 2.42 ± 0.97
2.904ArgArg: 2.904 ± 0.597
4.84ArgSer: 4.84 ± 1.309
3.388ArgThr: 3.388 ± 1.26
5.082ArgVal: 5.082 ± 1.114
0.484ArgTrp: 0.484 ± 0.264
3.146ArgTyr: 3.146 ± 1.044
0.0ArgXaa: 0.0 ± 0.0
Ser
4.84SerAla: 4.84 ± 0.962
1.452SerCys: 1.452 ± 0.823
4.598SerAsp: 4.598 ± 1.213
5.566SerGlu: 5.566 ± 1.264
4.356SerPhe: 4.356 ± 2.516
4.598SerGly: 4.598 ± 1.299
0.726SerHis: 0.726 ± 0.437
2.178SerIle: 2.178 ± 1.866
2.904SerLys: 2.904 ± 2.184
7.986SerLeu: 7.986 ± 3.866
1.936SerMet: 1.936 ± 1.489
2.662SerAsn: 2.662 ± 0.516
3.146SerPro: 3.146 ± 1.112
2.178SerGln: 2.178 ± 1.056
3.63SerArg: 3.63 ± 1.594
5.082SerSer: 5.082 ± 2.004
6.292SerThr: 6.292 ± 0.671
6.05SerVal: 6.05 ± 1.721
0.484SerTrp: 0.484 ± 0.264
3.388SerTyr: 3.388 ± 0.703
0.0SerXaa: 0.0 ± 0.0
Thr
4.114ThrAla: 4.114 ± 1.102
1.694ThrCys: 1.694 ± 0.925
4.598ThrAsp: 4.598 ± 1.073
3.388ThrGlu: 3.388 ± 1.221
2.42ThrPhe: 2.42 ± 0.825
4.356ThrGly: 4.356 ± 1.348
1.21ThrHis: 1.21 ± 0.885
2.904ThrIle: 2.904 ± 0.993
3.388ThrLys: 3.388 ± 1.095
5.808ThrLeu: 5.808 ± 1.355
1.21ThrMet: 1.21 ± 0.661
1.694ThrAsn: 1.694 ± 0.633
1.694ThrPro: 1.694 ± 0.627
1.694ThrGln: 1.694 ± 0.818
4.356ThrArg: 4.356 ± 0.75
4.84ThrSer: 4.84 ± 1.652
4.598ThrThr: 4.598 ± 1.965
5.808ThrVal: 5.808 ± 2.488
0.484ThrTrp: 0.484 ± 0.264
2.42ThrTyr: 2.42 ± 0.634
0.0ThrXaa: 0.0 ± 0.0
Val
5.808ValAla: 5.808 ± 1.289
2.904ValCys: 2.904 ± 0.567
4.114ValAsp: 4.114 ± 1.351
4.114ValGlu: 4.114 ± 1.06
5.324ValPhe: 5.324 ± 3.776
4.114ValGly: 4.114 ± 1.204
2.904ValHis: 2.904 ± 1.586
4.84ValIle: 4.84 ± 1.17
5.566ValLys: 5.566 ± 1.55
12.343ValLeu: 12.343 ± 2.005
1.694ValMet: 1.694 ± 0.928
2.662ValAsn: 2.662 ± 0.917
4.84ValPro: 4.84 ± 1.762
1.21ValGln: 1.21 ± 0.508
4.356ValArg: 4.356 ± 2.379
7.744ValSer: 7.744 ± 1.973
5.808ValThr: 5.808 ± 1.324
10.649ValVal: 10.649 ± 2.742
0.726ValTrp: 0.726 ± 0.871
4.598ValTyr: 4.598 ± 2.332
0.0ValXaa: 0.0 ± 0.0
Trp
0.242TrpAla: 0.242 ± 0.132
0.242TrpCys: 0.242 ± 0.132
0.242TrpAsp: 0.242 ± 0.132
0.726TrpGlu: 0.726 ± 0.396
0.726TrpPhe: 0.726 ± 0.411
0.242TrpGly: 0.242 ± 0.132
0.0TrpHis: 0.0 ± 0.0
0.726TrpIle: 0.726 ± 0.618
0.242TrpLys: 0.242 ± 0.132
0.484TrpLeu: 0.484 ± 0.446
0.242TrpMet: 0.242 ± 0.132
0.0TrpAsn: 0.0 ± 0.0
0.242TrpPro: 0.242 ± 0.513
0.0TrpGln: 0.0 ± 0.0
0.484TrpArg: 0.484 ± 0.264
0.726TrpSer: 0.726 ± 0.437
0.242TrpThr: 0.242 ± 0.132
0.242TrpVal: 0.242 ± 0.132
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.42TyrAla: 2.42 ± 0.634
0.726TyrCys: 0.726 ± 0.396
2.662TyrAsp: 2.662 ± 1.454
0.968TyrGlu: 0.968 ± 0.596
3.872TyrPhe: 3.872 ± 2.136
1.936TyrGly: 1.936 ± 0.534
1.452TyrHis: 1.452 ± 0.657
2.178TyrIle: 2.178 ± 0.902
1.936TyrLys: 1.936 ± 0.658
3.388TyrLeu: 3.388 ± 1.487
1.452TyrMet: 1.452 ± 0.711
2.662TyrAsn: 2.662 ± 1.503
1.936TyrPro: 1.936 ± 1.34
2.662TyrGln: 2.662 ± 0.516
3.388TyrArg: 3.388 ± 1.216
4.356TyrSer: 4.356 ± 1.129
2.662TyrThr: 2.662 ± 0.516
4.114TyrVal: 4.114 ± 1.245
0.242TyrTrp: 0.242 ± 0.132
1.694TyrTyr: 1.694 ± 0.502
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4133 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski