Amino acid dipepetide frequency for Chuzan virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.931AlaAla: 5.931 ± 1.43
0.824AlaCys: 0.824 ± 0.458
2.965AlaAsp: 2.965 ± 0.604
4.283AlaGlu: 4.283 ± 0.977
2.306AlaPhe: 2.306 ± 0.759
2.965AlaGly: 2.965 ± 1.231
0.988AlaHis: 0.988 ± 0.471
3.46AlaIle: 3.46 ± 0.692
2.965AlaLys: 2.965 ± 0.634
7.908AlaLeu: 7.908 ± 1.674
2.636AlaMet: 2.636 ± 0.909
2.306AlaAsn: 2.306 ± 0.621
3.954AlaPro: 3.954 ± 1.218
1.483AlaGln: 1.483 ± 0.636
3.295AlaArg: 3.295 ± 0.655
3.624AlaSer: 3.624 ± 0.779
2.636AlaThr: 2.636 ± 0.751
3.295AlaVal: 3.295 ± 0.495
0.988AlaTrp: 0.988 ± 0.231
2.306AlaTyr: 2.306 ± 0.508
0.0AlaXaa: 0.0 ± 0.0
Cys
0.824CysAla: 0.824 ± 0.272
0.659CysCys: 0.659 ± 0.38
1.318CysAsp: 1.318 ± 0.576
0.329CysGlu: 0.329 ± 0.182
0.824CysPhe: 0.824 ± 0.519
1.153CysGly: 1.153 ± 0.431
0.0CysHis: 0.0 ± 0.0
0.659CysIle: 0.659 ± 0.299
0.659CysLys: 0.659 ± 0.479
1.153CysLeu: 1.153 ± 0.337
0.165CysMet: 0.165 ± 0.19
0.165CysAsn: 0.165 ± 0.171
0.165CysPro: 0.165 ± 0.205
0.165CysGln: 0.165 ± 0.171
0.659CysArg: 0.659 ± 0.475
0.824CysSer: 0.824 ± 0.491
0.329CysThr: 0.329 ± 0.197
1.483CysVal: 1.483 ± 0.367
0.0CysTrp: 0.0 ± 0.0
0.659CysTyr: 0.659 ± 0.511
0.0CysXaa: 0.0 ± 0.0
Asp
4.942AspAla: 4.942 ± 0.778
0.329AspCys: 0.329 ± 0.182
2.965AspAsp: 2.965 ± 0.755
5.107AspGlu: 5.107 ± 0.871
3.954AspPhe: 3.954 ± 0.679
5.437AspGly: 5.437 ± 1.477
0.329AspHis: 0.329 ± 0.261
3.46AspIle: 3.46 ± 0.712
2.471AspLys: 2.471 ± 0.839
5.601AspLeu: 5.601 ± 1.081
1.647AspMet: 1.647 ± 0.762
1.812AspAsn: 1.812 ± 0.536
2.636AspPro: 2.636 ± 0.399
2.142AspGln: 2.142 ± 0.542
3.954AspArg: 3.954 ± 0.466
3.789AspSer: 3.789 ± 0.731
2.636AspThr: 2.636 ± 0.777
4.778AspVal: 4.778 ± 0.925
0.824AspTrp: 0.824 ± 0.514
1.977AspTyr: 1.977 ± 0.325
0.0AspXaa: 0.0 ± 0.0
Glu
3.789GluAla: 3.789 ± 0.826
0.494GluCys: 0.494 ± 0.33
4.119GluAsp: 4.119 ± 0.867
5.766GluGlu: 5.766 ± 1.082
3.295GluPhe: 3.295 ± 1.194
3.46GluGly: 3.46 ± 0.868
0.988GluHis: 0.988 ± 0.347
4.778GluIle: 4.778 ± 0.696
4.942GluLys: 4.942 ± 1.037
4.778GluLeu: 4.778 ± 1.147
2.471GluMet: 2.471 ± 0.332
2.471GluAsn: 2.471 ± 0.592
2.801GluPro: 2.801 ± 0.758
3.13GluGln: 3.13 ± 0.678
5.107GluArg: 5.107 ± 0.77
4.778GluSer: 4.778 ± 1.178
3.13GluThr: 3.13 ± 0.746
4.778GluVal: 4.778 ± 1.148
1.977GluTrp: 1.977 ± 0.656
2.636GluTyr: 2.636 ± 0.712
0.0GluXaa: 0.0 ± 0.0
Phe
2.142PheAla: 2.142 ± 0.698
0.329PheCys: 0.329 ± 0.277
3.295PheAsp: 3.295 ± 0.687
2.801PheGlu: 2.801 ± 0.319
2.306PhePhe: 2.306 ± 0.434
2.636PheGly: 2.636 ± 0.766
0.494PheHis: 0.494 ± 0.225
2.801PheIle: 2.801 ± 0.731
1.977PheLys: 1.977 ± 0.367
4.613PheLeu: 4.613 ± 0.562
0.824PheMet: 0.824 ± 0.297
2.142PheAsn: 2.142 ± 0.263
0.659PhePro: 0.659 ± 0.285
1.977PheGln: 1.977 ± 0.647
3.954PheArg: 3.954 ± 0.9
4.613PheSer: 4.613 ± 0.925
1.647PheThr: 1.647 ± 0.329
2.636PheVal: 2.636 ± 1.021
0.329PheTrp: 0.329 ± 0.167
1.977PheTyr: 1.977 ± 0.393
0.0PheXaa: 0.0 ± 0.0
Gly
3.789GlyAla: 3.789 ± 1.258
0.494GlyCys: 0.494 ± 0.243
4.778GlyAsp: 4.778 ± 0.843
4.448GlyGlu: 4.448 ± 1.025
3.295GlyPhe: 3.295 ± 0.517
3.46GlyGly: 3.46 ± 1.722
1.647GlyHis: 1.647 ± 0.611
4.283GlyIle: 4.283 ± 0.913
3.13GlyLys: 3.13 ± 0.8
3.46GlyLeu: 3.46 ± 0.984
2.471GlyMet: 2.471 ± 0.608
2.306GlyAsn: 2.306 ± 0.428
1.812GlyPro: 1.812 ± 0.444
1.153GlyGln: 1.153 ± 0.6
3.624GlyArg: 3.624 ± 0.933
3.13GlySer: 3.13 ± 0.513
2.801GlyThr: 2.801 ± 0.688
5.601GlyVal: 5.601 ± 1.233
0.329GlyTrp: 0.329 ± 0.257
1.318GlyTyr: 1.318 ± 0.376
0.0GlyXaa: 0.0 ± 0.0
His
1.977HisAla: 1.977 ± 0.507
0.165HisCys: 0.165 ± 0.171
1.153HisAsp: 1.153 ± 0.482
0.824HisGlu: 0.824 ± 0.244
0.494HisPhe: 0.494 ± 0.202
1.318HisGly: 1.318 ± 0.323
0.824HisHis: 0.824 ± 0.259
1.318HisIle: 1.318 ± 0.307
1.153HisLys: 1.153 ± 0.364
2.306HisLeu: 2.306 ± 0.553
0.824HisMet: 0.824 ± 0.35
0.659HisAsn: 0.659 ± 0.335
1.318HisPro: 1.318 ± 0.422
0.988HisGln: 0.988 ± 0.38
1.647HisArg: 1.647 ± 0.591
0.494HisSer: 0.494 ± 0.274
0.824HisThr: 0.824 ± 0.296
0.988HisVal: 0.988 ± 0.326
0.659HisTrp: 0.659 ± 0.411
0.329HisTyr: 0.329 ± 0.203
0.0HisXaa: 0.0 ± 0.0
Ile
5.437IleAla: 5.437 ± 1.102
0.494IleCys: 0.494 ± 0.195
3.789IleAsp: 3.789 ± 0.784
5.272IleGlu: 5.272 ± 0.646
2.306IlePhe: 2.306 ± 0.719
3.624IleGly: 3.624 ± 0.73
1.977IleHis: 1.977 ± 0.686
4.778IleIle: 4.778 ± 0.734
4.283IleLys: 4.283 ± 0.828
5.107IleLeu: 5.107 ± 1.007
1.977IleMet: 1.977 ± 0.452
2.801IleAsn: 2.801 ± 0.954
3.624IlePro: 3.624 ± 0.631
3.624IleGln: 3.624 ± 0.705
5.931IleArg: 5.931 ± 0.961
6.919IleSer: 6.919 ± 0.912
4.119IleThr: 4.119 ± 0.953
3.954IleVal: 3.954 ± 0.751
0.988IleTrp: 0.988 ± 0.393
3.789IleTyr: 3.789 ± 0.79
0.0IleXaa: 0.0 ± 0.0
Lys
3.295LysAla: 3.295 ± 1.173
0.659LysCys: 0.659 ± 0.364
3.954LysAsp: 3.954 ± 1.39
4.942LysGlu: 4.942 ± 1.049
1.318LysPhe: 1.318 ± 0.459
2.306LysGly: 2.306 ± 0.562
2.142LysHis: 2.142 ± 0.556
5.272LysIle: 5.272 ± 1.007
5.272LysLys: 5.272 ± 1.056
3.624LysLeu: 3.624 ± 0.867
2.471LysMet: 2.471 ± 0.491
2.142LysAsn: 2.142 ± 0.628
1.977LysPro: 1.977 ± 0.313
1.647LysGln: 1.647 ± 0.765
4.778LysArg: 4.778 ± 1.13
2.142LysSer: 2.142 ± 0.549
2.965LysThr: 2.965 ± 0.976
4.283LysVal: 4.283 ± 0.443
1.153LysTrp: 1.153 ± 0.62
2.965LysTyr: 2.965 ± 0.827
0.0LysXaa: 0.0 ± 0.0
Leu
4.942LeuAla: 4.942 ± 0.752
1.647LeuCys: 1.647 ± 0.308
4.613LeuAsp: 4.613 ± 0.976
4.942LeuGlu: 4.942 ± 0.97
4.283LeuPhe: 4.283 ± 0.996
4.119LeuGly: 4.119 ± 0.69
1.647LeuHis: 1.647 ± 0.725
6.26LeuIle: 6.26 ± 1.072
6.425LeuLys: 6.425 ± 0.712
7.084LeuLeu: 7.084 ± 0.824
3.789LeuMet: 3.789 ± 1.351
4.448LeuAsn: 4.448 ± 1.098
3.624LeuPro: 3.624 ± 0.904
3.295LeuGln: 3.295 ± 1.031
5.766LeuArg: 5.766 ± 0.919
5.601LeuSer: 5.601 ± 0.594
3.954LeuThr: 3.954 ± 0.577
3.295LeuVal: 3.295 ± 0.473
0.824LeuTrp: 0.824 ± 0.341
2.306LeuTyr: 2.306 ± 0.397
0.0LeuXaa: 0.0 ± 0.0
Met
1.483MetAla: 1.483 ± 0.637
0.494MetCys: 0.494 ± 0.211
1.318MetAsp: 1.318 ± 0.441
1.647MetGlu: 1.647 ± 0.527
1.647MetPhe: 1.647 ± 0.322
1.812MetGly: 1.812 ± 0.761
0.824MetHis: 0.824 ± 0.387
4.778MetIle: 4.778 ± 0.916
1.318MetLys: 1.318 ± 0.412
3.624MetLeu: 3.624 ± 0.913
1.812MetMet: 1.812 ± 0.693
2.306MetAsn: 2.306 ± 0.337
0.329MetPro: 0.329 ± 0.208
1.647MetGln: 1.647 ± 0.416
3.295MetArg: 3.295 ± 0.417
2.965MetSer: 2.965 ± 0.716
1.812MetThr: 1.812 ± 0.625
0.988MetVal: 0.988 ± 0.299
0.659MetTrp: 0.659 ± 0.332
1.483MetTyr: 1.483 ± 0.343
0.0MetXaa: 0.0 ± 0.0
Asn
3.295AsnAla: 3.295 ± 0.831
0.494AsnCys: 0.494 ± 0.346
1.977AsnAsp: 1.977 ± 0.519
2.636AsnGlu: 2.636 ± 0.436
1.483AsnPhe: 1.483 ± 0.498
2.306AsnGly: 2.306 ± 0.575
0.988AsnHis: 0.988 ± 0.308
4.119AsnIle: 4.119 ± 0.427
1.647AsnLys: 1.647 ± 0.636
3.13AsnLeu: 3.13 ± 0.687
2.142AsnMet: 2.142 ± 0.591
1.647AsnAsn: 1.647 ± 0.512
1.318AsnPro: 1.318 ± 0.372
1.483AsnGln: 1.483 ± 0.54
2.965AsnArg: 2.965 ± 0.965
1.977AsnSer: 1.977 ± 0.769
1.812AsnThr: 1.812 ± 0.447
3.624AsnVal: 3.624 ± 0.821
0.329AsnTrp: 0.329 ± 0.203
2.306AsnTyr: 2.306 ± 0.382
0.0AsnXaa: 0.0 ± 0.0
Pro
1.483ProAla: 1.483 ± 0.534
0.494ProCys: 0.494 ± 0.266
2.306ProAsp: 2.306 ± 0.594
3.13ProGlu: 3.13 ± 0.807
1.483ProPhe: 1.483 ± 0.589
2.142ProGly: 2.142 ± 0.701
0.824ProHis: 0.824 ± 0.293
3.295ProIle: 3.295 ± 0.559
2.471ProLys: 2.471 ± 0.891
2.965ProLeu: 2.965 ± 0.409
1.153ProMet: 1.153 ± 0.719
1.812ProAsn: 1.812 ± 0.795
1.647ProPro: 1.647 ± 0.483
1.812ProGln: 1.812 ± 0.602
2.306ProArg: 2.306 ± 0.662
1.483ProSer: 1.483 ± 0.596
2.636ProThr: 2.636 ± 0.624
1.977ProVal: 1.977 ± 0.278
0.329ProTrp: 0.329 ± 0.193
2.636ProTyr: 2.636 ± 0.725
0.0ProXaa: 0.0 ± 0.0
Gln
1.647GlnAla: 1.647 ± 0.636
0.0GlnCys: 0.0 ± 0.0
1.483GlnAsp: 1.483 ± 0.496
3.13GlnGlu: 3.13 ± 0.865
2.306GlnPhe: 2.306 ± 0.678
1.812GlnGly: 1.812 ± 0.385
0.494GlnHis: 0.494 ± 0.346
3.624GlnIle: 3.624 ± 0.652
2.142GlnLys: 2.142 ± 0.971
2.801GlnLeu: 2.801 ± 0.678
1.318GlnMet: 1.318 ± 0.535
2.306GlnAsn: 2.306 ± 0.642
1.318GlnPro: 1.318 ± 0.289
1.977GlnGln: 1.977 ± 0.629
3.295GlnArg: 3.295 ± 0.649
2.801GlnSer: 2.801 ± 0.65
2.471GlnThr: 2.471 ± 0.471
1.483GlnVal: 1.483 ± 0.396
0.494GlnTrp: 0.494 ± 0.281
1.318GlnTyr: 1.318 ± 0.324
0.0GlnXaa: 0.0 ± 0.0
Arg
4.778ArgAla: 4.778 ± 1.023
1.153ArgCys: 1.153 ± 0.32
4.119ArgAsp: 4.119 ± 0.631
4.613ArgGlu: 4.613 ± 0.771
4.119ArgPhe: 4.119 ± 0.949
4.778ArgGly: 4.778 ± 0.765
1.318ArgHis: 1.318 ± 0.533
4.283ArgIle: 4.283 ± 1.018
3.789ArgLys: 3.789 ± 0.87
5.437ArgLeu: 5.437 ± 0.843
2.306ArgMet: 2.306 ± 0.679
3.13ArgAsn: 3.13 ± 0.489
1.812ArgPro: 1.812 ± 0.535
2.471ArgGln: 2.471 ± 0.415
4.119ArgArg: 4.119 ± 0.465
3.46ArgSer: 3.46 ± 0.745
4.613ArgThr: 4.613 ± 0.807
6.096ArgVal: 6.096 ± 0.855
0.659ArgTrp: 0.659 ± 0.252
2.306ArgTyr: 2.306 ± 0.588
0.0ArgXaa: 0.0 ± 0.0
Ser
3.295SerAla: 3.295 ± 0.651
0.659SerCys: 0.659 ± 0.32
4.778SerAsp: 4.778 ± 0.944
3.789SerGlu: 3.789 ± 0.427
2.801SerPhe: 2.801 ± 0.664
4.613SerGly: 4.613 ± 0.737
0.824SerHis: 0.824 ± 0.373
4.119SerIle: 4.119 ± 0.789
3.624SerLys: 3.624 ± 0.921
5.766SerLeu: 5.766 ± 0.655
2.306SerMet: 2.306 ± 0.597
2.142SerAsn: 2.142 ± 0.725
2.142SerPro: 2.142 ± 0.37
1.977SerGln: 1.977 ± 0.419
4.613SerArg: 4.613 ± 0.776
3.954SerSer: 3.954 ± 0.748
4.613SerThr: 4.613 ± 0.929
4.778SerVal: 4.778 ± 1.087
0.824SerTrp: 0.824 ± 0.31
3.13SerTyr: 3.13 ± 0.564
0.0SerXaa: 0.0 ± 0.0
Thr
2.142ThrAla: 2.142 ± 0.697
0.494ThrCys: 0.494 ± 0.39
2.801ThrAsp: 2.801 ± 0.439
3.46ThrGlu: 3.46 ± 0.915
1.483ThrPhe: 1.483 ± 0.51
2.306ThrGly: 2.306 ± 0.699
1.812ThrHis: 1.812 ± 0.528
5.107ThrIle: 5.107 ± 0.688
3.13ThrLys: 3.13 ± 0.694
4.942ThrLeu: 4.942 ± 0.931
2.142ThrMet: 2.142 ± 0.395
2.801ThrAsn: 2.801 ± 0.564
2.471ThrPro: 2.471 ± 1.145
2.636ThrGln: 2.636 ± 0.527
2.801ThrArg: 2.801 ± 0.558
3.46ThrSer: 3.46 ± 0.771
2.471ThrThr: 2.471 ± 0.944
2.801ThrVal: 2.801 ± 0.899
0.494ThrTrp: 0.494 ± 0.297
2.471ThrTyr: 2.471 ± 0.611
0.0ThrXaa: 0.0 ± 0.0
Val
3.624ValAla: 3.624 ± 0.809
0.824ValCys: 0.824 ± 0.274
4.613ValAsp: 4.613 ± 0.844
4.778ValGlu: 4.778 ± 0.572
1.483ValPhe: 1.483 ± 0.521
3.295ValGly: 3.295 ± 0.5
1.318ValHis: 1.318 ± 0.376
4.613ValIle: 4.613 ± 0.768
4.119ValLys: 4.119 ± 1.153
5.107ValLeu: 5.107 ± 0.984
2.306ValMet: 2.306 ± 0.624
1.977ValAsn: 1.977 ± 0.681
3.295ValPro: 3.295 ± 0.654
3.13ValGln: 3.13 ± 0.636
4.942ValArg: 4.942 ± 0.474
4.778ValSer: 4.778 ± 0.835
2.965ValThr: 2.965 ± 1.278
3.13ValVal: 3.13 ± 0.461
0.494ValTrp: 0.494 ± 0.281
2.636ValTyr: 2.636 ± 0.587
0.0ValXaa: 0.0 ± 0.0
Trp
0.329TrpAla: 0.329 ± 0.189
0.494TrpCys: 0.494 ± 0.205
0.988TrpAsp: 0.988 ± 0.346
1.318TrpGlu: 1.318 ± 0.33
0.824TrpPhe: 0.824 ± 0.328
0.988TrpGly: 0.988 ± 0.357
0.329TrpHis: 0.329 ± 0.249
1.318TrpIle: 1.318 ± 0.502
1.318TrpLys: 1.318 ± 0.633
0.988TrpLeu: 0.988 ± 0.336
0.329TrpMet: 0.329 ± 0.304
0.824TrpAsn: 0.824 ± 0.377
0.0TrpPro: 0.0 ± 0.0
0.494TrpGln: 0.494 ± 0.275
0.329TrpArg: 0.329 ± 0.168
0.165TrpSer: 0.165 ± 0.157
0.494TrpThr: 0.494 ± 0.243
0.659TrpVal: 0.659 ± 0.224
0.165TrpTrp: 0.165 ± 0.157
0.329TrpTyr: 0.329 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.977TyrAla: 1.977 ± 0.62
0.988TyrCys: 0.988 ± 0.38
3.46TyrAsp: 3.46 ± 0.726
2.471TyrGlu: 2.471 ± 0.535
2.142TyrPhe: 2.142 ± 0.477
2.801TyrGly: 2.801 ± 1.003
0.494TyrHis: 0.494 ± 0.234
2.471TyrIle: 2.471 ± 0.472
2.471TyrLys: 2.471 ± 0.679
2.471TyrLeu: 2.471 ± 0.481
0.988TyrMet: 0.988 ± 0.394
1.483TyrAsn: 1.483 ± 0.387
1.483TyrPro: 1.483 ± 0.579
0.988TyrGln: 0.988 ± 0.543
2.142TyrArg: 2.142 ± 0.556
3.789TyrSer: 3.789 ± 1.54
3.295TyrThr: 3.295 ± 0.648
2.801TyrVal: 2.801 ± 0.799
0.165TyrTrp: 0.165 ± 0.171
1.647TyrTyr: 1.647 ± 0.379
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6071 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski