Amino acid dipepetide frequency for Hubei diptera virus 10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.438AlaAla: 2.438 ± 0.8
1.463AlaCys: 1.463 ± 0.472
2.926AlaAsp: 2.926 ± 1.208
2.682AlaGlu: 2.682 ± 1.247
1.951AlaPhe: 1.951 ± 0.707
2.438AlaGly: 2.438 ± 0.8
0.732AlaHis: 0.732 ± 0.367
2.926AlaIle: 2.926 ± 0.952
3.901AlaLys: 3.901 ± 1.7
5.852AlaLeu: 5.852 ± 1.622
0.975AlaMet: 0.975 ± 0.454
2.438AlaAsn: 2.438 ± 0.78
1.463AlaPro: 1.463 ± 0.388
2.682AlaGln: 2.682 ± 1.247
1.951AlaArg: 1.951 ± 0.617
3.658AlaSer: 3.658 ± 0.611
2.195AlaThr: 2.195 ± 0.945
2.682AlaVal: 2.682 ± 1.225
0.732AlaTrp: 0.732 ± 0.319
1.219AlaTyr: 1.219 ± 0.397
0.0AlaXaa: 0.0 ± 0.0
Cys
0.732CysAla: 0.732 ± 0.301
0.0CysCys: 0.0 ± 0.0
0.488CysAsp: 0.488 ± 0.423
0.732CysGlu: 0.732 ± 0.366
0.488CysPhe: 0.488 ± 0.309
0.975CysGly: 0.975 ± 0.411
0.244CysHis: 0.244 ± 0.154
0.975CysIle: 0.975 ± 0.614
0.975CysLys: 0.975 ± 0.414
1.951CysLeu: 1.951 ± 0.598
0.488CysMet: 0.488 ± 0.244
1.219CysAsn: 1.219 ± 0.527
1.951CysPro: 1.951 ± 0.857
0.244CysGln: 0.244 ± 0.303
0.244CysArg: 0.244 ± 0.303
1.463CysSer: 1.463 ± 0.582
0.732CysThr: 0.732 ± 0.465
1.219CysVal: 1.219 ± 0.768
0.488CysTrp: 0.488 ± 0.307
1.219CysTyr: 1.219 ± 0.346
0.0CysXaa: 0.0 ± 0.0
Asp
3.17AspAla: 3.17 ± 0.992
0.244AspCys: 0.244 ± 0.275
3.901AspAsp: 3.901 ± 2.406
3.901AspGlu: 3.901 ± 0.779
2.195AspPhe: 2.195 ± 0.682
2.682AspGly: 2.682 ± 1.311
1.463AspHis: 1.463 ± 0.895
5.121AspIle: 5.121 ± 0.658
3.414AspLys: 3.414 ± 1.042
7.803AspLeu: 7.803 ± 0.782
1.951AspMet: 1.951 ± 0.712
2.195AspAsn: 2.195 ± 0.668
2.195AspPro: 2.195 ± 0.644
3.414AspGln: 3.414 ± 1.408
1.951AspArg: 1.951 ± 0.672
2.926AspSer: 2.926 ± 1.162
3.414AspThr: 3.414 ± 0.931
1.707AspVal: 1.707 ± 0.631
0.975AspTrp: 0.975 ± 0.614
2.195AspTyr: 2.195 ± 0.854
0.0AspXaa: 0.0 ± 0.0
Glu
3.17GluAla: 3.17 ± 1.763
1.219GluCys: 1.219 ± 0.516
4.145GluAsp: 4.145 ± 1.575
5.852GluGlu: 5.852 ± 2.865
2.438GluPhe: 2.438 ± 0.983
4.145GluGly: 4.145 ± 0.929
1.707GluHis: 1.707 ± 0.944
4.877GluIle: 4.877 ± 0.912
4.389GluLys: 4.389 ± 1.877
5.852GluLeu: 5.852 ± 0.704
0.488GluMet: 0.488 ± 0.34
2.926GluAsn: 2.926 ± 0.62
2.195GluPro: 2.195 ± 0.692
1.463GluGln: 1.463 ± 0.299
2.438GluArg: 2.438 ± 0.563
4.633GluSer: 4.633 ± 0.668
2.682GluThr: 2.682 ± 0.866
2.438GluVal: 2.438 ± 0.675
0.732GluTrp: 0.732 ± 0.537
3.901GluTyr: 3.901 ± 0.429
0.0GluXaa: 0.0 ± 0.0
Phe
1.463PheAla: 1.463 ± 0.603
0.732PheCys: 0.732 ± 0.646
1.219PheAsp: 1.219 ± 0.938
1.951PheGlu: 1.951 ± 0.623
2.195PhePhe: 2.195 ± 0.692
1.707PheGly: 1.707 ± 0.528
2.926PheHis: 2.926 ± 0.772
2.926PheIle: 2.926 ± 0.846
3.17PheLys: 3.17 ± 0.686
4.633PheLeu: 4.633 ± 1.939
1.219PheMet: 1.219 ± 0.733
3.17PheAsn: 3.17 ± 1.551
2.195PhePro: 2.195 ± 0.473
1.463PheGln: 1.463 ± 0.687
3.414PheArg: 3.414 ± 0.772
5.121PheSer: 5.121 ± 1.503
1.463PheThr: 1.463 ± 0.696
2.438PheVal: 2.438 ± 0.797
0.488PheTrp: 0.488 ± 0.34
0.975PheTyr: 0.975 ± 0.508
0.0PheXaa: 0.0 ± 0.0
Gly
2.195GlyAla: 2.195 ± 0.696
0.488GlyCys: 0.488 ± 0.307
3.658GlyAsp: 3.658 ± 1.148
2.195GlyGlu: 2.195 ± 0.672
3.17GlyPhe: 3.17 ± 1.232
2.682GlyGly: 2.682 ± 0.629
0.244GlyHis: 0.244 ± 0.154
3.901GlyIle: 3.901 ± 0.429
3.658GlyLys: 3.658 ± 1.263
6.584GlyLeu: 6.584 ± 0.967
0.732GlyMet: 0.732 ± 0.313
1.219GlyAsn: 1.219 ± 0.543
2.195GlyPro: 2.195 ± 0.771
1.463GlyGln: 1.463 ± 0.491
1.707GlyArg: 1.707 ± 0.704
5.365GlySer: 5.365 ± 0.847
2.682GlyThr: 2.682 ± 0.366
1.463GlyVal: 1.463 ± 0.845
0.488GlyTrp: 0.488 ± 0.307
2.438GlyTyr: 2.438 ± 1.048
0.0GlyXaa: 0.0 ± 0.0
His
0.488HisAla: 0.488 ± 0.34
0.244HisCys: 0.244 ± 0.386
1.219HisAsp: 1.219 ± 0.768
1.219HisGlu: 1.219 ± 0.363
1.463HisPhe: 1.463 ± 0.683
0.975HisGly: 0.975 ± 0.59
0.975HisHis: 0.975 ± 0.508
2.438HisIle: 2.438 ± 0.794
1.219HisLys: 1.219 ± 0.826
3.414HisLeu: 3.414 ± 0.97
0.488HisMet: 0.488 ± 0.254
1.463HisAsn: 1.463 ± 1.05
1.951HisPro: 1.951 ± 0.385
0.732HisGln: 0.732 ± 0.291
2.438HisArg: 2.438 ± 0.586
2.438HisSer: 2.438 ± 0.58
0.732HisThr: 0.732 ± 0.455
1.463HisVal: 1.463 ± 0.386
1.219HisTrp: 1.219 ± 0.347
1.219HisTyr: 1.219 ± 0.559
0.0HisXaa: 0.0 ± 0.0
Ile
3.17IleAla: 3.17 ± 1.004
0.975IleCys: 0.975 ± 0.39
5.365IleAsp: 5.365 ± 0.998
5.365IleGlu: 5.365 ± 1.271
2.926IlePhe: 2.926 ± 0.735
4.633IleGly: 4.633 ± 1.417
1.951IleHis: 1.951 ± 0.647
3.901IleIle: 3.901 ± 0.51
6.096IleLys: 6.096 ± 0.794
7.803IleLeu: 7.803 ± 0.608
1.463IleMet: 1.463 ± 1.367
3.658IleAsn: 3.658 ± 1.005
7.071IlePro: 7.071 ± 0.841
1.951IleGln: 1.951 ± 1.306
4.145IleArg: 4.145 ± 1.265
5.608IleSer: 5.608 ± 1.096
3.901IleThr: 3.901 ± 1.385
3.17IleVal: 3.17 ± 1.659
0.975IleTrp: 0.975 ± 0.411
3.17IleTyr: 3.17 ± 0.96
0.0IleXaa: 0.0 ± 0.0
Lys
2.682LysAla: 2.682 ± 0.767
1.219LysCys: 1.219 ± 0.524
3.658LysAsp: 3.658 ± 1.084
3.901LysGlu: 3.901 ± 1.597
2.438LysPhe: 2.438 ± 1.58
2.926LysGly: 2.926 ± 1.633
3.414LysHis: 3.414 ± 1.29
5.852LysIle: 5.852 ± 1.582
5.852LysLys: 5.852 ± 1.433
5.608LysLeu: 5.608 ± 0.792
0.732LysMet: 0.732 ± 0.402
2.926LysAsn: 2.926 ± 0.821
2.195LysPro: 2.195 ± 0.592
0.975LysGln: 0.975 ± 0.584
3.901LysArg: 3.901 ± 0.958
4.633LysSer: 4.633 ± 1.265
6.096LysThr: 6.096 ± 2.298
3.414LysVal: 3.414 ± 0.459
1.463LysTrp: 1.463 ± 0.48
1.951LysTyr: 1.951 ± 0.675
0.0LysXaa: 0.0 ± 0.0
Leu
5.121LeuAla: 5.121 ± 0.921
1.219LeuCys: 1.219 ± 0.502
5.121LeuAsp: 5.121 ± 1.058
6.096LeuGlu: 6.096 ± 0.964
4.389LeuPhe: 4.389 ± 1.15
4.633LeuGly: 4.633 ± 1.254
1.707LeuHis: 1.707 ± 0.675
9.266LeuIle: 9.266 ± 2.732
5.608LeuLys: 5.608 ± 1.357
8.778LeuLeu: 8.778 ± 0.911
1.951LeuMet: 1.951 ± 0.65
7.315LeuAsn: 7.315 ± 1.988
2.682LeuPro: 2.682 ± 1.23
2.438LeuGln: 2.438 ± 0.477
8.535LeuArg: 8.535 ± 1.22
7.071LeuSer: 7.071 ± 1.443
8.535LeuThr: 8.535 ± 1.803
6.096LeuVal: 6.096 ± 1.201
1.219LeuTrp: 1.219 ± 0.516
3.658LeuTyr: 3.658 ± 0.696
0.0LeuXaa: 0.0 ± 0.0
Met
1.219MetAla: 1.219 ± 0.505
0.488MetCys: 0.488 ± 0.244
1.463MetAsp: 1.463 ± 0.683
2.682MetGlu: 2.682 ± 0.816
1.707MetPhe: 1.707 ± 0.988
0.975MetGly: 0.975 ± 0.344
0.488MetHis: 0.488 ± 0.482
1.707MetIle: 1.707 ± 0.321
1.219MetLys: 1.219 ± 0.98
1.219MetLeu: 1.219 ± 0.397
0.488MetMet: 0.488 ± 0.344
0.975MetAsn: 0.975 ± 0.64
0.732MetPro: 0.732 ± 0.54
1.219MetGln: 1.219 ± 0.634
0.975MetArg: 0.975 ± 0.447
0.975MetSer: 0.975 ± 0.344
0.732MetThr: 0.732 ± 0.291
0.975MetVal: 0.975 ± 0.614
0.244MetTrp: 0.244 ± 0.303
1.463MetTyr: 1.463 ± 0.683
0.0MetXaa: 0.0 ± 0.0
Asn
2.926AsnAla: 2.926 ± 0.98
1.219AsnCys: 1.219 ± 0.513
2.682AsnAsp: 2.682 ± 1.099
3.17AsnGlu: 3.17 ± 1.736
2.195AsnPhe: 2.195 ± 0.583
1.707AsnGly: 1.707 ± 0.321
1.219AsnHis: 1.219 ± 0.413
3.17AsnIle: 3.17 ± 0.845
4.389AsnLys: 4.389 ± 1.024
5.852AsnLeu: 5.852 ± 1.865
1.707AsnMet: 1.707 ± 0.998
1.951AsnAsn: 1.951 ± 0.748
3.901AsnPro: 3.901 ± 0.551
0.975AsnGln: 0.975 ± 0.411
2.438AsnArg: 2.438 ± 1.455
3.658AsnSer: 3.658 ± 1.181
1.707AsnThr: 1.707 ± 0.541
2.682AsnVal: 2.682 ± 0.673
0.244AsnTrp: 0.244 ± 0.154
2.195AsnTyr: 2.195 ± 0.69
0.0AsnXaa: 0.0 ± 0.0
Pro
2.438ProAla: 2.438 ± 0.511
0.244ProCys: 0.244 ± 0.154
3.414ProAsp: 3.414 ± 0.408
3.17ProGlu: 3.17 ± 0.703
1.463ProPhe: 1.463 ± 0.386
2.926ProGly: 2.926 ± 1.416
1.707ProHis: 1.707 ± 0.56
4.145ProIle: 4.145 ± 0.751
2.195ProLys: 2.195 ± 1.272
3.658ProLeu: 3.658 ± 1.035
0.975ProMet: 0.975 ± 0.323
2.682ProAsn: 2.682 ± 0.575
1.951ProPro: 1.951 ± 0.5
0.975ProGln: 0.975 ± 0.343
1.707ProArg: 1.707 ± 0.563
3.901ProSer: 3.901 ± 1.459
1.951ProThr: 1.951 ± 0.672
2.926ProVal: 2.926 ± 0.99
0.488ProTrp: 0.488 ± 0.307
1.463ProTyr: 1.463 ± 0.985
0.0ProXaa: 0.0 ± 0.0
Gln
1.707GlnAla: 1.707 ± 0.918
0.244GlnCys: 0.244 ± 0.154
0.975GlnAsp: 0.975 ± 0.508
1.951GlnGlu: 1.951 ± 0.594
2.195GlnPhe: 2.195 ± 0.519
1.463GlnGly: 1.463 ± 0.468
0.488GlnHis: 0.488 ± 0.423
1.707GlnIle: 1.707 ± 0.484
1.951GlnLys: 1.951 ± 0.633
2.926GlnLeu: 2.926 ± 1.261
0.244GlnMet: 0.244 ± 0.386
1.219GlnAsn: 1.219 ± 0.768
0.732GlnPro: 0.732 ± 0.301
0.732GlnGln: 0.732 ± 0.335
0.975GlnArg: 0.975 ± 0.323
2.926GlnSer: 2.926 ± 0.921
2.195GlnThr: 2.195 ± 0.964
2.438GlnVal: 2.438 ± 0.686
0.488GlnTrp: 0.488 ± 0.47
1.463GlnTyr: 1.463 ± 0.651
0.0GlnXaa: 0.0 ± 0.0
Arg
3.658ArgAla: 3.658 ± 1.044
1.463ArgCys: 1.463 ± 0.299
2.195ArgAsp: 2.195 ± 0.698
2.682ArgGlu: 2.682 ± 1.359
3.901ArgPhe: 3.901 ± 0.782
1.951ArgGly: 1.951 ± 0.707
0.732ArgHis: 0.732 ± 0.461
2.438ArgIle: 2.438 ± 0.576
2.926ArgLys: 2.926 ± 0.98
4.389ArgLeu: 4.389 ± 0.73
1.219ArgMet: 1.219 ± 0.527
3.658ArgAsn: 3.658 ± 0.704
2.195ArgPro: 2.195 ± 0.746
1.707ArgGln: 1.707 ± 0.592
3.414ArgArg: 3.414 ± 0.872
5.365ArgSer: 5.365 ± 1.351
3.17ArgThr: 3.17 ± 0.718
2.682ArgVal: 2.682 ± 0.646
0.975ArgTrp: 0.975 ± 0.411
1.463ArgTyr: 1.463 ± 0.683
0.0ArgXaa: 0.0 ± 0.0
Ser
3.901SerAla: 3.901 ± 0.842
1.951SerCys: 1.951 ± 0.945
5.852SerAsp: 5.852 ± 0.874
2.926SerGlu: 2.926 ± 0.748
2.926SerPhe: 2.926 ± 0.753
4.389SerGly: 4.389 ± 0.534
1.707SerHis: 1.707 ± 0.355
5.852SerIle: 5.852 ± 0.683
3.658SerLys: 3.658 ± 1.303
9.51SerLeu: 9.51 ± 2.881
2.682SerMet: 2.682 ± 0.66
4.389SerAsn: 4.389 ± 0.463
4.145SerPro: 4.145 ± 1.063
2.682SerGln: 2.682 ± 0.762
4.633SerArg: 4.633 ± 1.121
6.828SerSer: 6.828 ± 1.994
6.34SerThr: 6.34 ± 1.74
3.658SerVal: 3.658 ± 1.322
2.438SerTrp: 2.438 ± 0.561
3.901SerTyr: 3.901 ± 0.589
0.0SerXaa: 0.0 ± 0.0
Thr
2.438ThrAla: 2.438 ± 0.68
1.219ThrCys: 1.219 ± 0.556
2.195ThrAsp: 2.195 ± 0.753
4.145ThrGlu: 4.145 ± 1.933
2.682ThrPhe: 2.682 ± 0.938
1.951ThrGly: 1.951 ± 0.779
1.951ThrHis: 1.951 ± 0.413
6.096ThrIle: 6.096 ± 1.838
3.901ThrLys: 3.901 ± 0.559
5.608ThrLeu: 5.608 ± 1.739
1.463ThrMet: 1.463 ± 0.373
2.682ThrAsn: 2.682 ± 0.625
1.951ThrPro: 1.951 ± 0.682
1.707ThrGln: 1.707 ± 0.717
3.17ThrArg: 3.17 ± 0.8
3.901ThrSer: 3.901 ± 0.917
7.071ThrThr: 7.071 ± 1.522
4.877ThrVal: 4.877 ± 1.918
1.463ThrTrp: 1.463 ± 0.838
1.951ThrTyr: 1.951 ± 0.413
0.0ThrXaa: 0.0 ± 0.0
Val
3.17ValAla: 3.17 ± 1.007
1.219ValCys: 1.219 ± 0.413
3.414ValAsp: 3.414 ± 0.483
3.414ValGlu: 3.414 ± 0.517
2.195ValPhe: 2.195 ± 0.546
2.195ValGly: 2.195 ± 1.434
1.219ValHis: 1.219 ± 0.403
5.121ValIle: 5.121 ± 0.807
2.195ValLys: 2.195 ± 1.146
4.633ValLeu: 4.633 ± 0.663
1.707ValMet: 1.707 ± 0.608
1.463ValAsn: 1.463 ± 0.491
0.975ValPro: 0.975 ± 0.614
0.732ValGln: 0.732 ± 0.291
2.195ValArg: 2.195 ± 1.062
5.365ValSer: 5.365 ± 1.078
4.145ValThr: 4.145 ± 1.096
3.901ValVal: 3.901 ± 0.928
0.0ValTrp: 0.0 ± 0.0
2.438ValTyr: 2.438 ± 1.237
0.0ValXaa: 0.0 ± 0.0
Trp
0.488TrpAla: 0.488 ± 0.244
0.0TrpCys: 0.0 ± 0.0
0.488TrpAsp: 0.488 ± 0.307
1.219TrpGlu: 1.219 ± 0.493
0.488TrpPhe: 0.488 ± 0.244
1.463TrpGly: 1.463 ± 0.442
0.732TrpHis: 0.732 ± 0.319
2.438TrpIle: 2.438 ± 0.768
1.219TrpLys: 1.219 ± 0.413
0.975TrpLeu: 0.975 ± 0.305
0.488TrpMet: 0.488 ± 0.254
0.975TrpAsn: 0.975 ± 0.39
0.244TrpPro: 0.244 ± 0.154
0.244TrpGln: 0.244 ± 0.275
0.488TrpArg: 0.488 ± 0.307
2.195TrpSer: 2.195 ± 1.093
0.488TrpThr: 0.488 ± 0.254
0.488TrpVal: 0.488 ± 0.55
0.0TrpTrp: 0.0 ± 0.0
0.244TrpTyr: 0.244 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.732TyrAla: 0.732 ± 0.366
0.975TyrCys: 0.975 ± 0.412
2.438TyrAsp: 2.438 ± 0.928
2.438TyrGlu: 2.438 ± 0.5
1.463TyrPhe: 1.463 ± 0.505
1.951TyrGly: 1.951 ± 0.2
2.195TyrHis: 2.195 ± 0.9
2.438TyrIle: 2.438 ± 0.743
3.901TyrLys: 3.901 ± 0.74
4.389TyrLeu: 4.389 ± 1.33
0.244TyrMet: 0.244 ± 0.154
1.463TyrAsn: 1.463 ± 0.468
1.463TyrPro: 1.463 ± 0.654
1.219TyrGln: 1.219 ± 0.422
1.463TyrArg: 1.463 ± 0.347
6.34TyrSer: 6.34 ± 1.01
2.195TyrThr: 2.195 ± 0.94
0.975TyrVal: 0.975 ± 0.59
0.244TyrTrp: 0.244 ± 0.303
1.707TyrTyr: 1.707 ± 0.539
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4102 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski