Amino acid dipeptide frequency for Piry virus (PIRYV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
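The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein only are assumptions made for the example. How the database treats protein boundaries is not stated here.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code, in the table's column order.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and any letter outside the 20 standard ones is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

# Toy input (hypothetical sequences, not the PIRYV proteome).
toy = ["MKLLSA", "MSLLK"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille.
expected = 1000.0 / 400
over = sorted(dp for dp, f in freqs.items() if f > expected)
under = sorted(dp for dp, f in freqs.items() if f < expected)
print(len(freqs), expected, len(over), len(under))
```

With real proteome data, the `over` and `under` lists would correspond to the red and blue cells of the table, respectively.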

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.603 ± 2.146
AlaCys: 1.109 ± 0.628
AlaAsp: 1.663 ± 0.699
AlaGlu: 1.94 ± 0.556
AlaPhe: 1.109 ± 0.409
AlaGly: 2.494 ± 0.368
AlaHis: 1.386 ± 0.284
AlaIle: 3.049 ± 0.88
AlaLys: 3.603 ± 0.662
AlaLeu: 6.098 ± 1.175
AlaMet: 1.663 ± 0.555
AlaAsn: 3.326 ± 0.928
AlaPro: 2.772 ± 1.871
AlaGln: 2.494 ± 0.597
AlaArg: 3.326 ± 2.348
AlaSer: 3.326 ± 1.832
AlaThr: 3.326 ± 0.539
AlaVal: 4.157 ± 1.381
AlaTrp: 0.554 ± 0.346
AlaTyr: 1.663 ± 0.813
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.109 ± 0.745
CysCys: 0.277 ± 0.157
CysAsp: 0.831 ± 0.707
CysGlu: 1.109 ± 0.793
CysPhe: 1.109 ± 0.472
CysGly: 0.831 ± 0.337
CysHis: 0.554 ± 0.531
CysIle: 1.663 ± 0.65
CysLys: 0.831 ± 0.35
CysLeu: 1.663 ± 0.478
CysMet: 0.277 ± 0.157
CysAsn: 0.277 ± 0.157
CysPro: 0.554 ± 0.33
CysGln: 0.831 ± 0.337
CysArg: 0.831 ± 0.35
CysSer: 1.109 ± 0.409
CysThr: 0.554 ± 0.314
CysVal: 1.386 ± 0.423
CysTrp: 0.554 ± 0.314
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.217 ± 0.399
AspCys: 0.554 ± 0.531
AspAsp: 4.157 ± 0.963
AspGlu: 3.603 ± 1.983
AspPhe: 2.772 ± 0.714
AspGly: 3.326 ± 0.616
AspHis: 0.554 ± 0.531
AspIle: 1.663 ± 0.572
AspLys: 3.88 ± 0.801
AspLeu: 7.483 ± 1.069
AspMet: 3.049 ± 1.169
AspAsn: 3.603 ± 1.451
AspPro: 3.603 ± 0.456
AspGln: 1.386 ± 0.423
AspArg: 1.109 ± 0.409
AspSer: 3.88 ± 0.49
AspThr: 3.326 ± 0.616
AspVal: 3.049 ± 1.006
AspTrp: 1.386 ± 0.48
AspTyr: 3.603 ± 0.822
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.603 ± 0.796
GluCys: 0.277 ± 0.392
GluAsp: 3.603 ± 1.085
GluGlu: 3.88 ± 1.174
GluPhe: 3.603 ± 1.169
GluGly: 2.494 ± 0.543
GluHis: 1.109 ± 0.455
GluIle: 3.603 ± 0.812
GluLys: 4.157 ± 0.945
GluLeu: 4.157 ± 0.941
GluMet: 1.94 ± 1.011
GluAsn: 2.217 ± 0.773
GluPro: 1.663 ± 0.419
GluGln: 2.494 ± 0.543
GluArg: 1.109 ± 0.873
GluSer: 7.483 ± 2.345
GluThr: 3.049 ± 0.83
GluVal: 3.88 ± 1.565
GluTrp: 1.386 ± 1.197
GluTyr: 4.712 ± 0.703
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.217 ± 1.349
PheCys: 0.831 ± 0.707
PheAsp: 2.494 ± 0.494
PheGlu: 1.94 ± 0.245
PhePhe: 2.494 ± 0.356
PheGly: 3.326 ± 0.796
PheHis: 2.217 ± 0.607
PheIle: 1.386 ± 0.284
PheLys: 3.88 ± 0.808
PheLeu: 4.712 ± 1.272
PheMet: 0.831 ± 0.471
PheAsn: 1.386 ± 0.773
PhePro: 4.157 ± 2.174
PheGln: 1.386 ± 0.38
PheArg: 3.326 ± 1.158
PheSer: 3.88 ± 1.083
PheThr: 1.663 ± 0.815
PheVal: 1.386 ± 0.525
PheTrp: 0.554 ± 0.346
PheTyr: 0.554 ± 0.368
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.663 ± 0.986
GlyCys: 0.831 ± 0.471
GlyAsp: 3.326 ± 0.538
GlyGlu: 1.94 ± 1.512
GlyPhe: 2.494 ± 0.938
GlyGly: 3.049 ± 0.83
GlyHis: 1.109 ± 0.424
GlyIle: 3.049 ± 0.717
GlyLys: 4.435 ± 0.786
GlyLeu: 9.146 ± 1.203
GlyMet: 1.94 ± 0.44
GlyAsn: 2.494 ± 0.783
GlyPro: 3.049 ± 0.956
GlyGln: 3.049 ± 1.168
GlyArg: 3.88 ± 1.02
GlySer: 4.712 ± 1.79
GlyThr: 3.049 ± 0.4
GlyVal: 3.049 ± 0.89
GlyTrp: 1.109 ± 0.409
GlyTyr: 1.94 ± 0.363
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.831 ± 0.707
HisCys: 0.554 ± 0.33
HisAsp: 0.554 ± 0.314
HisGlu: 1.109 ± 0.314
HisPhe: 1.94 ± 0.68
HisGly: 1.386 ± 0.488
HisHis: 0.831 ± 0.839
HisIle: 1.94 ± 0.618
HisLys: 1.386 ± 0.786
HisLeu: 1.663 ± 0.572
HisMet: 0.831 ± 0.396
HisAsn: 0.831 ± 0.407
HisPro: 1.386 ± 0.648
HisGln: 1.109 ± 0.628
HisArg: 1.663 ± 0.676
HisSer: 1.663 ± 1.344
HisThr: 1.663 ± 0.65
HisVal: 1.109 ± 0.409
HisTrp: 1.386 ± 0.52
HisTyr: 0.277 ± 0.157
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.494 ± 0.808
IleCys: 1.386 ± 0.648
IleAsp: 5.543 ± 0.754
IleGlu: 1.94 ± 0.363
IlePhe: 3.049 ± 0.714
IleGly: 3.326 ± 0.671
IleHis: 1.109 ± 0.409
IleIle: 1.663 ± 0.478
IleLys: 4.712 ± 0.506
IleLeu: 5.266 ± 0.766
IleMet: 0.831 ± 0.337
IleAsn: 3.88 ± 0.88
IlePro: 3.326 ± 0.698
IleGln: 2.494 ± 0.751
IleArg: 6.929 ± 1.179
IleSer: 3.603 ± 0.801
IleThr: 2.494 ± 0.368
IleVal: 3.326 ± 1.591
IleTrp: 0.831 ± 0.394
IleTyr: 1.94 ± 0.81
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.94 ± 1.158
LysCys: 0.831 ± 0.394
LysAsp: 4.712 ± 1.028
LysGlu: 6.929 ± 0.947
LysPhe: 1.386 ± 0.648
LysGly: 4.989 ± 1.257
LysHis: 1.386 ± 0.488
LysIle: 5.82 ± 1.003
LysLys: 6.098 ± 2.081
LysLeu: 4.712 ± 1.289
LysMet: 0.831 ± 0.337
LysAsn: 2.772 ± 0.493
LysPro: 1.663 ± 0.534
LysGln: 1.109 ± 0.483
LysArg: 3.049 ± 0.554
LysSer: 7.206 ± 1.496
LysThr: 3.603 ± 1.016
LysVal: 2.772 ± 0.856
LysTrp: 1.663 ± 0.65
LysTyr: 1.386 ± 0.691
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.989 ± 1.255
LeuCys: 1.663 ± 0.336
LeuAsp: 4.989 ± 1.157
LeuGlu: 7.761 ± 2.19
LeuPhe: 3.603 ± 0.378
LeuGly: 5.266 ± 0.819
LeuHis: 2.772 ± 0.801
LeuIle: 8.038 ± 2.091
LeuLys: 5.82 ± 1.559
LeuLeu: 6.929 ± 1.766
LeuMet: 3.049 ± 1.035
LeuAsn: 5.543 ± 1.01
LeuPro: 3.049 ± 0.777
LeuGln: 4.157 ± 0.981
LeuArg: 5.266 ± 1.498
LeuSer: 9.978 ± 2.783
LeuThr: 4.989 ± 0.918
LeuVal: 4.435 ± 1.12
LeuTrp: 1.109 ± 0.659
LeuTyr: 2.494 ± 0.597
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.94 ± 0.507
MetCys: 0.277 ± 0.157
MetAsp: 1.109 ± 0.784
MetGlu: 2.217 ± 0.787
MetPhe: 0.554 ± 0.346
MetGly: 2.217 ± 0.57
MetHis: 0.0 ± 0.0
MetIle: 0.831 ± 0.471
MetLys: 1.386 ± 0.758
MetLeu: 3.326 ± 0.534
MetMet: 1.386 ± 0.488
MetAsn: 0.831 ± 0.471
MetPro: 1.386 ± 1.182
MetGln: 1.109 ± 0.409
MetArg: 1.386 ± 0.52
MetSer: 2.772 ± 0.771
MetThr: 2.217 ± 0.627
MetVal: 1.663 ± 0.526
MetTrp: 0.277 ± 0.157
MetTyr: 0.831 ± 0.54
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.049 ± 0.9
AsnCys: 0.554 ± 0.784
AsnAsp: 1.94 ± 0.556
AsnGlu: 1.663 ± 0.42
AsnPhe: 1.94 ± 0.44
AsnGly: 3.603 ± 1.182
AsnHis: 1.386 ± 0.52
AsnIle: 3.049 ± 0.625
AsnLys: 3.049 ± 0.9
AsnLeu: 4.712 ± 1.212
AsnMet: 0.277 ± 0.392
AsnAsn: 2.217 ± 0.727
AsnPro: 2.772 ± 1.325
AsnGln: 2.217 ± 0.719
AsnArg: 0.831 ± 0.471
AsnSer: 3.326 ± 0.919
AsnThr: 3.049 ± 1.035
AsnVal: 1.663 ± 0.826
AsnTrp: 1.94 ± 0.363
AsnTyr: 1.94 ± 0.51
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.049 ± 1.015
ProCys: 0.277 ± 0.408
ProAsp: 3.603 ± 0.651
ProGlu: 3.049 ± 2.645
ProPhe: 1.663 ± 0.721
ProGly: 2.217 ± 0.992
ProHis: 0.831 ± 0.337
ProIle: 2.217 ± 0.996
ProLys: 2.772 ± 0.738
ProLeu: 4.712 ± 1.304
ProMet: 1.663 ± 0.794
ProAsn: 1.386 ± 1.031
ProPro: 2.217 ± 1.435
ProGln: 1.386 ± 1.065
ProArg: 2.217 ± 0.518
ProSer: 4.989 ± 1.248
ProThr: 4.712 ± 0.956
ProVal: 3.049 ± 1.316
ProTrp: 0.554 ± 0.393
ProTyr: 1.109 ± 1.352
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.386 ± 0.488
GlnCys: 1.109 ± 0.417
GlnAsp: 1.663 ± 0.715
GlnGlu: 2.217 ± 0.545
GlnPhe: 1.663 ± 0.478
GlnGly: 2.772 ± 0.617
GlnHis: 0.831 ± 0.471
GlnIle: 1.663 ± 0.478
GlnLys: 2.772 ± 0.761
GlnLeu: 2.494 ± 0.599
GlnMet: 0.831 ± 0.337
GlnAsn: 1.94 ± 0.507
GlnPro: 0.554 ± 0.393
GlnGln: 0.277 ± 0.157
GlnArg: 3.326 ± 0.671
GlnSer: 2.494 ± 1.01
GlnThr: 2.772 ± 0.595
GlnVal: 2.494 ± 0.569
GlnTrp: 0.831 ± 0.481
GlnTyr: 2.772 ± 0.958
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.88 ± 0.993
ArgCys: 1.386 ± 0.586
ArgAsp: 2.772 ± 1.053
ArgGlu: 4.157 ± 0.22
ArgPhe: 3.326 ± 0.786
ArgGly: 4.435 ± 0.531
ArgHis: 1.109 ± 0.409
ArgIle: 2.217 ± 0.262
ArgLys: 1.94 ± 0.592
ArgLeu: 3.603 ± 0.654
ArgMet: 1.663 ± 0.487
ArgAsn: 3.049 ± 1.124
ArgPro: 2.772 ± 0.998
ArgGln: 2.217 ± 0.399
ArgArg: 1.94 ± 0.51
ArgSer: 3.603 ± 0.917
ArgThr: 3.88 ± 0.756
ArgVal: 4.157 ± 1.256
ArgTrp: 1.386 ± 0.52
ArgTyr: 0.831 ± 0.407
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.375 ± 2.476
SerCys: 0.554 ± 0.59
SerAsp: 6.098 ± 0.896
SerGlu: 5.266 ± 1.597
SerPhe: 3.326 ± 0.644
SerGly: 3.049 ± 1.24
SerHis: 2.217 ± 0.719
SerIle: 6.098 ± 1.614
SerLys: 4.712 ± 0.668
SerLeu: 6.929 ± 1.323
SerMet: 1.663 ± 0.526
SerAsn: 2.217 ± 0.262
SerPro: 6.098 ± 1.66
SerGln: 2.217 ± 0.912
SerArg: 5.266 ± 1.038
SerSer: 7.206 ± 2.24
SerThr: 4.989 ± 2.275
SerVal: 4.989 ± 0.951
SerTrp: 1.109 ± 0.417
SerTyr: 3.88 ± 1.065
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.772 ± 0.352
ThrCys: 1.109 ± 0.409
ThrAsp: 1.94 ± 1.244
ThrGlu: 2.217 ± 0.702
ThrPhe: 1.94 ± 0.592
ThrGly: 3.88 ± 1.155
ThrHis: 1.109 ± 0.424
ThrIle: 4.712 ± 1.322
ThrLys: 2.772 ± 0.917
ThrLeu: 7.206 ± 1.662
ThrMet: 1.386 ± 0.525
ThrAsn: 2.217 ± 0.818
ThrPro: 3.049 ± 1.564
ThrGln: 2.772 ± 0.85
ThrArg: 2.494 ± 1.059
ThrSer: 5.266 ± 1.046
ThrThr: 3.326 ± 0.728
ThrVal: 5.266 ± 1.741
ThrTrp: 1.94 ± 0.903
ThrTyr: 0.554 ± 0.314
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.94 ± 0.872
ValCys: 2.494 ± 0.569
ValAsp: 4.435 ± 0.692
ValGlu: 4.435 ± 0.922
ValPhe: 3.326 ± 0.42
ValGly: 3.326 ± 1.096
ValHis: 2.217 ± 0.57
ValIle: 3.88 ± 1.464
ValLys: 2.772 ± 1.632
ValLeu: 4.712 ± 1.695
ValMet: 1.386 ± 0.729
ValAsn: 2.494 ± 1.0
ValPro: 2.217 ± 0.835
ValGln: 2.494 ± 0.658
ValArg: 3.603 ± 0.74
ValSer: 3.88 ± 0.808
ValThr: 3.603 ± 1.139
ValVal: 3.049 ± 1.152
ValTrp: 0.277 ± 0.392
ValTyr: 1.386 ± 0.525
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.554 ± 0.393
TrpCys: 0.0 ± 0.0
TrpAsp: 1.386 ± 0.284
TrpGlu: 2.217 ± 0.748
TrpPhe: 1.109 ± 0.659
TrpGly: 1.663 ± 0.653
TrpHis: 0.277 ± 0.157
TrpIle: 1.663 ± 0.986
TrpLys: 0.831 ± 0.471
TrpLeu: 1.663 ± 0.699
TrpMet: 0.554 ± 0.33
TrpAsn: 0.831 ± 0.471
TrpPro: 0.277 ± 0.157
TrpGln: 0.277 ± 0.157
TrpArg: 1.386 ± 1.031
TrpSer: 1.663 ± 0.42
TrpThr: 0.554 ± 0.393
TrpVal: 1.663 ± 0.778
TrpTrp: 0.277 ± 0.408
TrpTyr: 0.277 ± 0.392
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.049 ± 1.227
TyrCys: 0.277 ± 0.157
TyrAsp: 1.386 ± 0.6
TyrGlu: 1.109 ± 0.474
TyrPhe: 2.217 ± 0.943
TyrGly: 1.663 ± 0.673
TyrHis: 1.109 ± 0.409
TyrIle: 1.94 ± 0.927
TyrLys: 3.049 ± 0.703
TyrLeu: 4.712 ± 0.474
TyrMet: 1.386 ± 0.818
TyrAsn: 1.663 ± 0.788
TyrPro: 1.109 ± 0.404
TyrGln: 1.386 ± 0.525
TyrArg: 1.386 ± 0.893
TyrSer: 2.217 ± 0.518
TyrThr: 1.109 ± 0.424
TyrVal: 1.386 ± 0.475
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.277 ± 0.157
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3609 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
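A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used by Proteome-pI is not described here, so this is only one plausible reading: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the use of the population standard deviation, and the toy sequences are all assumptions.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide, counting overlapping pairs
    within each protein (an assumption about boundary handling)."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_sd(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency estimate under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.pstdev(estimates)

# Toy input (hypothetical sequences, not the PIRYV proteome).
toy = ["MKLLSA", "MSLLK", "MLLLG", "MKSAG", "MAAKL"]
print(pair_per_mille(toy, "LL"), bootstrap_sd(toy, "LL"))
```

With only five proteins, as in this proteome, each resample can differ a lot from the original set, which is one reason the reported errors are large relative to the values.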

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski