Amino acid dipepetide frequency for Helleborus net necrosis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.753AlaAla: 5.753 ± 2.589
1.798AlaCys: 1.798 ± 0.977
5.034AlaAsp: 5.034 ± 1.516
5.753AlaGlu: 5.753 ± 2.094
2.877AlaPhe: 2.877 ± 1.089
3.955AlaGly: 3.955 ± 1.536
2.517AlaHis: 2.517 ± 1.013
4.675AlaIle: 4.675 ± 1.065
6.472AlaLys: 6.472 ± 1.824
8.63AlaLeu: 8.63 ± 3.336
1.079AlaMet: 1.079 ± 0.586
3.236AlaAsn: 3.236 ± 1.709
1.798AlaPro: 1.798 ± 0.977
2.517AlaGln: 2.517 ± 1.167
4.315AlaArg: 4.315 ± 0.865
5.394AlaSer: 5.394 ± 1.54
2.157AlaThr: 2.157 ± 1.885
6.113AlaVal: 6.113 ± 3.955
0.0AlaTrp: 0.0 ± 0.0
1.438AlaTyr: 1.438 ± 0.782
0.0AlaXaa: 0.0 ± 0.0
Cys
3.596CysAla: 3.596 ± 1.473
0.0CysCys: 0.0 ± 0.0
0.36CysAsp: 0.36 ± 0.782
1.438CysGlu: 1.438 ± 0.782
2.157CysPhe: 2.157 ± 1.166
2.157CysGly: 2.157 ± 0.996
1.079CysHis: 1.079 ± 1.777
1.798CysIle: 1.798 ± 1.112
1.079CysLys: 1.079 ± 0.666
1.438CysLeu: 1.438 ± 1.665
0.36CysMet: 0.36 ± 0.195
1.438CysAsn: 1.438 ± 0.901
0.0CysPro: 0.0 ± 0.0
0.36CysGln: 0.36 ± 0.195
1.798CysArg: 1.798 ± 0.845
2.877CysSer: 2.877 ± 1.41
2.877CysThr: 2.877 ± 1.315
1.798CysVal: 1.798 ± 0.763
0.719CysTrp: 0.719 ± 0.969
1.438CysTyr: 1.438 ± 1.665
0.0CysXaa: 0.0 ± 0.0
Asp
1.438AspAla: 1.438 ± 1.036
0.719AspCys: 0.719 ± 0.391
2.517AspAsp: 2.517 ± 1.368
3.236AspGlu: 3.236 ± 0.892
4.315AspPhe: 4.315 ± 1.92
3.955AspGly: 3.955 ± 0.957
2.517AspHis: 2.517 ± 1.785
1.438AspIle: 1.438 ± 0.613
1.438AspLys: 1.438 ± 0.782
5.753AspLeu: 5.753 ± 3.926
1.438AspMet: 1.438 ± 1.151
2.517AspAsn: 2.517 ± 0.947
2.157AspPro: 2.157 ± 1.833
1.798AspGln: 1.798 ± 1.155
2.157AspArg: 2.157 ± 1.168
5.034AspSer: 5.034 ± 1.543
1.798AspThr: 1.798 ± 0.952
2.517AspVal: 2.517 ± 0.898
1.079AspTrp: 1.079 ± 0.57
2.877AspTyr: 2.877 ± 2.013
0.0AspXaa: 0.0 ± 0.0
Glu
6.113GluAla: 6.113 ± 2.491
1.079GluCys: 1.079 ± 0.586
2.157GluAsp: 2.157 ± 1.14
5.394GluGlu: 5.394 ± 2.128
1.798GluPhe: 1.798 ± 1.145
3.955GluGly: 3.955 ± 1.258
2.517GluHis: 2.517 ± 1.368
4.675GluIle: 4.675 ± 1.831
1.798GluLys: 1.798 ± 0.684
5.394GluLeu: 5.394 ± 1.456
1.079GluMet: 1.079 ± 0.586
3.236GluAsn: 3.236 ± 1.333
2.877GluPro: 2.877 ± 1.167
2.157GluGln: 2.157 ± 1.775
3.236GluArg: 3.236 ± 0.955
5.753GluSer: 5.753 ± 1.533
2.877GluThr: 2.877 ± 1.709
3.236GluVal: 3.236 ± 0.892
0.0GluTrp: 0.0 ± 0.0
2.517GluTyr: 2.517 ± 1.133
0.0GluXaa: 0.0 ± 0.0
Phe
5.394PheAla: 5.394 ± 1.554
0.719PheCys: 0.719 ± 0.391
5.034PheAsp: 5.034 ± 1.455
4.675PheGlu: 4.675 ± 1.532
2.517PhePhe: 2.517 ± 0.994
5.034PheGly: 5.034 ± 2.86
0.719PheHis: 0.719 ± 0.7
2.517PheIle: 2.517 ± 2.565
3.236PheLys: 3.236 ± 1.759
6.472PheLeu: 6.472 ± 1.524
1.079PheMet: 1.079 ± 0.586
3.236PheAsn: 3.236 ± 1.333
1.079PhePro: 1.079 ± 0.57
0.719PheGln: 0.719 ± 0.391
1.798PheArg: 1.798 ± 0.977
6.113PheSer: 6.113 ± 2.165
5.034PheThr: 5.034 ± 1.399
5.034PheVal: 5.034 ± 1.919
0.719PheTrp: 0.719 ± 0.592
2.517PheTyr: 2.517 ± 0.9
0.0PheXaa: 0.0 ± 0.0
Gly
4.315GlyAla: 4.315 ± 1.822
1.438GlyCys: 1.438 ± 1.835
4.315GlyAsp: 4.315 ± 1.85
4.315GlyGlu: 4.315 ± 1.499
3.955GlyPhe: 3.955 ± 3.522
2.877GlyGly: 2.877 ± 2.741
1.079GlyHis: 1.079 ± 0.915
1.798GlyIle: 1.798 ± 0.763
5.753GlyLys: 5.753 ± 1.069
8.27GlyLeu: 8.27 ± 2.101
0.0GlyMet: 0.0 ± 0.0
2.157GlyAsn: 2.157 ± 0.918
0.36GlyPro: 0.36 ± 0.195
1.079GlyGln: 1.079 ± 0.666
3.236GlyArg: 3.236 ± 0.974
5.394GlySer: 5.394 ± 1.936
3.236GlyThr: 3.236 ± 1.409
2.517GlyVal: 2.517 ± 0.898
1.079GlyTrp: 1.079 ± 0.586
3.236GlyTyr: 3.236 ± 1.48
0.0GlyXaa: 0.0 ± 0.0
His
1.798HisAla: 1.798 ± 0.763
0.719HisCys: 0.719 ± 0.391
0.719HisAsp: 0.719 ± 0.391
0.719HisGlu: 0.719 ± 0.391
2.157HisPhe: 2.157 ± 0.875
1.438HisGly: 1.438 ± 1.835
0.36HisHis: 0.36 ± 0.195
1.079HisIle: 1.079 ± 1.103
2.517HisLys: 2.517 ± 0.994
2.157HisLeu: 2.157 ± 1.173
1.079HisMet: 1.079 ± 1.326
1.079HisAsn: 1.079 ± 0.666
1.079HisPro: 1.079 ± 0.666
0.36HisGln: 0.36 ± 0.195
1.438HisArg: 1.438 ± 1.755
4.315HisSer: 4.315 ± 2.621
0.719HisThr: 0.719 ± 0.391
1.079HisVal: 1.079 ± 1.312
0.0HisTrp: 0.0 ± 0.0
0.36HisTyr: 0.36 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
4.675IleAla: 4.675 ± 1.647
2.157IleCys: 2.157 ± 0.872
1.798IleAsp: 1.798 ± 1.352
4.675IleGlu: 4.675 ± 2.235
2.877IlePhe: 2.877 ± 1.16
2.877IleGly: 2.877 ± 1.295
1.438IleHis: 1.438 ± 0.782
2.877IleIle: 2.877 ± 2.225
3.236IleLys: 3.236 ± 1.44
5.753IleLeu: 5.753 ± 1.705
1.079IleMet: 1.079 ± 1.146
1.438IleAsn: 1.438 ± 0.689
1.079IlePro: 1.079 ± 0.586
1.079IleGln: 1.079 ± 0.57
1.798IleArg: 1.798 ± 1.487
4.315IleSer: 4.315 ± 1.073
1.798IleThr: 1.798 ± 0.684
3.236IleVal: 3.236 ± 1.197
0.36IleTrp: 0.36 ± 0.195
1.438IleTyr: 1.438 ± 1.984
0.0IleXaa: 0.0 ± 0.0
Lys
3.955LysAla: 3.955 ± 1.695
1.438LysCys: 1.438 ± 0.782
3.236LysAsp: 3.236 ± 1.998
2.877LysGlu: 2.877 ± 0.774
4.315LysPhe: 4.315 ± 1.881
3.236LysGly: 3.236 ± 0.974
2.157LysHis: 2.157 ± 1.332
2.877LysIle: 2.877 ± 1.709
2.877LysLys: 2.877 ± 1.215
8.27LysLeu: 8.27 ± 2.547
0.719LysMet: 0.719 ± 0.391
3.236LysAsn: 3.236 ± 0.987
3.955LysPro: 3.955 ± 1.364
1.798LysGln: 1.798 ± 1.066
4.315LysArg: 4.315 ± 1.809
4.315LysSer: 4.315 ± 1.75
4.315LysThr: 4.315 ± 1.331
2.157LysVal: 2.157 ± 1.173
0.719LysTrp: 0.719 ± 0.391
1.438LysTyr: 1.438 ± 0.613
0.0LysXaa: 0.0 ± 0.0
Leu
7.192LeuAla: 7.192 ± 2.973
3.236LeuCys: 3.236 ± 1.708
5.753LeuAsp: 5.753 ± 2.461
5.034LeuGlu: 5.034 ± 1.57
5.394LeuPhe: 5.394 ± 1.994
6.472LeuGly: 6.472 ± 1.857
1.079LeuHis: 1.079 ± 0.902
5.394LeuIle: 5.394 ± 1.835
10.428LeuLys: 10.428 ± 3.005
7.192LeuLeu: 7.192 ± 2.281
1.438LeuMet: 1.438 ± 0.782
4.675LeuAsn: 4.675 ± 1.427
6.832LeuPro: 6.832 ± 1.567
2.877LeuGln: 2.877 ± 0.774
4.675LeuArg: 4.675 ± 1.427
5.034LeuSer: 5.034 ± 2.553
8.27LeuThr: 8.27 ± 2.406
6.472LeuVal: 6.472 ± 1.784
0.36LeuTrp: 0.36 ± 0.195
2.517LeuTyr: 2.517 ± 1.341
0.0LeuXaa: 0.0 ± 0.0
Met
2.517MetAla: 2.517 ± 0.994
0.36MetCys: 0.36 ± 0.195
1.079MetAsp: 1.079 ± 0.666
1.438MetGlu: 1.438 ± 0.782
0.719MetPhe: 0.719 ± 0.391
0.719MetGly: 0.719 ± 0.391
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.719MetLys: 0.719 ± 0.391
3.596MetLeu: 3.596 ± 1.513
0.0MetMet: 0.0 ± 0.0
0.36MetAsn: 0.36 ± 0.672
1.079MetPro: 1.079 ± 1.122
0.36MetGln: 0.36 ± 1.227
1.798MetArg: 1.798 ± 0.952
1.079MetSer: 1.079 ± 0.915
0.36MetThr: 0.36 ± 0.195
1.079MetVal: 1.079 ± 0.586
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.955AsnAla: 3.955 ± 0.726
2.877AsnCys: 2.877 ± 1.361
2.157AsnAsp: 2.157 ± 1.356
2.517AsnGlu: 2.517 ± 0.994
2.877AsnPhe: 2.877 ± 1.564
2.157AsnGly: 2.157 ± 0.996
1.079AsnHis: 1.079 ± 0.586
1.798AsnIle: 1.798 ± 1.155
3.596AsnLys: 3.596 ± 1.526
5.034AsnLeu: 5.034 ± 1.76
0.719AsnMet: 0.719 ± 0.391
1.798AsnAsn: 1.798 ± 0.952
3.236AsnPro: 3.236 ± 1.197
0.0AsnGln: 0.0 ± 0.0
3.236AsnArg: 3.236 ± 1.98
2.157AsnSer: 2.157 ± 1.885
2.157AsnThr: 2.157 ± 1.14
1.798AsnVal: 1.798 ± 0.977
0.0AsnTrp: 0.0 ± 0.0
2.877AsnTyr: 2.877 ± 1.234
0.0AsnXaa: 0.0 ± 0.0
Pro
2.157ProAla: 2.157 ± 1.045
1.079ProCys: 1.079 ± 1.103
3.596ProAsp: 3.596 ± 1.565
3.596ProGlu: 3.596 ± 1.368
1.438ProPhe: 1.438 ± 1.09
1.798ProGly: 1.798 ± 0.763
1.438ProHis: 1.438 ± 1.984
2.877ProIle: 2.877 ± 1.378
2.877ProLys: 2.877 ± 1.804
3.236ProLeu: 3.236 ± 1.411
0.36ProMet: 0.36 ± 0.195
1.798ProAsn: 1.798 ± 0.763
2.877ProPro: 2.877 ± 2.236
0.719ProGln: 0.719 ± 0.391
2.157ProArg: 2.157 ± 1.14
2.157ProSer: 2.157 ± 0.872
3.236ProThr: 3.236 ± 1.774
2.157ProVal: 2.157 ± 0.841
1.438ProTrp: 1.438 ± 0.613
1.438ProTyr: 1.438 ± 0.76
0.0ProXaa: 0.0 ± 0.0
Gln
2.877GlnAla: 2.877 ± 3.185
0.0GlnCys: 0.0 ± 0.0
0.719GlnAsp: 0.719 ± 0.391
2.157GlnGlu: 2.157 ± 0.841
0.719GlnPhe: 0.719 ± 0.391
1.079GlnGly: 1.079 ± 0.586
1.079GlnHis: 1.079 ± 0.586
1.438GlnIle: 1.438 ± 1.399
0.719GlnLys: 0.719 ± 0.969
2.157GlnLeu: 2.157 ± 0.841
0.36GlnMet: 0.36 ± 0.195
0.719GlnAsn: 0.719 ± 0.391
1.438GlnPro: 1.438 ± 1.393
2.517GlnGln: 2.517 ± 3.932
2.157GlnArg: 2.157 ± 1.14
4.315GlnSer: 4.315 ± 1.499
1.079GlnThr: 1.079 ± 0.586
1.079GlnVal: 1.079 ± 1.183
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.236ArgAla: 3.236 ± 1.711
1.798ArgCys: 1.798 ± 3.77
3.236ArgAsp: 3.236 ± 2.378
1.798ArgGlu: 1.798 ± 0.709
5.394ArgPhe: 5.394 ± 1.456
2.877ArgGly: 2.877 ± 1.52
1.438ArgHis: 1.438 ± 0.689
2.157ArgIle: 2.157 ± 1.356
2.877ArgLys: 2.877 ± 0.774
3.955ArgLeu: 3.955 ± 1.734
0.719ArgMet: 0.719 ± 0.391
2.877ArgAsn: 2.877 ± 0.933
1.079ArgPro: 1.079 ± 0.57
1.438ArgGln: 1.438 ± 1.61
5.394ArgArg: 5.394 ± 3.454
6.832ArgSer: 6.832 ± 2.671
3.955ArgThr: 3.955 ± 1.364
3.955ArgVal: 3.955 ± 2.497
0.36ArgTrp: 0.36 ± 0.195
2.517ArgTyr: 2.517 ± 0.994
0.0ArgXaa: 0.0 ± 0.0
Ser
6.113SerAla: 6.113 ± 2.383
2.517SerCys: 2.517 ± 2.818
5.034SerAsp: 5.034 ± 1.457
2.877SerGlu: 2.877 ± 1.16
5.753SerPhe: 5.753 ± 3.067
5.753SerGly: 5.753 ± 1.533
1.079SerHis: 1.079 ± 1.312
4.315SerIle: 4.315 ± 1.358
5.753SerLys: 5.753 ± 2.077
7.192SerLeu: 7.192 ± 2.175
0.719SerMet: 0.719 ± 0.607
5.394SerAsn: 5.394 ± 4.184
3.236SerPro: 3.236 ± 1.055
1.798SerGln: 1.798 ± 0.977
5.034SerArg: 5.034 ± 3.409
5.394SerSer: 5.394 ± 3.152
2.877SerThr: 2.877 ± 1.475
5.034SerVal: 5.034 ± 1.799
0.36SerTrp: 0.36 ± 0.195
2.157SerTyr: 2.157 ± 1.14
0.0SerXaa: 0.0 ± 0.0
Thr
3.596ThrAla: 3.596 ± 1.611
1.079ThrCys: 1.079 ± 1.777
1.079ThrAsp: 1.079 ± 0.586
4.675ThrGlu: 4.675 ± 1.561
8.63ThrPhe: 8.63 ± 3.722
2.877ThrGly: 2.877 ± 1.234
2.157ThrHis: 2.157 ± 0.841
1.438ThrIle: 1.438 ± 0.848
1.798ThrLys: 1.798 ± 0.845
5.394ThrLeu: 5.394 ± 1.928
2.877ThrMet: 2.877 ± 1.194
1.438ThrAsn: 1.438 ± 0.782
3.236ThrPro: 3.236 ± 1.642
2.157ThrGln: 2.157 ± 0.841
2.517ThrArg: 2.517 ± 2.14
2.877ThrSer: 2.877 ± 1.772
4.315ThrThr: 4.315 ± 1.376
2.517ThrVal: 2.517 ± 1.63
0.0ThrTrp: 0.0 ± 0.0
1.079ThrTyr: 1.079 ± 0.586
0.0ThrXaa: 0.0 ± 0.0
Val
4.315ValAla: 4.315 ± 1.331
2.877ValCys: 2.877 ± 0.686
1.079ValAsp: 1.079 ± 0.586
3.236ValGlu: 3.236 ± 1.332
2.877ValPhe: 2.877 ± 1.215
4.315ValGly: 4.315 ± 3.489
0.719ValHis: 0.719 ± 1.493
4.315ValIle: 4.315 ± 1.768
2.517ValLys: 2.517 ± 1.368
5.753ValLeu: 5.753 ± 2.032
0.719ValMet: 0.719 ± 1.141
3.596ValAsn: 3.596 ± 1.471
2.877ValPro: 2.877 ± 1.475
1.798ValGln: 1.798 ± 0.977
4.675ValArg: 4.675 ± 2.089
2.877ValSer: 2.877 ± 1.226
3.236ValThr: 3.236 ± 1.261
2.157ValVal: 2.157 ± 1.493
0.719ValTrp: 0.719 ± 0.592
1.079ValTyr: 1.079 ± 0.586
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.719TrpCys: 0.719 ± 0.391
0.719TrpAsp: 0.719 ± 0.592
0.36TrpGlu: 0.36 ± 0.195
0.719TrpPhe: 0.719 ± 0.391
0.36TrpGly: 0.36 ± 0.195
0.36TrpHis: 0.36 ± 0.195
0.36TrpIle: 0.36 ± 0.195
0.0TrpLys: 0.0 ± 0.0
1.079TrpLeu: 1.079 ± 0.586
0.0TrpMet: 0.0 ± 0.0
0.719TrpAsn: 0.719 ± 1.344
0.36TrpPro: 0.36 ± 0.195
0.719TrpGln: 0.719 ± 0.592
0.36TrpArg: 0.36 ± 1.057
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.719TrpVal: 0.719 ± 0.391
0.0TrpTrp: 0.0 ± 0.0
0.719TrpTyr: 0.719 ± 0.391
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.517TyrAla: 2.517 ± 1.527
2.157TyrCys: 2.157 ± 1.004
1.079TyrAsp: 1.079 ± 0.666
1.079TyrGlu: 1.079 ± 0.586
2.157TyrPhe: 2.157 ± 0.875
2.517TyrGly: 2.517 ± 0.947
0.719TyrHis: 0.719 ± 0.391
2.157TyrIle: 2.157 ± 1.173
2.517TyrLys: 2.517 ± 0.926
3.596TyrLeu: 3.596 ± 1.278
1.079TyrMet: 1.079 ± 0.57
1.079TyrAsn: 1.079 ± 0.586
1.798TyrPro: 1.798 ± 0.977
0.36TyrGln: 0.36 ± 0.782
1.798TyrArg: 1.798 ± 1.723
2.157TyrSer: 2.157 ± 1.775
1.438TyrThr: 1.438 ± 0.689
1.079TyrVal: 1.079 ± 0.586
0.36TyrTrp: 0.36 ± 0.195
1.438TyrTyr: 1.438 ± 0.848
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2782 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski