Amino acid dipepetide frequency for American hop latent virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.966AlaAla: 4.966 ± 1.019
1.419AlaCys: 1.419 ± 1.035
4.612AlaAsp: 4.612 ± 1.874
4.612AlaGlu: 4.612 ± 1.007
2.838AlaPhe: 2.838 ± 1.189
3.547AlaGly: 3.547 ± 0.726
1.774AlaHis: 1.774 ± 0.583
4.612AlaIle: 4.612 ± 1.516
7.804AlaLys: 7.804 ± 3.138
6.385AlaLeu: 6.385 ± 1.953
1.774AlaMet: 1.774 ± 0.983
3.547AlaAsn: 3.547 ± 1.206
3.902AlaPro: 3.902 ± 1.922
1.774AlaGln: 1.774 ± 0.983
3.547AlaArg: 3.547 ± 1.364
6.74AlaSer: 6.74 ± 2.399
2.128AlaThr: 2.128 ± 0.97
6.031AlaVal: 6.031 ± 1.667
0.355AlaTrp: 0.355 ± 0.192
1.419AlaTyr: 1.419 ± 0.687
0.0AlaXaa: 0.0 ± 0.0
Cys
1.774CysAla: 1.774 ± 1.065
0.0CysCys: 0.0 ± 0.0
0.709CysAsp: 0.709 ± 1.276
1.774CysGlu: 1.774 ± 0.789
2.838CysPhe: 2.838 ± 0.93
2.483CysGly: 2.483 ± 1.016
0.355CysHis: 0.355 ± 0.885
2.838CysIle: 2.838 ± 0.886
2.483CysLys: 2.483 ± 0.96
2.483CysLeu: 2.483 ± 0.854
0.709CysMet: 0.709 ± 0.368
1.419CysAsn: 1.419 ± 0.716
0.709CysPro: 0.709 ± 1.307
0.709CysGln: 0.709 ± 0.765
2.128CysArg: 2.128 ± 1.349
1.064CysSer: 1.064 ± 0.575
1.064CysThr: 1.064 ± 0.709
3.193CysVal: 3.193 ± 1.393
0.0CysTrp: 0.0 ± 0.0
1.774CysTyr: 1.774 ± 0.932
0.0CysXaa: 0.0 ± 0.0
Asp
1.774AspAla: 1.774 ± 0.718
0.709AspCys: 0.709 ± 0.383
2.128AspAsp: 2.128 ± 1.15
5.321AspGlu: 5.321 ± 0.902
3.193AspPhe: 3.193 ± 1.281
3.193AspGly: 3.193 ± 0.607
0.709AspHis: 0.709 ± 0.522
4.966AspIle: 4.966 ± 1.498
1.774AspLys: 1.774 ± 0.603
4.612AspLeu: 4.612 ± 0.83
0.355AspMet: 0.355 ± 0.192
2.838AspAsn: 2.838 ± 1.039
2.128AspPro: 2.128 ± 1.409
1.064AspGln: 1.064 ± 0.575
2.128AspArg: 2.128 ± 0.48
4.612AspSer: 4.612 ± 0.901
1.064AspThr: 1.064 ± 0.479
3.547AspVal: 3.547 ± 0.726
1.064AspTrp: 1.064 ± 0.479
2.128AspTyr: 2.128 ± 0.736
0.0AspXaa: 0.0 ± 0.0
Glu
6.385GluAla: 6.385 ± 2.416
2.128GluCys: 2.128 ± 0.958
2.838GluAsp: 2.838 ± 1.499
6.74GluGlu: 6.74 ± 2.64
2.483GluPhe: 2.483 ± 0.907
3.193GluGly: 3.193 ± 1.064
3.193GluHis: 3.193 ± 1.436
4.257GluIle: 4.257 ± 3.727
6.031GluLys: 6.031 ± 1.77
6.031GluLeu: 6.031 ± 1.543
1.774GluMet: 1.774 ± 0.603
1.419GluAsn: 1.419 ± 0.767
3.193GluPro: 3.193 ± 0.585
2.128GluGln: 2.128 ± 0.931
5.676GluArg: 5.676 ± 2.421
3.547GluSer: 3.547 ± 1.155
2.483GluThr: 2.483 ± 1.342
6.031GluVal: 6.031 ± 1.734
0.709GluTrp: 0.709 ± 0.383
2.128GluTyr: 2.128 ± 0.873
0.0GluXaa: 0.0 ± 0.0
Phe
2.838PheAla: 2.838 ± 1.189
0.709PheCys: 0.709 ± 0.778
2.838PheAsp: 2.838 ± 1.411
4.257PheGlu: 4.257 ± 2.301
0.355PhePhe: 0.355 ± 0.192
3.547PheGly: 3.547 ± 2.924
1.064PheHis: 1.064 ± 0.709
2.483PheIle: 2.483 ± 2.394
3.193PheLys: 3.193 ± 2.126
6.385PheLeu: 6.385 ± 2.314
1.774PheMet: 1.774 ± 0.664
2.483PheAsn: 2.483 ± 0.778
1.064PhePro: 1.064 ± 0.675
1.419PheGln: 1.419 ± 0.767
2.483PheArg: 2.483 ± 0.851
4.966PheSer: 4.966 ± 1.691
4.257PheThr: 4.257 ± 1.208
2.483PheVal: 2.483 ± 0.907
0.355PheTrp: 0.355 ± 0.192
0.709PheTyr: 0.709 ± 0.759
0.0PheXaa: 0.0 ± 0.0
Gly
6.385GlyAla: 6.385 ± 1.618
2.838GlyCys: 2.838 ± 1.534
2.838GlyAsp: 2.838 ± 1.461
3.547GlyGlu: 3.547 ± 0.775
3.902GlyPhe: 3.902 ± 2.012
2.483GlyGly: 2.483 ± 3.112
0.355GlyHis: 0.355 ± 0.836
3.193GlyIle: 3.193 ± 1.328
5.321GlyLys: 5.321 ± 2.174
4.257GlyLeu: 4.257 ± 1.322
0.0GlyMet: 0.0 ± 0.0
3.193GlyAsn: 3.193 ± 0.939
1.774GlyPro: 1.774 ± 0.769
0.709GlyGln: 0.709 ± 0.778
3.547GlyArg: 3.547 ± 1.41
3.547GlySer: 3.547 ± 1.116
3.547GlyThr: 3.547 ± 1.721
4.966GlyVal: 4.966 ± 1.225
0.709GlyTrp: 0.709 ± 0.383
2.128GlyTyr: 2.128 ± 0.795
0.0GlyXaa: 0.0 ± 0.0
His
3.193HisAla: 3.193 ± 1.231
0.709HisCys: 0.709 ± 1.77
0.355HisAsp: 0.355 ± 0.192
1.419HisGlu: 1.419 ± 0.767
1.064HisPhe: 1.064 ± 0.709
0.355HisGly: 0.355 ± 0.848
0.709HisHis: 0.709 ± 0.383
1.419HisIle: 1.419 ± 0.51
1.419HisLys: 1.419 ± 0.51
2.838HisLeu: 2.838 ± 0.481
0.709HisMet: 0.709 ± 1.11
1.774HisAsn: 1.774 ± 0.603
0.709HisPro: 0.709 ± 0.522
1.774HisGln: 1.774 ± 1.372
0.709HisArg: 0.709 ± 0.383
2.128HisSer: 2.128 ± 2.278
0.355HisThr: 0.355 ± 0.885
1.064HisVal: 1.064 ± 1.685
0.0HisTrp: 0.0 ± 0.0
0.709HisTyr: 0.709 ± 0.383
0.0HisXaa: 0.0 ± 0.0
Ile
4.257IleAla: 4.257 ± 1.361
2.483IleCys: 2.483 ± 0.997
3.902IleAsp: 3.902 ± 1.669
6.031IleGlu: 6.031 ± 1.572
2.483IlePhe: 2.483 ± 0.85
4.257IleGly: 4.257 ± 2.219
1.064IleHis: 1.064 ± 0.575
2.483IleIle: 2.483 ± 0.953
4.966IleLys: 4.966 ± 1.642
4.966IleLeu: 4.966 ± 0.978
1.064IleMet: 1.064 ± 0.631
1.419IleAsn: 1.419 ± 0.767
1.774IlePro: 1.774 ± 0.789
1.064IleGln: 1.064 ± 1.134
2.128IleArg: 2.128 ± 2.41
3.547IleSer: 3.547 ± 0.975
1.774IleThr: 1.774 ± 0.718
4.257IleVal: 4.257 ± 1.405
0.0IleTrp: 0.0 ± 0.0
2.128IleTyr: 2.128 ± 0.795
0.0IleXaa: 0.0 ± 0.0
Lys
3.547LysAla: 3.547 ± 1.436
0.709LysCys: 0.709 ± 0.703
1.419LysAsp: 1.419 ± 0.51
6.031LysGlu: 6.031 ± 1.827
2.128LysPhe: 2.128 ± 0.795
6.031LysGly: 6.031 ± 2.22
2.128LysHis: 2.128 ± 0.863
1.774LysIle: 1.774 ± 0.959
3.902LysLys: 3.902 ± 2.109
8.514LysLeu: 8.514 ± 2.273
1.419LysMet: 1.419 ± 0.767
2.838LysAsn: 2.838 ± 0.704
3.547LysPro: 3.547 ± 0.991
1.774LysGln: 1.774 ± 0.959
4.966LysArg: 4.966 ± 1.516
2.838LysSer: 2.838 ± 0.878
4.612LysThr: 4.612 ± 1.581
3.902LysVal: 3.902 ± 1.175
0.709LysTrp: 0.709 ± 0.383
1.774LysTyr: 1.774 ± 0.932
0.0LysXaa: 0.0 ± 0.0
Leu
8.868LeuAla: 8.868 ± 2.445
2.838LeuCys: 2.838 ± 2.048
3.902LeuAsp: 3.902 ± 1.474
4.966LeuGlu: 4.966 ± 1.414
4.257LeuPhe: 4.257 ± 1.478
4.612LeuGly: 4.612 ± 1.125
1.774LeuHis: 1.774 ± 1.15
5.676LeuIle: 5.676 ± 1.667
6.031LeuLys: 6.031 ± 2.775
9.933LeuLeu: 9.933 ± 4.072
1.064LeuMet: 1.064 ± 0.575
4.257LeuAsn: 4.257 ± 0.69
5.676LeuPro: 5.676 ± 1.7
3.547LeuGln: 3.547 ± 1.435
6.385LeuArg: 6.385 ± 0.749
7.804LeuSer: 7.804 ± 4.096
3.547LeuThr: 3.547 ± 0.915
9.223LeuVal: 9.223 ± 2.126
1.064LeuTrp: 1.064 ± 0.675
1.774LeuTyr: 1.774 ± 0.983
0.0LeuXaa: 0.0 ± 0.0
Met
2.838MetAla: 2.838 ± 1.057
0.709MetCys: 0.709 ± 0.383
1.419MetAsp: 1.419 ± 0.51
0.709MetGlu: 0.709 ± 0.778
0.355MetPhe: 0.355 ± 0.192
1.064MetGly: 1.064 ± 0.479
0.709MetHis: 0.709 ± 0.383
0.709MetIle: 0.709 ± 0.383
0.355MetLys: 0.355 ± 0.779
2.483MetLeu: 2.483 ± 0.97
0.0MetMet: 0.0 ± 0.0
0.355MetAsn: 0.355 ± 0.192
1.419MetPro: 1.419 ± 1.065
0.709MetGln: 0.709 ± 0.383
1.064MetArg: 1.064 ± 0.575
2.483MetSer: 2.483 ± 1.814
0.709MetThr: 0.709 ± 0.522
0.709MetVal: 0.709 ± 0.522
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.547AsnAla: 3.547 ± 0.915
1.419AsnCys: 1.419 ± 0.699
3.193AsnAsp: 3.193 ± 1.231
1.419AsnGlu: 1.419 ± 0.767
2.483AsnPhe: 2.483 ± 0.907
3.902AsnGly: 3.902 ± 1.272
0.709AsnHis: 0.709 ± 0.759
1.774AsnIle: 1.774 ± 0.773
3.193AsnLys: 3.193 ± 1.418
4.257AsnLeu: 4.257 ± 1.413
1.419AsnMet: 1.419 ± 1.043
1.064AsnAsn: 1.064 ± 0.675
0.709AsnPro: 0.709 ± 1.247
0.355AsnGln: 0.355 ± 0.192
3.193AsnArg: 3.193 ± 1.725
1.774AsnSer: 1.774 ± 2.157
2.838AsnThr: 2.838 ± 1.019
3.193AsnVal: 3.193 ± 0.585
0.709AsnTrp: 0.709 ± 0.383
2.128AsnTyr: 2.128 ± 0.807
0.0AsnXaa: 0.0 ± 0.0
Pro
2.838ProAla: 2.838 ± 2.087
1.774ProCys: 1.774 ± 0.769
3.547ProAsp: 3.547 ± 1.166
3.902ProGlu: 3.902 ± 1.237
0.355ProPhe: 0.355 ± 0.192
2.483ProGly: 2.483 ± 1.66
2.128ProHis: 2.128 ± 1.64
1.064ProIle: 1.064 ± 0.883
0.709ProLys: 0.709 ± 0.383
4.966ProLeu: 4.966 ± 2.181
0.0ProMet: 0.0 ± 0.0
1.774ProAsn: 1.774 ± 0.852
3.902ProPro: 3.902 ± 3.087
1.419ProGln: 1.419 ± 0.699
3.902ProArg: 3.902 ± 2.837
2.483ProSer: 2.483 ± 2.077
1.774ProThr: 1.774 ± 1.651
3.902ProVal: 3.902 ± 0.615
0.709ProTrp: 0.709 ± 0.778
2.128ProTyr: 2.128 ± 0.736
0.0ProXaa: 0.0 ± 0.0
Gln
1.419GlnAla: 1.419 ± 0.51
1.419GlnCys: 1.419 ± 0.761
1.419GlnAsp: 1.419 ± 0.51
1.419GlnGlu: 1.419 ± 0.51
0.709GlnPhe: 0.709 ± 0.383
1.064GlnGly: 1.064 ± 0.575
0.0GlnHis: 0.0 ± 0.0
2.838GlnIle: 2.838 ± 1.189
1.064GlnLys: 1.064 ± 0.709
3.193GlnLeu: 3.193 ± 0.8
1.064GlnMet: 1.064 ± 0.479
0.709GlnAsn: 0.709 ± 0.383
1.774GlnPro: 1.774 ± 0.789
0.709GlnGln: 0.709 ± 0.383
1.774GlnArg: 1.774 ± 0.603
2.838GlnSer: 2.838 ± 2.5
1.419GlnThr: 1.419 ± 0.767
0.355GlnVal: 0.355 ± 0.192
0.0GlnTrp: 0.0 ± 0.0
0.355GlnTyr: 0.355 ± 0.192
0.0GlnXaa: 0.0 ± 0.0
Arg
5.676ArgAla: 5.676 ± 1.664
1.419ArgCys: 1.419 ± 1.406
2.838ArgAsp: 2.838 ± 0.714
5.321ArgGlu: 5.321 ± 1.119
4.612ArgPhe: 4.612 ± 0.961
2.483ArgGly: 2.483 ± 0.441
1.774ArgHis: 1.774 ± 1.15
3.193ArgIle: 3.193 ± 1.231
3.902ArgLys: 3.902 ± 1.237
5.321ArgLeu: 5.321 ± 3.049
1.064ArgMet: 1.064 ± 0.479
2.838ArgAsn: 2.838 ± 0.84
2.128ArgPro: 2.128 ± 1.31
0.355ArgGln: 0.355 ± 0.192
3.902ArgArg: 3.902 ± 2.785
3.547ArgSer: 3.547 ± 1.544
2.838ArgThr: 2.838 ± 0.85
3.902ArgVal: 3.902 ± 1.172
1.419ArgTrp: 1.419 ± 0.767
3.902ArgTyr: 3.902 ± 1.332
0.0ArgXaa: 0.0 ± 0.0
Ser
3.902SerAla: 3.902 ± 1.591
2.128SerCys: 2.128 ± 1.287
5.676SerAsp: 5.676 ± 1.502
3.547SerGlu: 3.547 ± 0.612
4.612SerPhe: 4.612 ± 0.816
4.966SerGly: 4.966 ± 1.019
1.419SerHis: 1.419 ± 1.519
2.483SerIle: 2.483 ± 0.441
6.031SerLys: 6.031 ± 1.81
4.257SerLeu: 4.257 ± 2.584
0.0SerMet: 0.0 ± 0.0
2.128SerAsn: 2.128 ± 0.48
2.128SerPro: 2.128 ± 1.247
2.128SerGln: 2.128 ± 1.014
4.612SerArg: 4.612 ± 1.561
4.612SerSer: 4.612 ± 3.252
3.902SerThr: 3.902 ± 1.051
4.966SerVal: 4.966 ± 3.773
0.0SerTrp: 0.0 ± 0.0
4.257SerTyr: 4.257 ± 1.203
0.0SerXaa: 0.0 ± 0.0
Thr
2.128ThrAla: 2.128 ± 1.409
1.419ThrCys: 1.419 ± 1.024
0.709ThrAsp: 0.709 ± 0.522
3.547ThrGlu: 3.547 ± 1.56
6.031ThrPhe: 6.031 ± 2.421
2.838ThrGly: 2.838 ± 0.85
1.774ThrHis: 1.774 ± 0.603
2.128ThrIle: 2.128 ± 1.15
2.838ThrLys: 2.838 ± 0.822
4.257ThrLeu: 4.257 ± 1.427
1.419ThrMet: 1.419 ± 0.767
2.838ThrAsn: 2.838 ± 1.491
3.902ThrPro: 3.902 ± 3.561
1.064ThrGln: 1.064 ± 0.575
2.128ThrArg: 2.128 ± 1.855
2.838ThrSer: 2.838 ± 1.694
1.419ThrThr: 1.419 ± 0.935
1.064ThrVal: 1.064 ± 0.575
0.355ThrTrp: 0.355 ± 0.192
1.774ThrTyr: 1.774 ± 0.773
0.0ThrXaa: 0.0 ± 0.0
Val
3.193ValAla: 3.193 ± 1.725
4.612ValCys: 4.612 ± 1.371
1.774ValAsp: 1.774 ± 1.476
5.321ValGlu: 5.321 ± 1.694
3.547ValPhe: 3.547 ± 2.162
4.257ValGly: 4.257 ± 1.478
1.774ValHis: 1.774 ± 0.959
4.966ValIle: 4.966 ± 1.225
1.774ValLys: 1.774 ± 1.372
7.804ValLeu: 7.804 ± 1.751
1.064ValMet: 1.064 ± 0.675
4.612ValAsn: 4.612 ± 1.529
3.547ValPro: 3.547 ± 1.175
1.419ValGln: 1.419 ± 0.767
3.902ValArg: 3.902 ± 0.886
4.612ValSer: 4.612 ± 0.83
3.902ValThr: 3.902 ± 1.393
3.902ValVal: 3.902 ± 3.529
1.064ValTrp: 1.064 ± 1.047
2.128ValTyr: 2.128 ± 0.931
0.0ValXaa: 0.0 ± 0.0
Trp
0.709TrpAla: 0.709 ± 0.759
1.064TrpCys: 1.064 ± 0.575
0.355TrpAsp: 0.355 ± 0.192
0.355TrpGlu: 0.355 ± 0.192
0.709TrpPhe: 0.709 ± 0.383
0.709TrpGly: 0.709 ± 0.703
0.0TrpHis: 0.0 ± 0.0
0.709TrpIle: 0.709 ± 0.383
0.0TrpLys: 0.0 ± 0.0
1.064TrpLeu: 1.064 ± 0.575
0.0TrpMet: 0.0 ± 0.0
0.709TrpAsn: 0.709 ± 0.522
0.355TrpPro: 0.355 ± 0.192
0.709TrpGln: 0.709 ± 0.522
0.355TrpArg: 0.355 ± 0.192
0.355TrpSer: 0.355 ± 0.885
0.0TrpThr: 0.0 ± 0.0
0.709TrpVal: 0.709 ± 0.383
0.0TrpTrp: 0.0 ± 0.0
0.709TrpTyr: 0.709 ± 0.383
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.547TyrAla: 3.547 ± 1.544
0.0TyrCys: 0.0 ± 0.0
2.838TyrAsp: 2.838 ± 1.534
2.128TyrGlu: 2.128 ± 0.993
0.709TyrPhe: 0.709 ± 0.383
1.774TyrGly: 1.774 ± 0.769
0.0TyrHis: 0.0 ± 0.0
2.838TyrIle: 2.838 ± 0.704
1.774TyrLys: 1.774 ± 0.959
3.193TyrLeu: 3.193 ± 1.359
1.774TyrMet: 1.774 ± 0.983
1.064TyrAsn: 1.064 ± 0.709
1.419TyrPro: 1.419 ± 0.767
0.709TyrGln: 0.709 ± 0.778
3.902TyrArg: 3.902 ± 1.521
1.419TyrSer: 1.419 ± 0.51
2.838TyrThr: 2.838 ± 1.467
1.774TyrVal: 1.774 ± 0.773
0.355TyrTrp: 0.355 ± 0.192
1.064TyrTyr: 1.064 ± 0.479
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2820 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski