Amino acid dipepetide frequency for Humulus japonicus latent virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.551AlaAla: 4.551 ± 1.783
1.241AlaCys: 1.241 ± 0.462
2.069AlaAsp: 2.069 ± 1.491
5.792AlaGlu: 5.792 ± 1.049
2.482AlaPhe: 2.482 ± 1.669
4.965AlaGly: 4.965 ± 1.836
2.069AlaHis: 2.069 ± 1.18
6.62AlaIle: 6.62 ± 1.981
4.137AlaLys: 4.137 ± 1.057
7.861AlaLeu: 7.861 ± 1.259
2.482AlaMet: 2.482 ± 0.93
4.551AlaAsn: 4.551 ± 1.151
1.655AlaPro: 1.655 ± 0.497
2.069AlaGln: 2.069 ± 2.332
3.31AlaArg: 3.31 ± 2.052
6.206AlaSer: 6.206 ± 1.86
4.551AlaThr: 4.551 ± 2.054
5.792AlaVal: 5.792 ± 1.326
2.482AlaTrp: 2.482 ± 0.82
2.069AlaTyr: 2.069 ± 1.012
0.0AlaXaa: 0.0 ± 0.0
Cys
1.241CysAla: 1.241 ± 0.834
1.655CysCys: 1.655 ± 0.889
1.655CysAsp: 1.655 ± 0.616
0.827CysGlu: 0.827 ± 0.273
2.482CysPhe: 2.482 ± 1.303
0.827CysGly: 0.827 ± 0.786
0.827CysHis: 0.827 ± 0.273
0.0CysIle: 0.0 ± 0.0
0.414CysLys: 0.414 ± 0.298
2.069CysLeu: 2.069 ± 1.368
0.827CysMet: 0.827 ± 0.51
1.241CysAsn: 1.241 ± 1.06
1.655CysPro: 1.655 ± 0.616
0.414CysGln: 0.414 ± 0.298
1.655CysArg: 1.655 ± 0.728
3.724CysSer: 3.724 ± 1.305
1.655CysThr: 1.655 ± 0.684
1.655CysVal: 1.655 ± 0.604
0.414CysTrp: 0.414 ± 0.847
0.414CysTyr: 0.414 ± 0.559
0.0CysXaa: 0.0 ± 0.0
Asp
8.688AspAla: 8.688 ± 1.506
1.241AspCys: 1.241 ± 0.462
4.551AspAsp: 4.551 ± 1.483
2.482AspGlu: 2.482 ± 0.924
4.137AspPhe: 4.137 ± 1.001
3.724AspGly: 3.724 ± 0.118
2.482AspHis: 2.482 ± 1.547
4.137AspIle: 4.137 ± 1.298
6.206AspLys: 6.206 ± 2.024
3.724AspLeu: 3.724 ± 0.118
2.896AspMet: 2.896 ± 1.443
1.241AspAsn: 1.241 ± 0.907
2.069AspPro: 2.069 ± 1.355
2.069AspGln: 2.069 ± 0.718
2.069AspArg: 2.069 ± 1.012
2.896AspSer: 2.896 ± 0.963
1.655AspThr: 1.655 ± 1.084
3.724AspVal: 3.724 ± 1.051
0.827AspTrp: 0.827 ± 0.273
1.655AspTyr: 1.655 ± 0.684
0.0AspXaa: 0.0 ± 0.0
Glu
4.965GluAla: 4.965 ± 1.39
2.482GluCys: 2.482 ± 1.31
4.137GluAsp: 4.137 ± 1.298
5.792GluGlu: 5.792 ± 0.727
2.482GluPhe: 2.482 ± 0.82
2.896GluGly: 2.896 ± 1.041
0.414GluHis: 0.414 ± 0.298
3.724GluIle: 3.724 ± 1.218
7.861GluLys: 7.861 ± 1.591
4.965GluLeu: 4.965 ± 1.84
1.655GluMet: 1.655 ± 0.546
1.655GluAsn: 1.655 ± 0.853
1.241GluPro: 1.241 ± 0.834
1.655GluGln: 1.655 ± 1.193
3.724GluArg: 3.724 ± 1.108
3.724GluSer: 3.724 ± 1.692
2.482GluThr: 2.482 ± 0.928
5.379GluVal: 5.379 ± 1.234
1.241GluTrp: 1.241 ± 0.536
2.069GluTyr: 2.069 ± 1.012
0.0GluXaa: 0.0 ± 0.0
Phe
3.31PheAla: 3.31 ± 1.233
2.069PheCys: 2.069 ± 0.713
2.482PheAsp: 2.482 ± 0.654
3.724PheGlu: 3.724 ± 1.167
2.896PhePhe: 2.896 ± 0.89
0.414PheGly: 0.414 ± 0.298
0.414PheHis: 0.414 ± 0.337
5.792PheIle: 5.792 ± 1.5
3.724PheLys: 3.724 ± 1.305
4.137PheLeu: 4.137 ± 0.757
0.827PheMet: 0.827 ± 0.596
3.31PheAsn: 3.31 ± 1.093
2.896PhePro: 2.896 ± 0.552
3.31PheGln: 3.31 ± 1.797
1.655PheArg: 1.655 ± 0.497
4.137PheSer: 4.137 ± 0.802
0.0PheThr: 0.0 ± 0.0
1.655PheVal: 1.655 ± 0.889
0.827PheTrp: 0.827 ± 0.81
0.827PheTyr: 0.827 ± 0.786
0.0PheXaa: 0.0 ± 0.0
Gly
1.655GlyAla: 1.655 ± 0.889
1.655GlyCys: 1.655 ± 0.546
3.724GlyAsp: 3.724 ± 1.195
4.137GlyGlu: 4.137 ± 1.125
3.31GlyPhe: 3.31 ± 0.594
2.069GlyGly: 2.069 ± 1.407
0.414GlyHis: 0.414 ± 0.298
2.069GlyIle: 2.069 ± 1.407
2.896GlyLys: 2.896 ± 0.78
3.31GlyLeu: 3.31 ± 1.236
0.414GlyMet: 0.414 ± 0.298
2.069GlyAsn: 2.069 ± 0.62
2.069GlyPro: 2.069 ± 0.856
0.414GlyGln: 0.414 ± 0.847
2.896GlyArg: 2.896 ± 1.183
2.482GlySer: 2.482 ± 1.682
1.241GlyThr: 1.241 ± 0.655
5.379GlyVal: 5.379 ± 0.973
0.414GlyTrp: 0.414 ± 0.298
1.241GlyTyr: 1.241 ± 0.655
0.0GlyXaa: 0.0 ± 0.0
His
3.31HisAla: 3.31 ± 1.022
0.827HisCys: 0.827 ± 0.273
0.414HisAsp: 0.414 ± 0.559
1.241HisGlu: 1.241 ± 0.462
1.655HisPhe: 1.655 ± 1.193
0.414HisGly: 0.414 ± 0.337
1.241HisHis: 1.241 ± 1.011
2.482HisIle: 2.482 ± 1.09
1.655HisLys: 1.655 ± 0.991
0.414HisLeu: 0.414 ± 0.337
1.241HisMet: 1.241 ± 0.655
0.414HisAsn: 0.414 ± 0.298
0.414HisPro: 0.414 ± 0.337
0.827HisGln: 0.827 ± 0.602
0.414HisArg: 0.414 ± 0.298
1.655HisSer: 1.655 ± 0.546
1.241HisThr: 1.241 ± 0.462
3.31HisVal: 3.31 ± 1.7
0.414HisTrp: 0.414 ± 0.337
1.241HisTyr: 1.241 ± 0.907
0.0HisXaa: 0.0 ± 0.0
Ile
3.31IleAla: 3.31 ± 1.457
1.241IleCys: 1.241 ± 0.536
2.482IleAsp: 2.482 ± 1.159
4.965IleGlu: 4.965 ± 1.433
2.069IlePhe: 2.069 ± 0.517
2.069IleGly: 2.069 ± 0.517
1.655IleHis: 1.655 ± 0.497
3.724IleIle: 3.724 ± 0.954
4.965IleLys: 4.965 ± 1.342
4.137IleLeu: 4.137 ± 1.44
1.655IleMet: 1.655 ± 0.736
3.31IleAsn: 3.31 ± 0.734
7.034IlePro: 7.034 ± 1.563
1.655IleGln: 1.655 ± 1.263
0.827IleArg: 0.827 ± 0.273
4.551IleSer: 4.551 ± 1.167
2.896IleThr: 2.896 ± 1.276
4.137IleVal: 4.137 ± 1.298
0.414IleTrp: 0.414 ± 0.298
1.655IleTyr: 1.655 ± 0.874
0.0IleXaa: 0.0 ± 0.0
Lys
5.792LysAla: 5.792 ± 1.549
0.827LysCys: 0.827 ± 0.542
6.206LysAsp: 6.206 ± 1.552
3.31LysGlu: 3.31 ± 1.41
4.965LysPhe: 4.965 ± 1.843
2.482LysGly: 2.482 ± 0.54
1.241LysHis: 1.241 ± 0.536
2.896LysIle: 2.896 ± 1.384
5.379LysLys: 5.379 ± 1.372
8.688LysLeu: 8.688 ± 1.087
1.241LysMet: 1.241 ± 0.806
4.137LysAsn: 4.137 ± 1.505
3.31LysPro: 3.31 ± 1.484
1.241LysGln: 1.241 ± 0.464
5.379LysArg: 5.379 ± 3.087
4.551LysSer: 4.551 ± 1.722
2.482LysThr: 2.482 ± 1.413
4.551LysVal: 4.551 ± 1.396
0.414LysTrp: 0.414 ± 0.337
2.482LysTyr: 2.482 ± 1.072
0.0LysXaa: 0.0 ± 0.0
Leu
7.034LeuAla: 7.034 ± 1.693
3.31LeuCys: 3.31 ± 1.638
8.275LeuAsp: 8.275 ± 1.106
4.137LeuGlu: 4.137 ± 0.829
3.724LeuPhe: 3.724 ± 0.493
3.31LeuGly: 3.31 ± 0.734
0.414LeuHis: 0.414 ± 0.337
2.482LeuIle: 2.482 ± 0.82
7.034LeuLys: 7.034 ± 1.846
6.62LeuLeu: 6.62 ± 0.581
2.896LeuMet: 2.896 ± 0.995
5.792LeuAsn: 5.792 ± 1.604
3.31LeuPro: 3.31 ± 0.778
3.31LeuGln: 3.31 ± 1.022
4.965LeuArg: 4.965 ± 0.911
7.861LeuSer: 7.861 ± 0.181
4.551LeuThr: 4.551 ± 1.783
5.379LeuVal: 5.379 ± 1.036
0.414LeuTrp: 0.414 ± 0.559
2.896LeuTyr: 2.896 ± 1.321
0.0LeuXaa: 0.0 ± 0.0
Met
3.724MetAla: 3.724 ± 0.667
0.414MetCys: 0.414 ± 0.337
1.655MetAsp: 1.655 ± 0.853
1.241MetGlu: 1.241 ± 0.799
0.0MetPhe: 0.0 ± 0.0
0.414MetGly: 0.414 ± 0.847
0.827MetHis: 0.827 ± 0.81
1.241MetIle: 1.241 ± 0.936
2.069MetLys: 2.069 ± 0.86
2.482MetLeu: 2.482 ± 1.072
0.827MetMet: 0.827 ± 0.674
2.482MetAsn: 2.482 ± 0.591
2.482MetPro: 2.482 ± 1.174
1.241MetGln: 1.241 ± 1.06
2.069MetArg: 2.069 ± 1.012
1.241MetSer: 1.241 ± 0.907
1.655MetThr: 1.655 ± 1.193
1.241MetVal: 1.241 ± 0.464
0.414MetTrp: 0.414 ± 0.298
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
5.792AsnAla: 5.792 ± 1.947
1.241AsnCys: 1.241 ± 0.834
1.655AsnAsp: 1.655 ± 1.193
0.414AsnGlu: 0.414 ± 0.337
1.241AsnPhe: 1.241 ± 0.536
1.241AsnGly: 1.241 ± 0.462
0.827AsnHis: 0.827 ± 0.786
1.241AsnIle: 1.241 ± 0.895
3.724AsnLys: 3.724 ± 1.147
2.896AsnLeu: 2.896 ± 1.474
2.069AsnMet: 2.069 ± 1.684
1.655AsnAsn: 1.655 ± 1.069
3.724AsnPro: 3.724 ± 1.108
1.241AsnGln: 1.241 ± 1.011
3.724AsnArg: 3.724 ± 1.065
4.551AsnSer: 4.551 ± 1.167
2.482AsnThr: 2.482 ± 1.449
4.551AsnVal: 4.551 ± 1.494
0.414AsnTrp: 0.414 ± 0.298
2.482AsnTyr: 2.482 ± 0.54
0.0AsnXaa: 0.0 ± 0.0
Pro
1.655ProAla: 1.655 ± 0.728
1.655ProCys: 1.655 ± 0.684
4.551ProAsp: 4.551 ± 1.221
4.551ProGlu: 4.551 ± 0.576
0.827ProPhe: 0.827 ± 0.81
2.482ProGly: 2.482 ± 1.398
1.241ProHis: 1.241 ± 0.536
3.31ProIle: 3.31 ± 1.207
2.069ProLys: 2.069 ± 0.539
6.62ProLeu: 6.62 ± 1.328
2.896ProMet: 2.896 ± 0.995
1.655ProAsn: 1.655 ± 0.684
2.069ProPro: 2.069 ± 1.443
0.414ProGln: 0.414 ± 0.337
2.069ProArg: 2.069 ± 0.718
2.069ProSer: 2.069 ± 1.659
2.896ProThr: 2.896 ± 1.134
2.069ProVal: 2.069 ± 1.18
0.0ProTrp: 0.0 ± 0.0
2.069ProTyr: 2.069 ± 0.86
0.0ProXaa: 0.0 ± 0.0
Gln
2.896GlnAla: 2.896 ± 0.78
0.827GlnCys: 0.827 ± 0.602
0.827GlnAsp: 0.827 ± 0.542
0.827GlnGlu: 0.827 ± 0.273
0.827GlnPhe: 0.827 ± 0.81
1.655GlnGly: 1.655 ± 1.312
0.414GlnHis: 0.414 ± 0.337
2.069GlnIle: 2.069 ± 0.539
2.896GlnLys: 2.896 ± 1.296
2.896GlnLeu: 2.896 ± 1.428
0.827GlnMet: 0.827 ± 0.273
0.827GlnAsn: 0.827 ± 0.273
0.827GlnPro: 0.827 ± 0.674
2.069GlnGln: 2.069 ± 0.698
2.069GlnArg: 2.069 ± 1.368
2.482GlnSer: 2.482 ± 0.591
1.241GlnThr: 1.241 ± 0.464
1.655GlnVal: 1.655 ± 0.604
0.414GlnTrp: 0.414 ± 0.337
1.241GlnTyr: 1.241 ± 0.834
0.0GlnXaa: 0.0 ± 0.0
Arg
3.724ArgAla: 3.724 ± 0.739
2.069ArgCys: 2.069 ± 0.675
2.482ArgAsp: 2.482 ± 0.924
5.379ArgGlu: 5.379 ± 2.264
2.896ArgPhe: 2.896 ± 0.686
3.31ArgGly: 3.31 ± 0.594
3.724ArgHis: 3.724 ± 0.906
3.31ArgIle: 3.31 ± 0.442
2.069ArgLys: 2.069 ± 3.287
4.137ArgLeu: 4.137 ± 1.301
0.827ArgMet: 0.827 ± 0.596
3.724ArgAsn: 3.724 ± 1.387
2.069ArgPro: 2.069 ± 0.951
0.414ArgGln: 0.414 ± 0.337
3.31ArgArg: 3.31 ± 0.909
4.551ArgSer: 4.551 ± 1.569
3.724ArgThr: 3.724 ± 1.299
3.724ArgVal: 3.724 ± 1.299
0.414ArgTrp: 0.414 ± 0.559
0.414ArgTyr: 0.414 ± 0.559
0.0ArgXaa: 0.0 ± 0.0
Ser
3.31SerAla: 3.31 ± 2.146
1.655SerCys: 1.655 ± 0.976
4.965SerAsp: 4.965 ± 2.797
3.31SerGlu: 3.31 ± 1.093
4.551SerPhe: 4.551 ± 1.395
4.137SerGly: 4.137 ± 0.802
2.482SerHis: 2.482 ± 1.414
6.206SerIle: 6.206 ± 1.886
3.31SerLys: 3.31 ± 1.638
5.792SerLeu: 5.792 ± 2.145
0.414SerMet: 0.414 ± 0.298
3.724SerAsn: 3.724 ± 0.762
2.896SerPro: 2.896 ± 1.236
1.655SerGln: 1.655 ± 0.684
4.137SerArg: 4.137 ± 1.396
3.724SerSer: 3.724 ± 1.542
4.137SerThr: 4.137 ± 1.298
6.206SerVal: 6.206 ± 1.062
0.414SerTrp: 0.414 ± 0.337
1.655SerTyr: 1.655 ± 0.546
0.0SerXaa: 0.0 ± 0.0
Thr
1.655ThrAla: 1.655 ± 1.263
0.827ThrCys: 0.827 ± 0.596
2.482ThrAsp: 2.482 ± 1.003
3.724ThrGlu: 3.724 ± 2.142
1.241ThrPhe: 1.241 ± 0.834
3.31ThrGly: 3.31 ± 1.367
1.241ThrHis: 1.241 ± 0.655
1.241ThrIle: 1.241 ± 0.907
3.31ThrLys: 3.31 ± 1.849
5.379ThrLeu: 5.379 ± 0.757
0.827ThrMet: 0.827 ± 0.273
0.414ThrAsn: 0.414 ± 0.298
2.069ThrPro: 2.069 ± 1.178
1.241ThrGln: 1.241 ± 0.536
4.965ThrArg: 4.965 ± 2.35
3.724ThrSer: 3.724 ± 1.147
2.896ThrThr: 2.896 ± 0.893
2.896ThrVal: 2.896 ± 1.548
0.0ThrTrp: 0.0 ± 0.0
2.069ThrTyr: 2.069 ± 0.781
0.0ThrXaa: 0.0 ± 0.0
Val
4.965ValAla: 4.965 ± 1.639
0.827ValCys: 0.827 ± 0.273
6.206ValAsp: 6.206 ± 1.779
4.137ValGlu: 4.137 ± 0.757
4.551ValPhe: 4.551 ± 1.573
2.482ValGly: 2.482 ± 1.31
1.655ValHis: 1.655 ± 1.084
4.137ValIle: 4.137 ± 1.126
5.792ValLys: 5.792 ± 1.155
8.275ValLeu: 8.275 ± 2.378
1.655ValMet: 1.655 ± 1.312
3.724ValAsn: 3.724 ± 0.679
4.965ValPro: 4.965 ± 1.765
1.655ValGln: 1.655 ± 0.728
5.379ValArg: 5.379 ± 0.712
3.31ValSer: 3.31 ± 1.528
2.069ValThr: 2.069 ± 0.856
4.137ValVal: 4.137 ± 0.757
0.414ValTrp: 0.414 ± 0.559
2.482ValTyr: 2.482 ± 0.82
0.0ValXaa: 0.0 ± 0.0
Trp
1.655TrpAla: 1.655 ± 0.684
0.0TrpCys: 0.0 ± 0.0
0.414TrpAsp: 0.414 ± 0.559
2.069TrpGlu: 2.069 ± 0.781
0.414TrpPhe: 0.414 ± 0.337
0.0TrpGly: 0.0 ± 0.0
0.414TrpHis: 0.414 ± 0.337
0.0TrpIle: 0.0 ± 0.0
0.827TrpLys: 0.827 ± 0.273
0.0TrpLeu: 0.0 ± 0.0
0.414TrpMet: 0.414 ± 0.298
0.0TrpAsn: 0.0 ± 0.0
0.414TrpPro: 0.414 ± 0.559
0.0TrpGln: 0.0 ± 0.0
0.414TrpArg: 0.414 ± 0.337
0.414TrpSer: 0.414 ± 0.298
0.0TrpThr: 0.0 ± 0.0
2.482TrpVal: 2.482 ± 0.54
0.0TrpTrp: 0.0 ± 0.0
0.414TrpTyr: 0.414 ± 0.559
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.724TyrAla: 3.724 ± 1.478
0.0TyrCys: 0.0 ± 0.0
1.241TyrAsp: 1.241 ± 0.462
2.482TyrGlu: 2.482 ± 0.718
1.655TyrPhe: 1.655 ± 1.069
1.655TyrGly: 1.655 ± 0.728
0.827TyrHis: 0.827 ± 0.674
2.069TyrIle: 2.069 ± 1.012
1.241TyrLys: 1.241 ± 0.673
3.31TyrLeu: 3.31 ± 1.093
0.414TyrMet: 0.414 ± 0.847
1.241TyrAsn: 1.241 ± 0.655
0.0TyrPro: 0.0 ± 0.0
2.896TyrGln: 2.896 ± 0.89
1.241TyrArg: 1.241 ± 0.462
0.827TyrSer: 0.827 ± 0.786
1.655TyrThr: 1.655 ± 0.616
2.896TyrVal: 2.896 ± 0.778
0.0TyrTrp: 0.0 ± 0.0
0.414TyrTyr: 0.414 ± 0.298
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2418 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski