Amino acid dipeptide frequency for Pea enation mosaic virus-2 (PEMV-2)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
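As an illustration of what a dipeptide frequency is, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalise by the total number of pairs are assumptions made for this example; the page does not state how Proteome-pI normalises its values. One-letter codes are used for brevity, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Assumption: frequencies are normalised by the total number of
    overlapping pairs; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not PEMV-2 proteins).
toy = ["MKVLA", "AVLA"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

For the toy input there are 7 pairs in total, and both `VL` and `LA` occur twice, so each gets 2/7 of the total.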

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
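The expected value under the uniform assumption, and a comparison against it, can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the exact rule the page uses for red/blue colouring is not stated; a plain comparison with the uniform expectation is assumed here.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: simple greater/less comparison, with no tolerance band.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille(20))   # 2.5 (i.e. 0.25%)
print(classify(10.965))         # ProArg value from the table
print(classify(0.731))          # AlaHis value from the table
```

With 21 symbols (including Xaa) the expectation drops to 1000/441, roughly 2.27‰.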

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± bootstrap error). Rows give the first residue, columns the second residue.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.386 ± 2.65 | 1.462 ± 0.713 | 2.193 ± 1.398 | 5.117 ± 2.822 | 1.462 ± 1.132 | 2.924 ± 1.275 | 0.731 ± 0.759 | 2.924 ± 0.764 | 5.117 ± 2.289 | 8.772 ± 3.034 | 2.924 ± 2.413 | 4.386 ± 1.702 | 4.386 ± 1.362 | 5.117 ± 2.556 | 9.503 ± 3.243 | 5.117 ± 2.027 | 1.462 ± 1.001 | 10.234 ± 2.126 | 0.731 ± 0.759 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Cys | 0.731 ± 0.464 | 0.731 ± 0.464 | 0.731 ± 0.759 | 0.0 ± 0.0 | 0.731 ± 0.464 | 2.193 ± 0.934 | 3.655 ± 1.682 | 2.924 ± 0.764 | 2.193 ± 1.398 | 0.0 ± 0.245 | 0.731 ± 0.345 | 0.731 ± 0.759 | 0.0 ± 0.0 | 2.193 ± 0.934 | 2.193 ± 1.479 | 2.193 ± 0.904 | 0.731 ± 0.85 | 3.655 ± 2.086 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 2.924 ± 0.728 | 2.193 ± 0.952 | 1.462 ± 0.785 | 0.731 ± 0.871 | 2.924 ± 1.427 | 2.924 ± 0.886 | 0.731 ± 0.464 | 0.0 ± 0.0 | 0.731 ± 0.464 | 4.386 ± 1.843 | 1.462 ± 0.786 | 3.655 ± 1.01 | 5.117 ± 1.467 | 1.462 ± 0.928 | 2.193 ± 1.392 | 3.655 ± 1.588 | 1.462 ± 0.928 | 5.117 ± 1.419 | 0.0 ± 0.0 | 0.731 ± 0.759 | 0.0 ± 0.0 |
| Glu | 5.117 ± 1.419 | 0.731 ± 0.759 | 2.193 ± 0.934 | 2.193 ± 1.479 | 1.462 ± 0.928 | 5.848 ± 1.197 | 0.731 ± 0.871 | 2.193 ± 0.699 | 5.117 ± 1.538 | 5.848 ± 2.267 | 1.462 ± 1.518 | 0.0 ± 0.0 | 2.924 ± 1.291 | 0.731 ± 0.871 | 5.848 ± 3.359 | 4.386 ± 2.136 | 4.386 ± 1.702 | 5.848 ± 1.693 | 0.731 ± 0.759 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Phe | 2.924 ± 1.302 | 0.731 ± 0.464 | 2.924 ± 1.291 | 0.731 ± 0.85 | 1.462 ± 0.928 | 2.924 ± 1.427 | 0.0 ± 0.0 | 0.731 ± 0.464 | 1.462 ± 0.928 | 1.462 ± 0.785 | 0.0 ± 0.0 | 2.924 ± 1.856 | 0.731 ± 0.464 | 0.0 ± 0.0 | 2.193 ± 0.904 | 1.462 ± 1.701 | 5.117 ± 1.443 | 1.462 ± 0.928 | 0.731 ± 0.759 | 2.193 ± 0.97 | 0.0 ± 0.0 |
| Gly | 4.386 ± 1.665 | 2.193 ± 1.479 | 5.848 ± 1.771 | 4.386 ± 1.336 | 4.386 ± 2.784 | 6.579 ± 4.774 | 0.0 ± 0.0 | 2.924 ± 1.275 | 3.655 ± 1.666 | 5.848 ± 1.404 | 2.193 ± 0.869 | 2.193 ± 0.904 | 5.117 ± 0.859 | 2.193 ± 1.803 | 2.193 ± 0.76 | 6.579 ± 2.665 | 1.462 ± 0.786 | 5.117 ± 1.811 | 0.731 ± 0.464 | 0.731 ± 0.871 | 0.0 ± 0.0 |
| His | 1.462 ± 0.786 | 1.462 ± 0.786 | 0.731 ± 0.85 | 1.462 ± 0.928 | 0.0 ± 0.0 | 1.462 ± 0.786 | 1.462 ± 0.786 | 0.731 ± 0.85 | 0.0 ± 0.0 | 2.193 ± 0.97 | 0.0 ± 0.0 | 1.462 ± 0.785 | 5.848 ± 4.89 | 0.731 ± 0.464 | 2.924 ± 1.688 | 0.0 ± 0.0 | 0.731 ± 0.759 | 2.193 ± 0.952 | 0.0 ± 0.0 | 0.731 ± 0.871 | 0.0 ± 0.0 |
| Ile | 5.117 ± 2.493 | 0.731 ± 0.85 | 1.462 ± 0.713 | 2.924 ± 1.302 | 0.0 ± 0.0 | 2.193 ± 1.593 | 0.0 ± 0.0 | 2.193 ± 1.593 | 3.655 ± 1.001 | 1.462 ± 0.785 | 0.731 ± 0.759 | 2.193 ± 0.952 | 4.386 ± 1.451 | 1.462 ± 0.786 | 0.0 ± 0.0 | 2.924 ± 1.302 | 2.924 ± 2.413 | 2.193 ± 0.97 | 0.731 ± 0.85 | 1.462 ± 0.928 | 0.0 ± 0.0 |
| Lys | 2.924 ± 1.856 | 0.731 ± 0.464 | 2.924 ± 1.856 | 1.462 ± 1.701 | 2.193 ± 1.398 | 3.655 ± 1.452 | 0.731 ± 0.871 | 1.462 ± 0.928 | 0.0 ± 0.0 | 3.655 ± 2.411 | 1.462 ± 0.881 | 1.462 ± 0.786 | 2.924 ± 1.275 | 1.462 ± 1.743 | 4.386 ± 2.737 | 3.655 ± 1.596 | 2.924 ± 0.886 | 5.117 ± 1.538 | 1.462 ± 0.928 | 2.193 ± 0.952 | 0.0 ± 0.0 |
| Leu | 8.772 ± 3.559 | 2.193 ± 1.392 | 2.924 ± 1.472 | 6.579 ± 1.241 | 2.193 ± 1.57 | 8.772 ± 3.249 | 2.924 ± 0.598 | 0.731 ± 0.759 | 4.386 ± 3.139 | 9.503 ± 2.894 | 1.462 ± 0.785 | 0.731 ± 0.464 | 6.579 ± 2.498 | 2.924 ± 1.275 | 2.924 ± 2.003 | 8.041 ± 2.098 | 2.924 ± 1.231 | 2.924 ± 1.427 | 2.924 ± 1.427 | 2.924 ± 1.302 | 0.0 ± 0.0 |
| Met | 1.462 ± 1.743 | 0.0 ± 0.0 | 2.193 ± 1.392 | 2.193 ± 2.277 | 0.731 ± 0.464 | 1.462 ± 0.713 | 0.0 ± 0.0 | 0.731 ± 0.759 | 1.462 ± 0.785 | 0.731 ± 0.464 | 0.731 ± 0.464 | 0.731 ± 0.759 | 1.462 ± 0.785 | 0.731 ± 0.464 | 0.0 ± 0.0 | 4.386 ± 1.939 | 0.731 ± 0.871 | 2.193 ± 0.699 | 0.731 ± 0.85 | 0.731 ± 0.464 | 0.0 ± 0.0 |
| Asn | 3.655 ± 2.32 | 2.193 ± 1.398 | 1.462 ± 1.743 | 3.655 ± 1.001 | 1.462 ± 0.785 | 0.731 ± 0.464 | 0.731 ± 0.871 | 0.731 ± 0.464 | 2.193 ± 1.593 | 5.848 ± 0.738 | 0.0 ± 0.0 | 3.655 ± 1.666 | 1.462 ± 0.713 | 0.731 ± 0.85 | 1.462 ± 0.785 | 1.462 ± 0.786 | 0.731 ± 0.85 | 1.462 ± 0.786 | 0.731 ± 0.464 | 1.462 ± 0.785 | 0.0 ± 0.0 |
| Pro | 5.117 ± 2.556 | 1.462 ± 1.518 | 2.924 ± 0.728 | 4.386 ± 2.209 | 1.462 ± 0.785 | 4.386 ± 1.702 | 3.655 ± 1.452 | 2.924 ± 0.886 | 4.386 ± 1.809 | 5.117 ± 2.248 | 1.462 ± 0.928 | 0.0 ± 0.0 | 9.503 ± 2.462 | 1.462 ± 0.786 | 10.965 ± 2.399 | 5.117 ± 0.91 | 7.31 ± 2.512 | 6.579 ± 1.112 | 0.731 ± 0.464 | 1.462 ± 0.937 | 0.0 ± 0.0 |
| Gln | 3.655 ± 1.321 | 1.462 ± 0.928 | 0.0 ± 0.0 | 2.924 ± 2.224 | 0.731 ± 0.464 | 1.462 ± 0.786 | 2.193 ± 0.904 | 0.731 ± 0.464 | 0.731 ± 0.85 | 3.655 ± 2.296 | 0.731 ± 0.464 | 0.731 ± 0.871 | 5.117 ± 1.903 | 1.462 ± 1.701 | 2.193 ± 1.392 | 2.193 ± 0.97 | 0.731 ± 0.464 | 1.462 ± 0.786 | 0.0 ± 0.0 | 0.731 ± 0.85 | 0.0 ± 0.0 |
| Arg | 10.234 ± 4.489 | 2.193 ± 1.392 | 4.386 ± 1.78 | 6.579 ± 2.918 | 2.193 ± 0.97 | 5.848 ± 2.106 | 0.0 ± 0.0 | 1.462 ± 0.937 | 1.462 ± 1.743 | 3.655 ± 1.128 | 2.193 ± 1.392 | 1.462 ± 0.785 | 4.386 ± 1.38 | 1.462 ± 1.743 | 10.234 ± 6.051 | 5.117 ± 1.647 | 6.579 ± 2.594 | 6.579 ± 1.755 | 1.462 ± 0.713 | 2.193 ± 0.97 | 0.0 ± 0.0 |
| Ser | 6.579 ± 1.141 | 0.731 ± 0.759 | 2.193 ± 1.593 | 1.462 ± 1.701 | 2.193 ± 0.97 | 4.386 ± 0.394 | 3.655 ± 2.356 | 7.31 ± 3.404 | 1.462 ± 1.132 | 4.386 ± 1.209 | 1.462 ± 0.713 | 2.924 ± 1.57 | 4.386 ± 1.478 | 1.462 ± 0.785 | 6.579 ± 1.141 | 5.117 ± 2.17 | 3.655 ± 1.321 | 7.31 ± 2.198 | 0.731 ± 0.464 | 1.462 ± 0.928 | 0.0 ± 0.0 |
| Thr | 2.924 ± 1.472 | 1.462 ± 1.001 | 0.731 ± 0.464 | 3.655 ± 1.321 | 0.731 ± 0.464 | 5.848 ± 1.981 | 2.193 ± 0.904 | 2.193 ± 1.803 | 1.462 ± 0.786 | 2.924 ± 2.224 | 1.462 ± 0.785 | 2.924 ± 1.302 | 9.503 ± 3.274 | 4.386 ± 1.478 | 3.655 ± 0.203 | 3.655 ± 1.001 | 4.386 ± 2.217 | 2.193 ± 0.934 | 1.462 ± 1.518 | 0.731 ± 0.464 | 0.0 ± 0.0 |
| Val | 4.386 ± 1.78 | 3.655 ± 1.025 | 4.386 ± 2.796 | 6.579 ± 1.037 | 3.655 ± 1.001 | 2.924 ± 0.886 | 2.193 ± 0.952 | 4.386 ± 0.394 | 5.117 ± 2.204 | 9.503 ± 0.547 | 1.462 ± 0.713 | 2.193 ± 0.952 | 4.386 ± 1.843 | 1.462 ± 1.743 | 6.579 ± 1.037 | 2.193 ± 0.97 | 5.848 ± 2.773 | 7.31 ± 2.466 | 0.0 ± 0.0 | 1.462 ± 0.928 | 0.0 ± 0.0 |
| Trp | 0.731 ± 0.759 | 0.0 ± 0.0 | 0.731 ± 0.464 | 0.0 ± 0.0 | 0.731 ± 0.759 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.731 ± 0.759 | 0.731 ± 0.464 | 2.924 ± 0.756 | 0.0 ± 0.678 | 0.731 ± 0.464 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.462 ± 0.928 | 1.462 ± 0.785 | 0.731 ± 0.464 | 0.731 ± 0.759 | 0.731 ± 0.464 | 2.193 ± 1.398 | 0.0 ± 0.0 |
| Tyr | 1.462 ± 0.785 | 0.0 ± 0.0 | 1.462 ± 0.928 | 0.731 ± 0.759 | 1.462 ± 0.928 | 2.193 ± 0.904 | 0.0 ± 0.0 | 0.731 ± 0.464 | 2.193 ± 1.392 | 1.462 ± 1.701 | 0.731 ± 0.871 | 0.731 ± 0.871 | 2.193 ± 0.699 | 1.462 ± 0.713 | 2.193 ± 1.392 | 0.731 ± 0.85 | 3.655 ± 1.7 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 4 proteins (1369 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
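A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is only a sketch under stated assumptions. The function names, the fixed seed, the toy sequences, and the use of the population standard deviation as the error measure are all choices made for this example, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumptions: proteins are resampled with replacement, keeping the
    number of proteins fixed; the error is the standard deviation of
    the replicate frequencies.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not PEMV-2 proteins).
toy = ["MKVLA", "AVLA", "GGGG"]
print(bootstrap_error(toy, "VL"))
print(bootstrap_error(toy, "WW"))  # absent from every protein
```

Under this scheme a dipeptide that is absent from every protein has a frequency of zero in every replicate, so its error is exactly zero, which is consistent with the 0.0 ± 0.0 entries in the table.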

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski