Amino acid dipeptide frequency for Wenling crustacean virus 13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
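As a rough illustration of the calculation described above, the following Python sketch counts dipeptides in a set of protein sequences and normalizes them to frequencies. It is only a sketch under stated assumptions: the page does not say how Proteome-pI counts dipeptides, so overlapping windows, counting within each protein only, and mapping every non-standard letter to `X` (Xaa) are assumptions, and the function names and toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_counts(sequences):
    """Count overlapping dipeptides inside each protein (assumption: no pairs
    spanning two proteins). Non-standard letters are mapped to 'X' (Xaa)."""
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    return counts


def dipeptide_frequencies(sequences):
    """Return the fraction of each of the 441 possible dipeptides (21 x 21)."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, not taken from the proteome described here.
toy = ["MKLLA", "LLXG"]
freqs = dipeptide_frequencies(toy)

uniform_expectation = 1 / 400  # 0.25% for each of the 400 standard dipeptides
print(len(freqs), freqs["LL"], uniform_expectation)
```

The dictionary always contains all 441 keys, so dipeptides that never occur show up with a frequency of 0.0, as the Xaa rows and columns do in the table.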

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the 0.25% expected for random occurrence corresponds to 2.5‰.
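The unit conversion can be sketched in a few lines of Python. The example below converts per mille values to fractions and compares them with the 0.25% expected for random occurrence. The exact threshold used for the red and blue marking is not stated on the page, so the plain greater-than or less-than comparison here is an assumption, and the helper names are hypothetical; the two values are taken from the table (LeuLeu and CysAla).

```python
EXPECTED_FRACTION = 1 / 400  # 0.25%, i.e. 2.5 per mille, for 400 equally likely dipeptides


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_FRACTION):
    """Label a dipeptide relative to the uniform expectation.
    Assumption: a simple comparison with no tolerance band."""
    fraction = per_mille_to_fraction(value_per_mille)
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "as expected"


# Values copied from the table: LeuLeu 12.721 and CysAla 0.31 (both per mille).
for name, value in [("LeuLeu", 12.721), ("CysAla", 0.31)]:
    print(name, per_mille_to_fraction(value), classify(value))
```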

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.413AlaAla: 3.413 ± 0.779
2.482AlaCys: 2.482 ± 0.322
1.551AlaAsp: 1.551 ± 0.399
4.034AlaGlu: 4.034 ± 0.469
1.862AlaPhe: 1.862 ± 0.656
4.344AlaGly: 4.344 ± 2.415
1.241AlaHis: 1.241 ± 0.666
4.034AlaIle: 4.034 ± 1.702
3.413AlaLys: 3.413 ± 1.251
6.516AlaLeu: 6.516 ± 0.616
2.792AlaMet: 2.792 ± 0.642
1.862AlaAsn: 1.862 ± 2.179
2.792AlaPro: 2.792 ± 1.937
2.792AlaGln: 2.792 ± 1.425
2.172AlaArg: 2.172 ± 0.801
6.516AlaSer: 6.516 ± 0.915
4.344AlaThr: 4.344 ± 0.323
5.275AlaVal: 5.275 ± 0.581
1.551AlaTrp: 1.551 ± 0.787
3.413AlaTyr: 3.413 ± 0.917
0.0AlaXaa: 0.0 ± 0.0
Cys
0.31CysAla: 0.31 ± 0.456
0.621CysCys: 0.621 ± 0.333
0.931CysAsp: 0.931 ± 0.5
1.862CysGlu: 1.862 ± 0.61
0.31CysPhe: 0.31 ± 0.456
0.31CysGly: 0.31 ± 0.499
1.241CysHis: 1.241 ± 0.757
2.792CysIle: 2.792 ± 0.487
1.862CysLys: 1.862 ± 0.525
2.792CysLeu: 2.792 ± 0.252
0.31CysMet: 0.31 ± 0.167
0.31CysAsn: 0.31 ± 0.456
0.621CysPro: 0.621 ± 0.333
0.31CysGln: 0.31 ± 0.167
1.551CysArg: 1.551 ± 0.399
1.551CysSer: 1.551 ± 0.667
1.551CysThr: 1.551 ± 0.399
0.931CysVal: 0.931 ± 0.5
0.31CysTrp: 0.31 ± 0.456
1.862CysTyr: 1.862 ± 0.61
0.0CysXaa: 0.0 ± 0.0
Asp
2.792AspAla: 2.792 ± 0.547
0.621AspCys: 0.621 ± 0.378
3.723AspAsp: 3.723 ± 0.679
5.275AspGlu: 5.275 ± 0.292
2.482AspPhe: 2.482 ± 1.073
2.482AspGly: 2.482 ± 0.438
1.551AspHis: 1.551 ± 0.833
1.551AspIle: 1.551 ± 0.399
2.172AspLys: 2.172 ± 0.161
7.446AspLeu: 7.446 ± 2.223
1.551AspMet: 1.551 ± 1.245
2.172AspAsn: 2.172 ± 1.166
1.241AspPro: 1.241 ± 0.752
0.931AspGln: 0.931 ± 0.52
1.241AspArg: 1.241 ± 0.666
4.034AspSer: 4.034 ± 3.029
3.723AspThr: 3.723 ± 0.478
2.482AspVal: 2.482 ± 1.333
1.551AspTrp: 1.551 ± 0.524
0.31AspTyr: 0.31 ± 0.167
0.0AspXaa: 0.0 ± 0.0
Glu
4.654GluAla: 4.654 ± 0.941
0.931GluCys: 0.931 ± 0.5
2.792GluAsp: 2.792 ± 0.487
2.792GluGlu: 2.792 ± 1.499
0.931GluPhe: 0.931 ± 0.5
6.516GluGly: 6.516 ± 0.616
0.621GluHis: 0.621 ± 0.998
3.413GluIle: 3.413 ± 0.902
3.103GluLys: 3.103 ± 1.268
6.826GluLeu: 6.826 ± 2.599
1.862GluMet: 1.862 ± 0.062
3.103GluAsn: 3.103 ± 1.333
2.172GluPro: 2.172 ± 0.669
1.551GluGln: 1.551 ± 0.667
4.344GluArg: 4.344 ± 0.962
3.413GluSer: 3.413 ± 1.3
6.205GluThr: 6.205 ± 1.519
5.275GluVal: 5.275 ± 0.592
0.931GluTrp: 0.931 ± 0.305
0.931GluTyr: 0.931 ± 0.5
0.0GluXaa: 0.0 ± 0.0
Phe
0.621PheAla: 0.621 ± 0.378
0.621PheCys: 0.621 ± 0.333
1.862PheAsp: 1.862 ± 0.525
2.482PheGlu: 2.482 ± 0.953
0.931PhePhe: 0.931 ± 0.5
1.862PheGly: 1.862 ± 1.739
0.621PheHis: 0.621 ± 0.333
0.931PheIle: 0.931 ± 0.52
0.621PheLys: 0.621 ± 0.378
4.964PheLeu: 4.964 ± 1.642
0.931PheMet: 0.931 ± 0.5
0.931PheAsn: 0.931 ± 0.361
1.551PhePro: 1.551 ± 0.758
2.172PheGln: 2.172 ± 0.161
2.792PheArg: 2.792 ± 1.109
3.413PheSer: 3.413 ± 0.622
1.862PheThr: 1.862 ± 0.63
1.862PheVal: 1.862 ± 0.626
0.931PheTrp: 0.931 ± 0.361
0.31PheTyr: 0.31 ± 0.456
0.0PheXaa: 0.0 ± 0.0
Gly
4.344GlyAla: 4.344 ± 0.321
0.621GlyCys: 0.621 ± 0.998
2.482GlyAsp: 2.482 ± 0.977
4.034GlyGlu: 4.034 ± 1.455
3.103GlyPhe: 3.103 ± 1.516
4.964GlyGly: 4.964 ± 2.711
2.482GlyHis: 2.482 ± 0.345
3.413GlyIle: 3.413 ± 0.806
1.862GlyLys: 1.862 ± 0.062
2.792GlyLeu: 2.792 ± 0.419
0.931GlyMet: 0.931 ± 0.87
1.551GlyAsn: 1.551 ± 0.718
4.344GlyPro: 4.344 ± 3.317
1.241GlyGln: 1.241 ± 0.355
4.344GlyArg: 4.344 ± 1.478
3.723GlySer: 3.723 ± 2.27
2.172GlyThr: 2.172 ± 0.801
3.413GlyVal: 3.413 ± 0.818
0.931GlyTrp: 0.931 ± 0.305
2.792GlyTyr: 2.792 ± 0.879
0.0GlyXaa: 0.0 ± 0.0
His
1.862HisAla: 1.862 ± 0.525
0.621HisCys: 0.621 ± 0.376
2.482HisAsp: 2.482 ± 0.438
1.862HisGlu: 1.862 ± 0.63
1.241HisPhe: 1.241 ± 0.666
2.482HisGly: 2.482 ± 0.345
1.241HisHis: 1.241 ± 0.355
1.551HisIle: 1.551 ± 0.399
1.551HisLys: 1.551 ± 0.667
3.723HisLeu: 3.723 ± 0.478
0.621HisMet: 0.621 ± 0.378
0.931HisAsn: 0.931 ± 0.305
1.862HisPro: 1.862 ± 1.269
0.931HisGln: 0.931 ± 0.305
2.172HisArg: 2.172 ± 0.669
1.551HisSer: 1.551 ± 0.193
1.862HisThr: 1.862 ± 0.626
2.172HisVal: 2.172 ± 0.514
0.31HisTrp: 0.31 ± 0.167
2.482HisTyr: 2.482 ± 0.821
0.0HisXaa: 0.0 ± 0.0
Ile
4.344IleAla: 4.344 ± 0.501
1.241IleCys: 1.241 ± 0.313
4.654IleAsp: 4.654 ± 0.481
4.964IleGlu: 4.964 ± 1.107
1.551IlePhe: 1.551 ± 0.833
1.551IleGly: 1.551 ± 0.399
1.241IleHis: 1.241 ± 0.666
4.034IleIle: 4.034 ± 0.179
2.792IleLys: 2.792 ± 0.843
5.585IleLeu: 5.585 ± 0.629
2.172IleMet: 2.172 ± 0.161
1.241IleAsn: 1.241 ± 0.418
4.964IlePro: 4.964 ± 1.421
3.413IleGln: 3.413 ± 0.234
4.034IleArg: 4.034 ± 1.753
3.103IleSer: 3.103 ± 1.138
4.964IleThr: 4.964 ± 1.421
4.034IleVal: 4.034 ± 2.468
0.931IleTrp: 0.931 ± 0.361
2.482IleTyr: 2.482 ± 0.627
0.0IleXaa: 0.0 ± 0.0
Lys
3.103LysAla: 3.103 ± 1.33
0.931LysCys: 0.931 ± 0.52
1.551LysAsp: 1.551 ± 0.193
1.551LysGlu: 1.551 ± 0.833
3.103LysPhe: 3.103 ± 0.798
0.931LysGly: 0.931 ± 0.5
0.621LysHis: 0.621 ± 0.376
6.516LysIle: 6.516 ± 0.578
3.413LysLys: 3.413 ± 0.818
4.344LysLeu: 4.344 ± 1.897
1.551LysMet: 1.551 ± 0.385
3.413LysAsn: 3.413 ± 0.344
1.862LysPro: 1.862 ± 1.269
1.862LysGln: 1.862 ± 0.626
4.034LysArg: 4.034 ± 0.628
2.482LysSer: 2.482 ± 0.953
1.862LysThr: 1.862 ± 0.525
4.034LysVal: 4.034 ± 1.005
1.241LysTrp: 1.241 ± 0.418
0.621LysTyr: 0.621 ± 0.333
0.0LysXaa: 0.0 ± 0.0
Leu
8.067LeuAla: 8.067 ± 0.974
2.482LeuCys: 2.482 ± 0.837
5.895LeuAsp: 5.895 ± 0.474
5.895LeuGlu: 5.895 ± 2.149
2.792LeuPhe: 2.792 ± 1.417
4.654LeuGly: 4.654 ± 1.196
4.034LeuHis: 4.034 ± 0.179
4.964LeuIle: 4.964 ± 0.958
5.585LeuLys: 5.585 ± 0.187
12.721LeuLeu: 12.721 ± 1.67
1.241LeuMet: 1.241 ± 0.355
3.723LeuAsn: 3.723 ± 1.59
6.516LeuPro: 6.516 ± 0.807
3.103LeuGln: 3.103 ± 1.268
5.585LeuArg: 5.585 ± 1.939
8.998LeuSer: 8.998 ± 2.397
3.413LeuThr: 3.413 ± 0.818
7.446LeuVal: 7.446 ± 1.559
1.551LeuTrp: 1.551 ± 0.787
3.413LeuTyr: 3.413 ± 0.344
0.0LeuXaa: 0.0 ± 0.0
Met
3.413MetAla: 3.413 ± 1.788
0.31MetCys: 0.31 ± 0.167
0.621MetAsp: 0.621 ± 0.333
2.792MetGlu: 2.792 ± 0.934
0.621MetPhe: 0.621 ± 0.378
1.241MetGly: 1.241 ± 1.578
1.551MetHis: 1.551 ± 0.667
0.931MetIle: 0.931 ± 0.361
2.172MetLys: 2.172 ± 0.763
0.621MetLeu: 0.621 ± 0.378
0.931MetMet: 0.931 ± 0.52
0.0MetAsn: 0.0 ± 0.0
1.551MetPro: 1.551 ± 0.833
0.31MetGln: 0.31 ± 0.167
1.551MetArg: 1.551 ± 0.833
1.551MetSer: 1.551 ± 0.833
1.862MetThr: 1.862 ± 1.909
1.862MetVal: 1.862 ± 1.135
0.0MetTrp: 0.0 ± 0.0
0.31MetTyr: 0.31 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
3.103AsnAla: 3.103 ± 1.91
0.31AsnCys: 0.31 ± 0.167
1.551AsnAsp: 1.551 ± 0.718
0.621AsnGlu: 0.621 ± 0.378
1.241AsnPhe: 1.241 ± 0.752
2.172AsnGly: 2.172 ± 0.48
0.621AsnHis: 0.621 ± 0.333
1.862AsnIle: 1.862 ± 0.525
2.482AsnLys: 2.482 ± 0.322
4.964AsnLeu: 4.964 ± 1.079
0.931AsnMet: 0.931 ± 0.87
1.551AsnAsn: 1.551 ± 0.833
1.862AsnPro: 1.862 ± 0.656
2.172AsnGln: 2.172 ± 0.669
1.862AsnArg: 1.862 ± 0.626
1.551AsnSer: 1.551 ± 0.667
2.172AsnThr: 2.172 ± 0.669
2.172AsnVal: 2.172 ± 0.514
1.551AsnTrp: 1.551 ± 0.193
1.862AsnTyr: 1.862 ± 0.656
0.0AsnXaa: 0.0 ± 0.0
Pro
3.413ProAla: 3.413 ± 1.784
1.551ProCys: 1.551 ± 0.667
3.103ProAsp: 3.103 ± 1.049
2.482ProGlu: 2.482 ± 0.965
2.172ProPhe: 2.172 ± 1.166
3.723ProGly: 3.723 ± 3.352
2.172ProHis: 2.172 ± 0.514
3.103ProIle: 3.103 ± 0.893
2.172ProLys: 2.172 ± 1.12
4.654ProLeu: 4.654 ± 0.481
0.31ProMet: 0.31 ± 0.296
2.172ProAsn: 2.172 ± 1.761
4.654ProPro: 4.654 ± 1.523
3.103ProGln: 3.103 ± 2.442
2.482ProArg: 2.482 ± 0.322
1.862ProSer: 1.862 ± 0.999
1.862ProThr: 1.862 ± 0.656
3.723ProVal: 3.723 ± 1.621
1.862ProTrp: 1.862 ± 0.062
1.241ProTyr: 1.241 ± 0.947
0.0ProXaa: 0.0 ± 0.0
Gln
2.482GlnAla: 2.482 ± 0.71
0.621GlnCys: 0.621 ± 0.378
3.103GlnAsp: 3.103 ± 0.946
2.792GlnGlu: 2.792 ± 0.487
0.931GlnPhe: 0.931 ± 0.52
1.551GlnGly: 1.551 ± 1.331
1.551GlnHis: 1.551 ± 0.193
1.862GlnIle: 1.862 ± 0.626
0.931GlnLys: 0.931 ± 0.5
2.792GlnLeu: 2.792 ± 0.934
0.621GlnMet: 0.621 ± 0.376
1.551GlnAsn: 1.551 ± 0.399
1.551GlnPro: 1.551 ± 1.331
1.241GlnGln: 1.241 ± 0.752
1.862GlnArg: 1.862 ± 0.999
2.172GlnSer: 2.172 ± 0.801
0.621GlnThr: 0.621 ± 0.333
4.654GlnVal: 4.654 ± 0.204
0.621GlnTrp: 0.621 ± 0.378
0.621GlnTyr: 0.621 ± 0.333
0.0GlnXaa: 0.0 ± 0.0
Arg
4.344ArgAla: 4.344 ± 1.456
0.621ArgCys: 0.621 ± 0.333
2.172ArgAsp: 2.172 ± 0.669
4.964ArgGlu: 4.964 ± 1.65
0.931ArgPhe: 0.931 ± 0.5
1.551ArgGly: 1.551 ± 0.787
3.413ArgHis: 3.413 ± 1.429
3.413ArgIle: 3.413 ± 0.234
4.034ArgLys: 4.034 ± 0.535
7.757ArgLeu: 7.757 ± 1.13
1.241ArgMet: 1.241 ± 0.313
1.241ArgAsn: 1.241 ± 0.313
2.482ArgPro: 2.482 ± 1.532
1.241ArgGln: 1.241 ± 0.666
3.103ArgArg: 3.103 ± 1.138
3.413ArgSer: 3.413 ± 1.176
4.964ArgThr: 4.964 ± 0.691
6.205ArgVal: 6.205 ± 1.296
0.621ArgTrp: 0.621 ± 0.333
1.551ArgTyr: 1.551 ± 0.193
0.0ArgXaa: 0.0 ± 0.0
Ser
4.034SerAla: 4.034 ± 0.815
1.551SerCys: 1.551 ± 0.399
2.172SerAsp: 2.172 ± 0.801
4.034SerGlu: 4.034 ± 1.626
2.172SerPhe: 2.172 ± 0.48
4.344SerGly: 4.344 ± 2.649
3.413SerHis: 3.413 ± 1.794
5.275SerIle: 5.275 ± 0.841
2.482SerLys: 2.482 ± 0.627
6.205SerLeu: 6.205 ± 0.753
1.241SerMet: 1.241 ± 0.313
2.172SerAsn: 2.172 ± 0.763
2.792SerPro: 2.792 ± 0.978
1.862SerGln: 1.862 ± 0.656
3.103SerArg: 3.103 ± 0.893
5.895SerSer: 5.895 ± 1.136
4.964SerThr: 4.964 ± 0.742
5.275SerVal: 5.275 ± 1.414
2.482SerTrp: 2.482 ± 0.627
3.413SerTyr: 3.413 ± 1.3
0.0SerXaa: 0.0 ± 0.0
Thr
3.723ThrAla: 3.723 ± 0.677
2.172ThrCys: 2.172 ± 0.48
3.103ThrAsp: 3.103 ± 0.951
2.172ThrGlu: 2.172 ± 0.596
1.241ThrPhe: 1.241 ± 0.313
3.103ThrGly: 3.103 ± 0.251
1.241ThrHis: 1.241 ± 0.313
6.516ThrIle: 6.516 ± 0.915
1.551ThrLys: 1.551 ± 1.245
6.205ThrLeu: 6.205 ± 0.926
1.551ThrMet: 1.551 ± 0.193
3.413ThrAsn: 3.413 ± 0.622
3.103ThrPro: 3.103 ± 0.463
1.551ThrGln: 1.551 ± 0.758
6.516ThrArg: 6.516 ± 0.166
3.103ThrSer: 3.103 ± 0.722
3.103ThrThr: 3.103 ± 0.463
3.413ThrVal: 3.413 ± 0.622
1.241ThrTrp: 1.241 ± 0.418
1.551ThrTyr: 1.551 ± 0.399
0.0ThrXaa: 0.0 ± 0.0
Val
4.964ValAla: 4.964 ± 0.992
2.792ValCys: 2.792 ± 0.698
3.723ValAsp: 3.723 ± 0.556
4.034ValGlu: 4.034 ± 1.108
2.792ValPhe: 2.792 ± 0.915
4.034ValGly: 4.034 ± 0.628
3.413ValHis: 3.413 ± 1.751
4.964ValIle: 4.964 ± 1.313
2.792ValLys: 2.792 ± 0.487
5.275ValLeu: 5.275 ± 1.313
0.621ValMet: 0.621 ± 0.376
2.482ValAsn: 2.482 ± 0.969
3.413ValPro: 3.413 ± 1.177
2.792ValGln: 2.792 ± 0.487
3.723ValArg: 3.723 ± 0.125
5.585ValSer: 5.585 ± 2.316
5.585ValThr: 5.585 ± 0.187
7.136ValVal: 7.136 ± 1.163
1.241ValTrp: 1.241 ± 0.418
3.103ValTyr: 3.103 ± 0.386
0.0ValXaa: 0.0 ± 0.0
Trp
1.241TrpAla: 1.241 ± 0.418
0.621TrpCys: 0.621 ± 0.378
1.551TrpAsp: 1.551 ± 0.833
0.931TrpGlu: 0.931 ± 0.305
0.621TrpPhe: 0.621 ± 0.378
1.862TrpGly: 1.862 ± 1.269
0.31TrpHis: 0.31 ± 0.167
0.621TrpIle: 0.621 ± 0.333
1.862TrpLys: 1.862 ± 0.999
1.862TrpLeu: 1.862 ± 0.626
0.31TrpMet: 0.31 ± 0.499
1.241TrpAsn: 1.241 ± 0.666
0.931TrpPro: 0.931 ± 0.5
0.0TrpGln: 0.0 ± 0.0
1.862TrpArg: 1.862 ± 0.626
2.172TrpSer: 2.172 ± 0.596
0.931TrpThr: 0.931 ± 0.819
1.551TrpVal: 1.551 ± 1.19
0.621TrpTrp: 0.621 ± 0.378
0.621TrpTyr: 0.621 ± 0.333
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.482TyrAla: 2.482 ± 1.333
0.621TyrCys: 0.621 ± 0.333
0.31TyrAsp: 0.31 ± 0.167
2.792TyrGlu: 2.792 ± 0.698
1.241TyrPhe: 1.241 ± 0.313
1.862TyrGly: 1.862 ± 0.062
0.931TyrHis: 0.931 ± 0.305
1.551TyrIle: 1.551 ± 0.718
2.172TyrLys: 2.172 ± 0.514
4.034TyrLeu: 4.034 ± 1.626
2.172TyrMet: 2.172 ± 0.161
1.551TyrAsn: 1.551 ± 0.193
1.551TyrPro: 1.551 ± 0.667
1.551TyrGln: 1.551 ± 0.667
0.931TyrArg: 0.931 ± 0.305
2.792TyrSer: 2.792 ± 1.109
1.551TyrThr: 1.551 ± 0.787
1.551TyrVal: 1.551 ± 0.193
1.241TyrTrp: 1.241 ± 0.666
0.31TyrTyr: 0.31 ± 0.167
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3224 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
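The following Python sketch shows one way such an error could be estimated. The page states only that bootstrapping with 100 resamples was done at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the resampled estimates are assumptions, as are the function names and the toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide in per mille, counted with overlapping
    windows inside each protein (an assumption about the counting scheme)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the standard
    deviation of the per mille frequency across the resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteins, not the three proteins of this proteome.
proteins = ["MKLLALLG", "MLLSAVK", "MAGGLLPR"]
value = dipeptide_per_mille(proteins, "LL")
error = bootstrap_error(proteins, "LL")
print(f"LL: {value:.3f} +/- {error:.3f} per mille")
```

With only three proteins, as in this proteome, each resample contains very few distinct proteins, which may explain why some errors in the table are of the same order as the values themselves.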

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski