Amino acid dipeptide frequency for Cardamine chlorotic fleck virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
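As a rough illustration of how such a frequency can be computed, the Python sketch below counts overlapping pairs of adjacent residues and reports them in per mille. The function name, the one-letter input alphabet and the toy sequences are assumptions made for this example; the exact counting convention used for the table on this page is not stated here.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 over each protein separately,
        # so that no pair spans the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the proteome described here.
toy = ["MAAS", "AASL"]
freqs = dipeptide_per_mille(toy)
```

Here the six pairs are MA, AA, AS, AA, AS and SL, so AA accounts for two of six dipeptides.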

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
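The arithmetic behind the expected value, and the red/blue marking, can be sketched as follows. This is a minimal sketch that assumes the marking is a plain comparison against the uniform expectation; the page does not say whether any margin is applied, and the function names are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"    # would be marked in red
    if freq_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


# Two values taken from the table: AlaAla (8.54) and AlaThr (1.553).
labels = [classify(8.54), classify(1.553)]
```

With 20 amino acids the expectation is 1000 / 400 = 2.5‰; with Xaa included it is 1000 / 441, roughly 2.27‰.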

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (e.g. 8.54‰ = 0.00854).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.54 ± 3.585
AlaCys: 3.106 ± 0.928
AlaAsp: 2.329 ± 2.035
AlaGlu: 2.329 ± 1.352
AlaPhe: 3.106 ± 1.069
AlaGly: 4.658 ± 2.401
AlaHis: 3.106 ± 1.573
AlaIle: 5.435 ± 1.107
AlaLys: 4.658 ± 1.217
AlaLeu: 5.435 ± 1.503
AlaMet: 2.329 ± 1.352
AlaAsn: 3.106 ± 1.829
AlaPro: 4.658 ± 1.217
AlaGln: 3.106 ± 0.928
AlaArg: 4.658 ± 2.42
AlaSer: 9.317 ± 1.657
AlaThr: 1.553 ± 2.049
AlaVal: 6.211 ± 1.432
AlaTrp: 2.329 ± 1.352
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.553 ± 0.902
CysCys: 0.0 ± 0.0
CysAsp: 0.776 ± 0.779
CysGlu: 1.553 ± 0.534
CysPhe: 0.776 ± 0.451
CysGly: 3.106 ± 0.928
CysHis: 0.776 ± 1.631
CysIle: 0.776 ± 0.451
CysLys: 0.0 ± 0.0
CysLeu: 2.329 ± 1.352
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.329 ± 0.608
CysGln: 0.776 ± 0.451
CysArg: 1.553 ± 0.902
CysSer: 0.776 ± 0.451
CysThr: 1.553 ± 0.902
CysVal: 1.553 ± 0.902
CysTrp: 0.0 ± 0.0
CysTyr: 0.776 ± 0.451
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.211 ± 1.856
AspCys: 0.776 ± 0.451
AspAsp: 2.329 ± 0.608
AspGlu: 2.329 ± 1.352
AspPhe: 2.329 ± 0.608
AspGly: 3.882 ± 1.053
AspHis: 0.0 ± 0.0
AspIle: 1.553 ± 1.776
AspLys: 2.329 ± 2.035
AspLeu: 3.882 ± 2.254
AspMet: 1.553 ± 0.902
AspAsn: 4.658 ± 4.729
AspPro: 3.882 ± 1.053
AspGln: 1.553 ± 1.48
AspArg: 1.553 ± 0.902
AspSer: 3.106 ± 2.303
AspThr: 0.776 ± 0.451
AspVal: 2.329 ± 1.352
AspTrp: 0.0 ± 0.0
AspTyr: 0.776 ± 0.451
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.211 ± 1.856
GluCys: 0.776 ± 0.451
GluAsp: 2.329 ± 0.608
GluGlu: 5.435 ± 3.156
GluPhe: 3.106 ± 0.928
GluGly: 4.658 ± 1.217
GluHis: 3.106 ± 0.928
GluIle: 2.329 ± 0.608
GluLys: 3.106 ± 1.812
GluLeu: 3.882 ± 1.326
GluMet: 0.0 ± 0.0
GluAsn: 0.776 ± 0.451
GluPro: 2.329 ± 0.608
GluGln: 2.329 ± 0.608
GluArg: 3.882 ± 1.053
GluSer: 2.329 ± 1.965
GluThr: 2.329 ± 0.608
GluVal: 6.211 ± 1.779
GluTrp: 0.0 ± 0.0
GluTyr: 0.776 ± 0.779
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.776 ± 0.451
PheCys: 0.776 ± 0.451
PheAsp: 1.553 ± 0.902
PheGlu: 3.106 ± 1.069
PhePhe: 0.776 ± 0.451
PheGly: 3.882 ± 1.326
PheHis: 1.553 ± 0.534
PheIle: 0.0 ± 0.0
PheLys: 0.776 ± 0.451
PheLeu: 2.329 ± 0.608
PheMet: 0.776 ± 1.768
PheAsn: 3.882 ± 2.31
PhePro: 1.553 ± 0.902
PheGln: 2.329 ± 1.459
PheArg: 0.776 ± 0.779
PheSer: 3.106 ± 1.812
PheThr: 2.329 ± 1.258
PheVal: 4.658 ± 1.217
PheTrp: 0.776 ± 1.631
PheTyr: 2.329 ± 0.608
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.776 ± 0.779
GlyCys: 3.106 ± 0.928
GlyAsp: 6.211 ± 1.482
GlyGlu: 5.435 ± 1.503
GlyPhe: 3.106 ± 1.573
GlyGly: 4.658 ± 1.714
GlyHis: 0.776 ± 0.451
GlyIle: 3.882 ± 1.326
GlyLys: 5.435 ± 4.009
GlyLeu: 4.658 ± 1.217
GlyMet: 1.553 ± 0.902
GlyAsn: 6.211 ± 1.782
GlyPro: 1.553 ± 0.534
GlyGln: 1.553 ± 1.559
GlyArg: 4.658 ± 2.705
GlySer: 8.54 ± 2.582
GlyThr: 1.553 ± 1.559
GlyVal: 3.882 ± 1.18
GlyTrp: 1.553 ± 0.534
GlyTyr: 3.882 ± 2.254
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.553 ± 2.049
HisCys: 0.0 ± 0.0
HisAsp: 1.553 ± 0.902
HisGlu: 0.776 ± 0.451
HisPhe: 0.776 ± 0.451
HisGly: 0.776 ± 0.779
HisHis: 0.776 ± 0.451
HisIle: 2.329 ± 1.47
HisLys: 0.776 ± 0.451
HisLeu: 3.106 ± 2.961
HisMet: 0.0 ± 0.0
HisAsn: 0.776 ± 0.451
HisPro: 1.553 ± 0.902
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.776 ± 0.451
HisThr: 3.882 ± 1.053
HisVal: 1.553 ± 0.902
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.882 ± 1.053
IleCys: 0.776 ± 0.451
IleAsp: 1.553 ± 0.902
IleGlu: 2.329 ± 2.035
IlePhe: 1.553 ± 0.902
IleGly: 5.435 ± 1.5
IleHis: 0.776 ± 0.779
IleIle: 0.776 ± 0.451
IleLys: 3.106 ± 1.254
IleLeu: 3.106 ± 3.059
IleMet: 0.776 ± 0.451
IleAsn: 2.329 ± 1.352
IlePro: 4.658 ± 1.235
IleGln: 0.776 ± 0.451
IleArg: 3.882 ± 1.18
IleSer: 4.658 ± 5.317
IleThr: 1.553 ± 0.534
IleVal: 2.329 ± 0.608
IleTrp: 0.0 ± 0.0
IleTyr: 1.553 ± 0.902
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.776 ± 0.451
LysCys: 1.553 ± 0.902
LysAsp: 1.553 ± 0.902
LysGlu: 3.106 ± 0.928
LysPhe: 3.106 ± 2.791
LysGly: 4.658 ± 1.217
LysHis: 0.776 ± 0.451
LysIle: 3.882 ± 1.682
LysLys: 3.882 ± 1.682
LysLeu: 6.988 ± 2.244
LysMet: 2.329 ± 0.608
LysAsn: 1.553 ± 0.534
LysPro: 3.106 ± 1.254
LysGln: 2.329 ± 2.664
LysArg: 3.106 ± 4.098
LysSer: 3.106 ± 2.621
LysThr: 0.776 ± 0.451
LysVal: 4.658 ± 3.357
LysTrp: 3.106 ± 0.928
LysTyr: 3.882 ± 1.494
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.975 ± 3.438
LeuCys: 1.553 ± 1.48
LeuAsp: 3.106 ± 0.928
LeuGlu: 5.435 ± 1.503
LeuPhe: 0.0 ± 0.0
LeuGly: 4.658 ± 4.157
LeuHis: 1.553 ± 0.902
LeuIle: 4.658 ± 4.778
LeuLys: 7.764 ± 2.566
LeuLeu: 6.988 ± 5.755
LeuMet: 1.553 ± 0.744
LeuAsn: 3.106 ± 1.803
LeuPro: 7.764 ± 1.663
LeuGln: 0.776 ± 0.779
LeuArg: 5.435 ± 2.185
LeuSer: 3.882 ± 2.905
LeuThr: 6.211 ± 3.023
LeuVal: 7.764 ± 1.511
LeuTrp: 0.776 ± 0.779
LeuTyr: 2.329 ± 0.608
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.329 ± 1.258
MetCys: 0.0 ± 0.0
MetAsp: 1.553 ± 2.049
MetGlu: 0.776 ± 0.451
MetPhe: 0.0 ± 0.0
MetGly: 1.553 ± 0.902
MetHis: 1.553 ± 0.902
MetIle: 0.0 ± 0.0
MetLys: 1.553 ± 0.902
MetLeu: 1.553 ± 0.902
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.776 ± 1.631
MetArg: 1.553 ± 0.534
MetSer: 2.329 ± 1.352
MetThr: 0.776 ± 0.451
MetVal: 0.776 ± 0.451
MetTrp: 0.776 ± 0.451
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.106 ± 1.573
AsnCys: 3.106 ± 1.803
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.0 ± 0.0
AsnPhe: 2.329 ± 1.965
AsnGly: 3.882 ± 1.18
AsnHis: 0.776 ± 0.451
AsnIle: 1.553 ± 0.902
AsnLys: 1.553 ± 2.049
AsnLeu: 3.106 ± 1.069
AsnMet: 0.776 ± 1.276
AsnAsn: 2.329 ± 1.352
AsnPro: 3.882 ± 1.769
AsnGln: 3.106 ± 1.812
AsnArg: 3.882 ± 1.326
AsnSer: 3.106 ± 2.455
AsnThr: 4.658 ± 2.94
AsnVal: 2.329 ± 0.608
AsnTrp: 0.776 ± 2.223
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.658 ± 1.272
ProCys: 1.553 ± 0.534
ProAsp: 6.211 ± 1.155
ProGlu: 3.106 ± 0.928
ProPhe: 0.776 ± 0.451
ProGly: 0.0 ± 0.0
ProHis: 0.0 ± 0.0
ProIle: 3.106 ± 0.928
ProLys: 3.106 ± 2.024
ProLeu: 3.106 ± 1.069
ProMet: 2.329 ± 0.607
ProAsn: 0.0 ± 0.0
ProPro: 2.329 ± 1.352
ProGln: 3.882 ± 2.254
ProArg: 8.54 ± 2.598
ProSer: 5.435 ± 3.303
ProThr: 3.106 ± 1.069
ProVal: 1.553 ± 1.48
ProTrp: 1.553 ± 1.776
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 0.776 ± 0.451
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.776 ± 0.451
GlnPhe: 1.553 ± 0.902
GlnGly: 1.553 ± 0.534
GlnHis: 1.553 ± 1.48
GlnIle: 2.329 ± 1.258
GlnLys: 2.329 ± 0.608
GlnLeu: 6.211 ± 2.509
GlnMet: 0.776 ± 0.451
GlnAsn: 1.553 ± 0.902
GlnPro: 3.882 ± 1.326
GlnGln: 0.776 ± 0.451
GlnArg: 2.329 ± 2.035
GlnSer: 2.329 ± 2.035
GlnThr: 1.553 ± 0.534
GlnVal: 2.329 ± 1.965
GlnTrp: 1.553 ± 0.534
GlnTyr: 2.329 ± 1.459
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.658 ± 1.75
ArgCys: 0.0 ± 0.0
ArgAsp: 3.106 ± 1.069
ArgGlu: 4.658 ± 1.217
ArgPhe: 3.106 ± 1.069
ArgGly: 5.435 ± 2.566
ArgHis: 0.776 ± 0.451
ArgIle: 2.329 ± 0.608
ArgLys: 3.106 ± 1.803
ArgLeu: 7.764 ± 1.774
ArgMet: 0.776 ± 0.451
ArgAsn: 3.882 ± 2.254
ArgPro: 0.776 ± 2.223
ArgGln: 0.0 ± 0.0
ArgArg: 8.54 ± 3.976
ArgSer: 5.435 ± 6.519
ArgThr: 0.0 ± 0.0
ArgVal: 6.988 ± 2.073
ArgTrp: 3.106 ± 0.928
ArgTyr: 3.106 ± 0.928
ArgXaa: 0.776 ± 0.451
Ser
SerAla: 4.658 ± 4.598
SerCys: 0.0 ± 0.0
SerAsp: 5.435 ± 2.051
SerGlu: 3.882 ± 4.348
SerPhe: 6.211 ± 4.094
SerGly: 8.54 ± 3.319
SerHis: 1.553 ± 2.049
SerIle: 4.658 ± 3.499
SerLys: 3.882 ± 2.254
SerLeu: 10.093 ± 2.863
SerMet: 0.776 ± 0.451
SerAsn: 0.776 ± 0.779
SerPro: 1.553 ± 1.48
SerGln: 2.329 ± 2.967
SerArg: 3.882 ± 1.053
SerSer: 3.882 ± 1.494
SerThr: 3.106 ± 1.829
SerVal: 6.988 ± 3.817
SerTrp: 1.553 ± 0.902
SerTyr: 3.882 ± 1.769
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.211 ± 2.332
ThrCys: 0.776 ± 0.779
ThrAsp: 0.776 ± 1.631
ThrGlu: 2.329 ± 0.608
ThrPhe: 2.329 ± 1.258
ThrGly: 3.882 ± 1.053
ThrHis: 0.776 ± 0.779
ThrIle: 0.776 ± 0.451
ThrLys: 4.658 ± 4.143
ThrLeu: 2.329 ± 0.608
ThrMet: 0.0 ± 0.0
ThrAsn: 0.0 ± 0.0
ThrPro: 3.882 ± 1.053
ThrGln: 2.329 ± 1.352
ThrArg: 1.553 ± 0.902
ThrSer: 4.658 ± 3.573
ThrThr: 6.211 ± 3.391
ThrVal: 3.106 ± 2.024
ThrTrp: 0.776 ± 0.451
ThrTyr: 0.776 ± 1.631
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.54 ± 3.675
ValCys: 0.776 ± 0.451
ValAsp: 1.553 ± 0.534
ValGlu: 6.211 ± 1.634
ValPhe: 3.106 ± 1.803
ValGly: 3.882 ± 1.326
ValHis: 0.776 ± 0.451
ValIle: 3.106 ± 1.573
ValLys: 4.658 ± 2.42
ValLeu: 6.988 ± 1.909
ValMet: 0.776 ± 0.451
ValAsn: 3.106 ± 2.621
ValPro: 3.106 ± 1.829
ValGln: 5.435 ± 1.503
ValArg: 6.211 ± 1.976
ValSer: 6.988 ± 1.572
ValThr: 4.658 ± 4.157
ValVal: 3.106 ± 0.928
ValTrp: 0.776 ± 0.451
ValTyr: 0.776 ± 0.451
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.553 ± 1.559
TrpCys: 0.776 ± 0.451
TrpAsp: 0.776 ± 0.451
TrpGlu: 0.776 ± 0.451
TrpPhe: 0.0 ± 0.0
TrpGly: 2.329 ± 0.608
TrpHis: 0.0 ± 0.0
TrpIle: 1.553 ± 0.902
TrpLys: 0.0 ± 0.0
TrpLeu: 3.882 ± 1.18
TrpMet: 0.0 ± 0.0
TrpAsn: 0.776 ± 0.451
TrpPro: 0.776 ± 0.779
TrpGln: 0.776 ± 0.451
TrpArg: 0.776 ± 0.451
TrpSer: 0.776 ± 0.779
TrpThr: 0.0 ± 0.0
TrpVal: 3.882 ± 2.33
TrpTrp: 0.776 ± 0.451
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 0.776 ± 0.451
TyrAsp: 3.106 ± 1.803
TyrGlu: 1.553 ± 0.534
TyrPhe: 0.0 ± 0.0
TyrGly: 0.776 ± 0.779
TyrHis: 0.0 ± 0.0
TyrIle: 0.776 ± 0.451
TyrLys: 1.553 ± 0.534
TyrLeu: 2.329 ± 1.459
TyrMet: 0.0 ± 0.0
TyrAsn: 5.435 ± 1.107
TyrPro: 0.776 ± 1.631
TyrGln: 1.553 ± 0.902
TyrArg: 1.553 ± 0.534
TyrSer: 3.106 ± 0.928
TyrThr: 1.553 ± 0.534
TyrVal: 2.329 ± 0.608
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.329 ± 1.459
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.776 ± 0.451
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1289 amino acids)
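For readers who want to work with the table entries above programmatically, a small parser along the following lines may help. It is only a sketch: the regular expression and the function name are assumptions based on the "FirstSecond: value ± error" pattern visible in the entries, not a format defined by the database.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(line):
    """Return (first, second, value, error) or None for non-entry lines."""
    match = ENTRY.search(line)
    if match is None:
        return None  # e.g. a row label such as "Ala"
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


entry = parse_entry("AlaSer: 9.317 ± 1.657")
```

Because the pattern is searched for rather than anchored at the start of the line, any text preceding the dipeptide name is ignored.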

Note: The error has been estimated by bootstrapping (×100) at the protein level.
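One plausible reading of protein-level bootstrapping is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the use of the sample standard deviation, the fixed seed and the toy sequences are all assumptions for illustration; the page does not describe the actual procedure in more detail.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return stdev(estimates)


# Made-up toy sequences, not taken from the proteome described here.
toy = ["MAAS", "AASL", "GGGG"]
error_aa = bootstrap_error(toy, "AA")
error_ww = bootstrap_error(toy, "WW")
```

With only a handful of proteins, as here, each resample can differ substantially from the original set, which is consistent with errors that are large relative to the values themselves.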

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski