Amino acid dipepetide frequency for Williamwhitmania taraxaci

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.409AlaAla: 5.409 ± 0.092
0.685AlaCys: 0.685 ± 0.026
3.711AlaAsp: 3.711 ± 0.069
4.323AlaGlu: 4.323 ± 0.076
3.477AlaPhe: 3.477 ± 0.054
4.892AlaGly: 4.892 ± 0.079
1.214AlaHis: 1.214 ± 0.033
6.102AlaIle: 6.102 ± 0.072
4.811AlaLys: 4.811 ± 0.076
6.874AlaLeu: 6.874 ± 0.089
1.76AlaMet: 1.76 ± 0.036
3.618AlaAsn: 3.618 ± 0.062
2.12AlaPro: 2.12 ± 0.042
2.634AlaGln: 2.634 ± 0.049
2.604AlaArg: 2.604 ± 0.056
4.594AlaSer: 4.594 ± 0.081
4.437AlaThr: 4.437 ± 0.084
4.534AlaVal: 4.534 ± 0.062
0.706AlaTrp: 0.706 ± 0.025
2.59AlaTyr: 2.59 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.537CysAla: 0.537 ± 0.02
0.138CysCys: 0.138 ± 0.012
0.472CysAsp: 0.472 ± 0.021
0.451CysGlu: 0.451 ± 0.021
0.476CysPhe: 0.476 ± 0.021
0.777CysGly: 0.777 ± 0.03
0.211CysHis: 0.211 ± 0.016
0.645CysIle: 0.645 ± 0.025
0.571CysLys: 0.571 ± 0.022
0.762CysLeu: 0.762 ± 0.025
0.191CysMet: 0.191 ± 0.013
0.534CysAsn: 0.534 ± 0.022
0.474CysPro: 0.474 ± 0.028
0.265CysGln: 0.265 ± 0.014
0.37CysArg: 0.37 ± 0.02
0.717CysSer: 0.717 ± 0.027
0.515CysThr: 0.515 ± 0.024
0.492CysVal: 0.492 ± 0.02
0.095CysTrp: 0.095 ± 0.009
0.421CysTyr: 0.421 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
3.626AspAla: 3.626 ± 0.065
0.467AspCys: 0.467 ± 0.023
2.259AspAsp: 2.259 ± 0.049
3.07AspGlu: 3.07 ± 0.058
2.912AspPhe: 2.912 ± 0.06
3.586AspGly: 3.586 ± 0.065
0.76AspHis: 0.76 ± 0.029
4.041AspIle: 4.041 ± 0.066
3.547AspLys: 3.547 ± 0.067
4.742AspLeu: 4.742 ± 0.071
1.198AspMet: 1.198 ± 0.04
2.523AspAsn: 2.523 ± 0.051
1.727AspPro: 1.727 ± 0.041
1.505AspGln: 1.505 ± 0.04
2.109AspArg: 2.109 ± 0.05
3.59AspSer: 3.59 ± 0.073
2.612AspThr: 2.612 ± 0.048
3.213AspVal: 3.213 ± 0.054
0.525AspTrp: 0.525 ± 0.021
2.358AspTyr: 2.358 ± 0.049
0.0AspXaa: 0.0 ± 0.0
Glu
4.302GluAla: 4.302 ± 0.069
0.412GluCys: 0.412 ± 0.021
2.5GluAsp: 2.5 ± 0.049
4.209GluGlu: 4.209 ± 0.082
2.696GluPhe: 2.696 ± 0.05
3.453GluGly: 3.453 ± 0.059
1.037GluHis: 1.037 ± 0.033
5.011GluIle: 5.011 ± 0.072
5.129GluLys: 5.129 ± 0.088
6.045GluLeu: 6.045 ± 0.098
1.845GluMet: 1.845 ± 0.043
3.334GluAsn: 3.334 ± 0.058
1.723GluPro: 1.723 ± 0.04
2.408GluGln: 2.408 ± 0.052
2.763GluArg: 2.763 ± 0.06
3.508GluSer: 3.508 ± 0.05
3.084GluThr: 3.084 ± 0.05
4.534GluVal: 4.534 ± 0.072
0.606GluTrp: 0.606 ± 0.021
2.137GluTyr: 2.137 ± 0.048
0.0GluXaa: 0.0 ± 0.0
Phe
3.388PheAla: 3.388 ± 0.064
0.462PheCys: 0.462 ± 0.019
2.898PheAsp: 2.898 ± 0.049
2.867PheGlu: 2.867 ± 0.056
2.683PhePhe: 2.683 ± 0.056
3.66PheGly: 3.66 ± 0.059
0.848PheHis: 0.848 ± 0.03
3.515PheIle: 3.515 ± 0.057
2.871PheLys: 2.871 ± 0.048
4.598PheLeu: 4.598 ± 0.075
1.176PheMet: 1.176 ± 0.034
2.684PheAsn: 2.684 ± 0.05
1.792PhePro: 1.792 ± 0.043
1.45PheGln: 1.45 ± 0.036
2.064PheArg: 2.064 ± 0.04
4.309PheSer: 4.309 ± 0.065
3.16PheThr: 3.16 ± 0.06
3.291PheVal: 3.291 ± 0.054
0.536PheTrp: 0.536 ± 0.021
2.007PheTyr: 2.007 ± 0.045
0.0PheXaa: 0.0 ± 0.0
Gly
4.483GlyAla: 4.483 ± 0.079
0.782GlyCys: 0.782 ± 0.037
3.219GlyAsp: 3.219 ± 0.057
3.732GlyGlu: 3.732 ± 0.068
3.655GlyPhe: 3.655 ± 0.06
4.843GlyGly: 4.843 ± 0.108
1.11GlyHis: 1.11 ± 0.03
5.726GlyIle: 5.726 ± 0.079
5.167GlyLys: 5.167 ± 0.07
6.196GlyLeu: 6.196 ± 0.082
1.817GlyMet: 1.817 ± 0.04
3.566GlyAsn: 3.566 ± 0.074
1.445GlyPro: 1.445 ± 0.037
1.983GlyGln: 1.983 ± 0.044
2.602GlyArg: 2.602 ± 0.049
4.579GlySer: 4.579 ± 0.087
4.032GlyThr: 4.032 ± 0.111
5.019GlyVal: 5.019 ± 0.075
0.827GlyTrp: 0.827 ± 0.032
3.069GlyTyr: 3.069 ± 0.061
0.0GlyXaa: 0.0 ± 0.0
His
1.031HisAla: 1.031 ± 0.032
0.217HisCys: 0.217 ± 0.012
0.778HisAsp: 0.778 ± 0.024
0.942HisGlu: 0.942 ± 0.03
1.055HisPhe: 1.055 ± 0.033
1.202HisGly: 1.202 ± 0.029
0.417HisHis: 0.417 ± 0.019
1.255HisIle: 1.255 ± 0.033
1.081HisLys: 1.081 ± 0.033
1.744HisLeu: 1.744 ± 0.042
0.368HisMet: 0.368 ± 0.019
0.813HisAsn: 0.813 ± 0.031
0.886HisPro: 0.886 ± 0.031
0.661HisGln: 0.661 ± 0.024
0.811HisArg: 0.811 ± 0.027
1.209HisSer: 1.209 ± 0.029
0.959HisThr: 0.959 ± 0.026
0.868HisVal: 0.868 ± 0.028
0.195HisTrp: 0.195 ± 0.014
0.76HisTyr: 0.76 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
6.226IleAla: 6.226 ± 0.083
0.749IleCys: 0.749 ± 0.025
4.423IleAsp: 4.423 ± 0.061
4.774IleGlu: 4.774 ± 0.079
3.429IlePhe: 3.429 ± 0.058
5.154IleGly: 5.154 ± 0.073
1.355IleHis: 1.355 ± 0.039
5.445IleIle: 5.445 ± 0.092
4.885IleLys: 4.885 ± 0.069
6.679IleLeu: 6.679 ± 0.087
1.52IleMet: 1.52 ± 0.034
3.961IleAsn: 3.961 ± 0.066
3.474IlePro: 3.474 ± 0.059
2.338IleGln: 2.338 ± 0.05
3.099IleArg: 3.099 ± 0.051
5.838IleSer: 5.838 ± 0.073
4.984IleThr: 4.984 ± 0.084
5.17IleVal: 5.17 ± 0.071
0.697IleTrp: 0.697 ± 0.027
2.528IleTyr: 2.528 ± 0.05
0.0IleXaa: 0.0 ± 0.0
Lys
4.748LysAla: 4.748 ± 0.084
0.457LysCys: 0.457 ± 0.021
3.43LysAsp: 3.43 ± 0.063
5.167LysGlu: 5.167 ± 0.086
2.602LysPhe: 2.602 ± 0.047
4.491LysGly: 4.491 ± 0.073
1.21LysHis: 1.21 ± 0.029
4.972LysIle: 4.972 ± 0.07
4.958LysLys: 4.958 ± 0.076
6.121LysLeu: 6.121 ± 0.074
2.098LysMet: 2.098 ± 0.044
3.695LysAsn: 3.695 ± 0.055
2.532LysPro: 2.532 ± 0.05
2.555LysGln: 2.555 ± 0.052
3.078LysArg: 3.078 ± 0.052
4.187LysSer: 4.187 ± 0.071
3.959LysThr: 3.959 ± 0.061
4.92LysVal: 4.92 ± 0.07
0.66LysTrp: 0.66 ± 0.024
2.43LysTyr: 2.43 ± 0.043
0.0LysXaa: 0.0 ± 0.0
Leu
6.889LeuAla: 6.889 ± 0.079
0.865LeuCys: 0.865 ± 0.034
4.515LeuAsp: 4.515 ± 0.069
5.31LeuGlu: 5.31 ± 0.075
5.191LeuPhe: 5.191 ± 0.077
6.206LeuGly: 6.206 ± 0.08
1.601LeuHis: 1.601 ± 0.036
6.658LeuIle: 6.658 ± 0.084
6.636LeuLys: 6.636 ± 0.085
9.888LeuLeu: 9.888 ± 0.126
2.346LeuMet: 2.346 ± 0.047
5.012LeuAsn: 5.012 ± 0.074
3.878LeuPro: 3.878 ± 0.066
3.107LeuGln: 3.107 ± 0.057
4.006LeuArg: 4.006 ± 0.069
7.293LeuSer: 7.293 ± 0.087
5.642LeuThr: 5.642 ± 0.073
6.423LeuVal: 6.423 ± 0.089
0.853LeuTrp: 0.853 ± 0.03
3.201LeuTyr: 3.201 ± 0.046
0.0LeuXaa: 0.0 ± 0.0
Met
2.141MetAla: 2.141 ± 0.051
0.18MetCys: 0.18 ± 0.012
1.333MetAsp: 1.333 ± 0.036
1.728MetGlu: 1.728 ± 0.036
0.921MetPhe: 0.921 ± 0.029
1.78MetGly: 1.78 ± 0.042
0.458MetHis: 0.458 ± 0.02
1.49MetIle: 1.49 ± 0.04
1.936MetLys: 1.936 ± 0.04
2.489MetLeu: 2.489 ± 0.047
0.571MetMet: 0.571 ± 0.024
1.221MetAsn: 1.221 ± 0.031
1.062MetPro: 1.062 ± 0.028
0.863MetGln: 0.863 ± 0.028
1.126MetArg: 1.126 ± 0.031
1.337MetSer: 1.337 ± 0.033
1.053MetThr: 1.053 ± 0.03
2.073MetVal: 2.073 ± 0.04
0.184MetTrp: 0.184 ± 0.011
0.556MetTyr: 0.556 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
3.615AsnAla: 3.615 ± 0.056
0.423AsnCys: 0.423 ± 0.018
2.485AsnAsp: 2.485 ± 0.051
2.948AsnGlu: 2.948 ± 0.051
2.529AsnPhe: 2.529 ± 0.055
3.935AsnGly: 3.935 ± 0.088
0.97AsnHis: 0.97 ± 0.028
3.986AsnIle: 3.986 ± 0.059
3.23AsnLys: 3.23 ± 0.051
5.221AsnLeu: 5.221 ± 0.074
1.144AsnMet: 1.144 ± 0.033
2.831AsnAsn: 2.831 ± 0.06
2.721AsnPro: 2.721 ± 0.052
2.004AsnGln: 2.004 ± 0.043
2.397AsnArg: 2.397 ± 0.046
3.696AsnSer: 3.696 ± 0.059
3.006AsnThr: 3.006 ± 0.059
3.04AsnVal: 3.04 ± 0.055
0.556AsnTrp: 0.556 ± 0.021
2.204AsnTyr: 2.204 ± 0.045
0.0AsnXaa: 0.0 ± 0.0
Pro
2.553ProAla: 2.553 ± 0.05
0.29ProCys: 0.29 ± 0.014
2.091ProAsp: 2.091 ± 0.049
2.693ProGlu: 2.693 ± 0.05
1.999ProPhe: 1.999 ± 0.045
2.204ProGly: 2.204 ± 0.044
0.632ProHis: 0.632 ± 0.026
2.955ProIle: 2.955 ± 0.056
2.192ProLys: 2.192 ± 0.048
3.345ProLeu: 3.345 ± 0.06
0.861ProMet: 0.861 ± 0.028
2.018ProAsn: 2.018 ± 0.044
0.959ProPro: 0.959 ± 0.031
1.162ProGln: 1.162 ± 0.031
1.206ProArg: 1.206 ± 0.034
2.469ProSer: 2.469 ± 0.046
2.387ProThr: 2.387 ± 0.058
2.642ProVal: 2.642 ± 0.057
0.372ProTrp: 0.372 ± 0.021
1.451ProTyr: 1.451 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
2.242GlnAla: 2.242 ± 0.047
0.232GlnCys: 0.232 ± 0.016
1.379GlnAsp: 1.379 ± 0.04
2.064GlnGlu: 2.064 ± 0.046
1.454GlnPhe: 1.454 ± 0.035
1.951GlnGly: 1.951 ± 0.045
0.616GlnHis: 0.616 ± 0.023
2.412GlnIle: 2.412 ± 0.053
2.611GlnLys: 2.611 ± 0.058
3.577GlnLeu: 3.577 ± 0.064
0.926GlnMet: 0.926 ± 0.029
1.833GlnAsn: 1.833 ± 0.042
1.194GlnPro: 1.194 ± 0.036
1.453GlnGln: 1.453 ± 0.041
1.573GlnArg: 1.573 ± 0.04
2.066GlnSer: 2.066 ± 0.047
1.826GlnThr: 1.826 ± 0.041
2.222GlnVal: 2.222 ± 0.041
0.361GlnTrp: 0.361 ± 0.017
1.163GlnTyr: 1.163 ± 0.03
0.0GlnXaa: 0.0 ± 0.0
Arg
2.435ArgAla: 2.435 ± 0.045
0.37ArgCys: 0.37 ± 0.02
1.929ArgAsp: 1.929 ± 0.04
2.647ArgGlu: 2.647 ± 0.054
2.293ArgPhe: 2.293 ± 0.043
2.266ArgGly: 2.266 ± 0.047
0.743ArgHis: 0.743 ± 0.026
3.572ArgIle: 3.572 ± 0.054
3.147ArgLys: 3.147 ± 0.062
4.008ArgLeu: 4.008 ± 0.067
1.25ArgMet: 1.25 ± 0.034
2.327ArgAsn: 2.327 ± 0.05
1.294ArgPro: 1.294 ± 0.043
1.436ArgGln: 1.436 ± 0.041
1.754ArgArg: 1.754 ± 0.045
2.515ArgSer: 2.515 ± 0.053
2.083ArgThr: 2.083 ± 0.039
2.771ArgVal: 2.771 ± 0.05
0.524ArgTrp: 0.524 ± 0.022
1.872ArgTyr: 1.872 ± 0.043
0.0ArgXaa: 0.0 ± 0.0
Ser
4.634SerAla: 4.634 ± 0.07
0.682SerCys: 0.682 ± 0.025
3.546SerAsp: 3.546 ± 0.055
3.849SerGlu: 3.849 ± 0.066
3.738SerPhe: 3.738 ± 0.061
5.117SerGly: 5.117 ± 0.086
1.173SerHis: 1.173 ± 0.033
5.695SerIle: 5.695 ± 0.08
4.346SerLys: 4.346 ± 0.069
6.876SerLeu: 6.876 ± 0.089
1.647SerMet: 1.647 ± 0.039
3.669SerAsn: 3.669 ± 0.063
2.541SerPro: 2.541 ± 0.051
2.128SerGln: 2.128 ± 0.041
2.749SerArg: 2.749 ± 0.053
4.989SerSer: 4.989 ± 0.087
3.915SerThr: 3.915 ± 0.074
4.616SerVal: 4.616 ± 0.066
0.755SerTrp: 0.755 ± 0.026
2.794SerTyr: 2.794 ± 0.057
0.0SerXaa: 0.0 ± 0.0
Thr
4.215ThrAla: 4.215 ± 0.091
0.474ThrCys: 0.474 ± 0.025
3.037ThrAsp: 3.037 ± 0.059
3.124ThrGlu: 3.124 ± 0.059
2.894ThrPhe: 2.894 ± 0.063
4.367ThrGly: 4.367 ± 0.086
0.976ThrHis: 0.976 ± 0.03
5.228ThrIle: 5.228 ± 0.086
3.326ThrLys: 3.326 ± 0.064
5.678ThrLeu: 5.678 ± 0.088
1.113ThrMet: 1.113 ± 0.03
3.001ThrAsn: 3.001 ± 0.067
2.589ThrPro: 2.589 ± 0.058
1.739ThrGln: 1.739 ± 0.041
1.955ThrArg: 1.955 ± 0.042
3.832ThrSer: 3.832 ± 0.064
3.804ThrThr: 3.804 ± 0.1
4.001ThrVal: 4.001 ± 0.088
0.551ThrTrp: 0.551 ± 0.025
2.347ThrTyr: 2.347 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
5.38ValAla: 5.38 ± 0.081
0.696ValCys: 0.696 ± 0.027
3.88ValAsp: 3.88 ± 0.056
4.471ValGlu: 4.471 ± 0.069
3.41ValPhe: 3.41 ± 0.061
4.676ValGly: 4.676 ± 0.076
0.985ValHis: 0.985 ± 0.031
4.787ValIle: 4.787 ± 0.068
4.693ValLys: 4.693 ± 0.069
5.841ValLeu: 5.841 ± 0.078
1.596ValMet: 1.596 ± 0.041
3.541ValAsn: 3.541 ± 0.062
2.324ValPro: 2.324 ± 0.047
1.698ValGln: 1.698 ± 0.039
2.633ValArg: 2.633 ± 0.052
4.957ValSer: 4.957 ± 0.069
3.836ValThr: 3.836 ± 0.078
5.461ValVal: 5.461 ± 0.089
0.671ValTrp: 0.671 ± 0.027
2.467ValTyr: 2.467 ± 0.056
0.0ValXaa: 0.0 ± 0.0
Trp
0.664TrpAla: 0.664 ± 0.024
0.113TrpCys: 0.113 ± 0.01
0.571TrpAsp: 0.571 ± 0.023
0.553TrpGlu: 0.553 ± 0.022
0.579TrpPhe: 0.579 ± 0.024
0.693TrpGly: 0.693 ± 0.025
0.205TrpHis: 0.205 ± 0.015
0.771TrpIle: 0.771 ± 0.028
0.658TrpLys: 0.658 ± 0.027
1.045TrpLeu: 1.045 ± 0.035
0.303TrpMet: 0.303 ± 0.015
0.58TrpAsn: 0.58 ± 0.025
0.23TrpPro: 0.23 ± 0.013
0.433TrpGln: 0.433 ± 0.017
0.433TrpArg: 0.433 ± 0.018
0.65TrpSer: 0.65 ± 0.027
0.531TrpThr: 0.531 ± 0.029
0.735TrpVal: 0.735 ± 0.027
0.133TrpTrp: 0.133 ± 0.011
0.378TrpTyr: 0.378 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.578TyrAla: 2.578 ± 0.048
0.421TyrCys: 0.421 ± 0.018
1.913TyrAsp: 1.913 ± 0.045
1.812TyrGlu: 1.812 ± 0.043
2.265TyrPhe: 2.265 ± 0.044
2.597TyrGly: 2.597 ± 0.052
0.734TyrHis: 0.734 ± 0.026
2.459TyrIle: 2.459 ± 0.05
2.475TyrLys: 2.475 ± 0.053
3.791TyrLeu: 3.791 ± 0.059
0.771TyrMet: 0.771 ± 0.028
2.181TyrAsn: 2.181 ± 0.049
1.518TyrPro: 1.518 ± 0.041
1.332TyrGln: 1.332 ± 0.035
1.905TyrArg: 1.905 ± 0.039
3.124TyrSer: 3.124 ± 0.06
2.429TyrThr: 2.429 ± 0.062
1.982TyrVal: 1.982 ± 0.044
0.448TyrTrp: 0.448 ± 0.019
1.783TyrTyr: 1.783 ± 0.042
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3616 proteins (1202311 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski