Amino acid dipeptide frequency for Andreesenia angusta

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
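As a rough illustration of what such a statistic involves, the sketch below counts overlapping pairs of adjacent residues and normalises by the total number of pairs. The exact counting rules used for this page are not stated, so the overlapping-window counting, the one-letter codes, the mapping of non-standard residues to X (Xaa), and the function name `dipeptide_permille` are all assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MAAK", "AAKL"]
freqs = dipeptide_permille(toy)
```

With the two toy sequences there are six pairs in total (MA, AA, AK, AA, AK, KL), so `freqs["AA"]` is two sixths of 1000‰.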

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
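The arithmetic behind the uniform expectation and the per mille units can be sketched as follows. The three observed values are copied from the table on this page; the function name `classify` and the use of the uniform 1/400 baseline as the over/under threshold are assumptions for illustration, since the page does not spell out the exact colouring rule.

```python
N_STANDARD = 20

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_permille = 1000.0 / N_STANDARD ** 2  # 2.5 per mille, i.e. 0.25 %

# A few values taken from the table (per mille).
observed_permille = {"AlaAla": 4.219, "CysCys": 0.133, "GluLeu": 8.754}


def classify(value_permille, expected=expected_permille):
    """Label a dipeptide relative to the (assumed) uniform baseline."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


report = {}
for name, value in observed_permille.items():
    report[name] = {
        "fraction": value * 1e-3,  # per mille -> fraction
        "percent": value / 10.0,   # per mille -> percent
        "status": classify(value),
    }

for name, row in report.items():
    print(name, row)
```

Under this baseline AlaAla and GluLeu come out as overrepresented and CysCys as underrepresented.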

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.219 ± 0.119
AlaCys: 0.581 ± 0.027
AlaAsp: 3.314 ± 0.071
AlaGlu: 5.577 ± 0.102
AlaPhe: 2.455 ± 0.065
AlaGly: 4.886 ± 0.087
AlaHis: 0.943 ± 0.03
AlaIle: 5.873 ± 0.1
AlaLys: 5.001 ± 0.091
AlaLeu: 6.28 ± 0.11
AlaMet: 2.018 ± 0.055
AlaAsn: 2.338 ± 0.067
AlaPro: 1.653 ± 0.049
AlaGln: 1.479 ± 0.055
AlaArg: 2.743 ± 0.068
AlaSer: 4.326 ± 0.09
AlaThr: 3.054 ± 0.076
AlaVal: 5.429 ± 0.089
AlaTrp: 0.433 ± 0.025
AlaTyr: 2.165 ± 0.063
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.451 ± 0.027
CysCys: 0.133 ± 0.017
CysAsp: 0.464 ± 0.025
CysGlu: 0.628 ± 0.028
CysPhe: 0.306 ± 0.025
CysGly: 0.958 ± 0.041
CysHis: 0.171 ± 0.017
CysIle: 0.721 ± 0.034
CysLys: 0.64 ± 0.035
CysLeu: 0.588 ± 0.03
CysMet: 0.216 ± 0.017
CysAsn: 0.399 ± 0.026
CysPro: 0.481 ± 0.03
CysGln: 0.209 ± 0.019
CysArg: 0.45 ± 0.028
CysSer: 0.678 ± 0.035
CysThr: 0.444 ± 0.027
CysVal: 0.457 ± 0.029
CysTrp: 0.058 ± 0.009
CysTyr: 0.293 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.059 ± 0.078
AspCys: 0.477 ± 0.028
AspAsp: 2.557 ± 0.071
AspGlu: 5.144 ± 0.088
AspPhe: 2.574 ± 0.065
AspGly: 4.269 ± 0.082
AspHis: 0.715 ± 0.035
AspIle: 6.36 ± 0.109
AspLys: 4.17 ± 0.093
AspLeu: 4.749 ± 0.085
AspMet: 2.026 ± 0.049
AspAsn: 1.905 ± 0.057
AspPro: 1.658 ± 0.049
AspGln: 1.083 ± 0.042
AspArg: 3.043 ± 0.075
AspSer: 4.359 ± 0.077
AspThr: 2.548 ± 0.065
AspVal: 3.847 ± 0.081
AspTrp: 0.515 ± 0.027
AspTyr: 2.66 ± 0.062
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.658 ± 0.103
GluCys: 0.65 ± 0.033
GluAsp: 5.055 ± 0.09
GluGlu: 8.4 ± 0.155
GluPhe: 3.18 ± 0.076
GluGly: 5.254 ± 0.099
GluHis: 1.204 ± 0.046
GluIle: 7.573 ± 0.115
GluLys: 8.145 ± 0.115
GluLeu: 8.754 ± 0.126
GluMet: 2.424 ± 0.058
GluAsn: 4.553 ± 0.084
GluPro: 1.773 ± 0.061
GluGln: 2.217 ± 0.057
GluArg: 3.963 ± 0.082
GluSer: 6.263 ± 0.111
GluThr: 3.493 ± 0.07
GluVal: 6.195 ± 0.108
GluTrp: 0.529 ± 0.027
GluTyr: 3.276 ± 0.074
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.279 ± 0.065
PheCys: 0.364 ± 0.02
PheAsp: 2.145 ± 0.06
PheGlu: 3.198 ± 0.065
PhePhe: 1.591 ± 0.056
PheGly: 3.156 ± 0.082
PheHis: 0.498 ± 0.024
PheIle: 2.981 ± 0.072
PheLys: 3.3 ± 0.076
PheLeu: 3.446 ± 0.074
PheMet: 1.124 ± 0.046
PheAsn: 1.656 ± 0.048
PhePro: 1.158 ± 0.037
PheGln: 0.915 ± 0.034
PheArg: 1.581 ± 0.053
PheSer: 2.873 ± 0.07
PheThr: 1.898 ± 0.051
PheVal: 2.544 ± 0.069
PheTrp: 0.265 ± 0.021
PheTyr: 1.402 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.048 ± 0.107
GlyCys: 0.759 ± 0.038
GlyAsp: 4.191 ± 0.078
GlyGlu: 5.91 ± 0.094
GlyPhe: 3.016 ± 0.069
GlyGly: 5.136 ± 0.117
GlyHis: 1.241 ± 0.044
GlyIle: 6.686 ± 0.107
GlyLys: 5.783 ± 0.084
GlyLeu: 6.425 ± 0.109
GlyMet: 2.136 ± 0.057
GlyAsn: 2.937 ± 0.067
GlyPro: 1.403 ± 0.048
GlyGln: 1.643 ± 0.052
GlyArg: 3.095 ± 0.077
GlySer: 4.503 ± 0.084
GlyThr: 3.733 ± 0.086
GlyVal: 5.77 ± 0.095
GlyTrp: 0.532 ± 0.032
GlyTyr: 3.265 ± 0.095
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.759 ± 0.035
HisCys: 0.203 ± 0.018
HisAsp: 0.654 ± 0.03
HisGlu: 0.887 ± 0.034
HisPhe: 0.606 ± 0.03
HisGly: 1.189 ± 0.05
HisHis: 0.307 ± 0.022
HisIle: 1.444 ± 0.04
HisLys: 0.877 ± 0.039
HisLeu: 1.189 ± 0.04
HisMet: 0.45 ± 0.024
HisAsn: 0.618 ± 0.027
HisPro: 0.718 ± 0.031
HisGln: 0.354 ± 0.023
HisArg: 0.819 ± 0.035
HisSer: 1.13 ± 0.044
HisThr: 0.678 ± 0.034
HisVal: 0.877 ± 0.04
HisTrp: 0.148 ± 0.016
HisTyr: 0.609 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.194 ± 0.104
IleCys: 0.79 ± 0.038
IleAsp: 5.347 ± 0.09
IleGlu: 7.632 ± 0.103
IlePhe: 2.833 ± 0.078
IleGly: 5.937 ± 0.114
IleHis: 0.986 ± 0.039
IleIle: 5.328 ± 0.112
IleLys: 5.779 ± 0.098
IleLeu: 6.7 ± 0.1
IleMet: 2.286 ± 0.063
IleAsn: 3.336 ± 0.078
IlePro: 2.902 ± 0.07
IleGln: 1.737 ± 0.051
IleArg: 3.389 ± 0.066
IleSer: 6.535 ± 0.118
IleThr: 3.683 ± 0.077
IleVal: 6.547 ± 0.105
IleTrp: 0.51 ± 0.031
IleTyr: 2.93 ± 0.066
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.177 ± 0.096
LysCys: 0.558 ± 0.032
LysAsp: 4.559 ± 0.085
LysGlu: 6.825 ± 0.122
LysPhe: 2.359 ± 0.056
LysGly: 4.57 ± 0.093
LysHis: 1.256 ± 0.052
LysIle: 5.765 ± 0.083
LysLys: 6.409 ± 0.099
LysLeu: 7.853 ± 0.103
LysMet: 2.194 ± 0.064
LysAsn: 3.547 ± 0.066
LysPro: 2.028 ± 0.06
LysGln: 2.025 ± 0.055
LysArg: 3.743 ± 0.082
LysSer: 6.21 ± 0.111
LysThr: 3.806 ± 0.074
LysVal: 5.409 ± 0.089
LysTrp: 0.515 ± 0.029
LysTyr: 3.023 ± 0.07
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.91 ± 0.105
LeuCys: 0.746 ± 0.036
LeuAsp: 5.948 ± 0.101
LeuGlu: 8.726 ± 0.126
LeuPhe: 3.313 ± 0.075
LeuGly: 6.877 ± 0.112
LeuHis: 1.06 ± 0.042
LeuIle: 5.772 ± 0.105
LeuLys: 8.046 ± 0.119
LeuLeu: 7.783 ± 0.148
LeuMet: 2.286 ± 0.056
LeuAsn: 4.132 ± 0.073
LeuPro: 2.552 ± 0.071
LeuGln: 1.876 ± 0.052
LeuArg: 3.75 ± 0.086
LeuSer: 7.506 ± 0.121
LeuThr: 3.829 ± 0.076
LeuVal: 6.1 ± 0.109
LeuTrp: 0.529 ± 0.028
LeuTyr: 2.819 ± 0.063
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.372 ± 0.057
MetCys: 0.173 ± 0.016
MetAsp: 1.857 ± 0.055
MetGlu: 2.418 ± 0.058
MetPhe: 0.936 ± 0.037
MetGly: 2.162 ± 0.065
MetHis: 0.348 ± 0.019
MetIle: 2.16 ± 0.066
MetLys: 2.5 ± 0.055
MetLeu: 2.392 ± 0.057
MetMet: 0.86 ± 0.036
MetAsn: 1.397 ± 0.045
MetPro: 0.877 ± 0.035
MetGln: 0.635 ± 0.03
MetArg: 1.068 ± 0.037
MetSer: 1.881 ± 0.052
MetThr: 1.469 ± 0.046
MetVal: 2.097 ± 0.053
MetTrp: 0.121 ± 0.012
MetTyr: 0.812 ± 0.037
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.457 ± 0.063
AsnCys: 0.43 ± 0.029
AsnAsp: 1.809 ± 0.048
AsnGlu: 2.922 ± 0.072
AsnPhe: 1.63 ± 0.056
AsnGly: 3.104 ± 0.075
AsnHis: 0.708 ± 0.034
AsnIle: 4.387 ± 0.078
AsnLys: 2.916 ± 0.072
AsnLeu: 3.943 ± 0.083
AsnMet: 1.472 ± 0.044
AsnAsn: 1.664 ± 0.061
AsnPro: 1.952 ± 0.053
AsnGln: 1.107 ± 0.041
AsnArg: 2.241 ± 0.057
AsnSer: 3.183 ± 0.071
AsnThr: 2.159 ± 0.051
AsnVal: 2.544 ± 0.063
AsnTrp: 0.338 ± 0.022
AsnTyr: 1.818 ± 0.056
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.678 ± 0.05
ProCys: 0.278 ± 0.023
ProAsp: 1.788 ± 0.057
ProGlu: 3.17 ± 0.079
ProPhe: 1.262 ± 0.044
ProGly: 2.208 ± 0.062
ProHis: 0.513 ± 0.026
ProIle: 2.253 ± 0.062
ProLys: 2.045 ± 0.064
ProLeu: 2.416 ± 0.063
ProMet: 0.743 ± 0.03
ProAsn: 1.271 ± 0.046
ProPro: 0.733 ± 0.033
ProGln: 0.653 ± 0.03
ProArg: 1.038 ± 0.036
ProSer: 1.984 ± 0.057
ProThr: 1.443 ± 0.049
ProVal: 2.523 ± 0.062
ProTrp: 0.219 ± 0.017
ProTyr: 1.137 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.482 ± 0.048
GlnCys: 0.161 ± 0.015
GlnAsp: 1.251 ± 0.04
GlnGlu: 1.911 ± 0.056
GlnPhe: 0.876 ± 0.037
GlnGly: 1.567 ± 0.045
GlnHis: 0.303 ± 0.019
GlnIle: 1.952 ± 0.051
GlnLys: 1.99 ± 0.059
GlnLeu: 2.0 ± 0.065
GlnMet: 0.75 ± 0.029
GlnAsn: 1.314 ± 0.045
GlnPro: 0.556 ± 0.03
GlnGln: 0.635 ± 0.039
GlnArg: 1.201 ± 0.038
GlnSer: 1.716 ± 0.05
GlnThr: 1.151 ± 0.049
GlnVal: 1.468 ± 0.047
GlnTrp: 0.168 ± 0.015
GlnTyr: 0.842 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.928 ± 0.068
ArgCys: 0.406 ± 0.026
ArgAsp: 2.884 ± 0.078
ArgGlu: 4.508 ± 0.092
ArgPhe: 1.629 ± 0.046
ArgGly: 3.132 ± 0.073
ArgHis: 0.701 ± 0.034
ArgIle: 3.655 ± 0.075
ArgLys: 3.49 ± 0.078
ArgLeu: 3.74 ± 0.071
ArgMet: 1.238 ± 0.046
ArgAsn: 2.12 ± 0.058
ArgPro: 1.227 ± 0.04
ArgGln: 1.117 ± 0.043
ArgArg: 2.049 ± 0.06
ArgSer: 2.523 ± 0.063
ArgThr: 1.922 ± 0.05
ArgVal: 3.291 ± 0.072
ArgTrp: 0.312 ± 0.024
ArgTyr: 1.79 ± 0.052
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.2 ± 0.086
SerCys: 0.613 ± 0.034
SerAsp: 4.035 ± 0.085
SerGlu: 6.635 ± 0.119
SerPhe: 3.184 ± 0.073
SerGly: 6.17 ± 0.116
SerHis: 1.099 ± 0.039
SerIle: 6.072 ± 0.098
SerLys: 5.493 ± 0.092
SerLeu: 6.787 ± 0.116
SerMet: 1.932 ± 0.046
SerAsn: 2.794 ± 0.066
SerPro: 2.166 ± 0.053
SerGln: 1.907 ± 0.054
SerArg: 3.351 ± 0.077
SerSer: 5.25 ± 0.122
SerThr: 3.214 ± 0.076
SerVal: 4.948 ± 0.094
SerTrp: 0.534 ± 0.027
SerTyr: 2.716 ± 0.067
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.445 ± 0.081
ThrCys: 0.348 ± 0.024
ThrAsp: 2.585 ± 0.065
ThrGlu: 3.702 ± 0.082
ThrPhe: 1.751 ± 0.048
ThrGly: 4.222 ± 0.079
ThrHis: 0.787 ± 0.035
ThrIle: 3.644 ± 0.071
ThrLys: 2.771 ± 0.067
ThrLeu: 4.163 ± 0.077
ThrMet: 1.154 ± 0.04
ThrAsn: 1.723 ± 0.062
ThrPro: 1.839 ± 0.058
ThrGln: 1.024 ± 0.036
ThrArg: 1.928 ± 0.049
ThrSer: 3.001 ± 0.069
ThrThr: 2.411 ± 0.068
ThrVal: 4.181 ± 0.09
ThrTrp: 0.29 ± 0.02
ThrTyr: 1.485 ± 0.054
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.993 ± 0.087
ValCys: 0.659 ± 0.031
ValAsp: 4.635 ± 0.091
ValGlu: 6.978 ± 0.1
ValPhe: 2.966 ± 0.07
ValGly: 5.06 ± 0.098
ValHis: 0.993 ± 0.037
ValIle: 5.074 ± 0.09
ValLys: 5.484 ± 0.088
ValLeu: 6.643 ± 0.112
ValMet: 1.952 ± 0.054
ValAsn: 3.038 ± 0.066
ValPro: 2.259 ± 0.055
ValGln: 1.695 ± 0.049
ValArg: 2.627 ± 0.058
ValSer: 5.486 ± 0.089
ValThr: 3.286 ± 0.068
ValVal: 5.95 ± 0.105
ValTrp: 0.482 ± 0.029
ValTyr: 2.64 ± 0.065
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.461 ± 0.028
TrpCys: 0.051 ± 0.009
TrpAsp: 0.467 ± 0.03
TrpGlu: 0.515 ± 0.025
TrpPhe: 0.274 ± 0.02
TrpGly: 0.519 ± 0.029
TrpHis: 0.123 ± 0.013
TrpIle: 0.584 ± 0.03
TrpLys: 0.567 ± 0.027
TrpLeu: 0.551 ± 0.031
TrpMet: 0.195 ± 0.018
TrpAsn: 0.389 ± 0.028
TrpPro: 0.145 ± 0.017
TrpGln: 0.152 ± 0.015
TrpArg: 0.322 ± 0.023
TrpSer: 0.477 ± 0.028
TrpThr: 0.384 ± 0.025
TrpVal: 0.377 ± 0.021
TrpTrp: 0.073 ± 0.01
TrpTyr: 0.23 ± 0.022
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.0 ± 0.058
TyrCys: 0.364 ± 0.023
TyrAsp: 2.172 ± 0.074
TyrGlu: 2.874 ± 0.076
TyrPhe: 1.632 ± 0.05
TyrGly: 2.837 ± 0.066
TyrHis: 0.56 ± 0.026
TyrIle: 3.098 ± 0.071
TyrLys: 2.427 ± 0.061
TyrLeu: 3.159 ± 0.082
TyrMet: 1.08 ± 0.038
TyrAsn: 1.653 ± 0.049
TyrPro: 1.275 ± 0.053
TyrGln: 0.831 ± 0.034
TyrArg: 2.262 ± 0.061
TyrSer: 3.215 ± 0.069
TyrThr: 1.849 ± 0.053
TyrVal: 2.342 ± 0.052
TyrTrp: 0.282 ± 0.018
TyrTyr: 1.493 ± 0.051
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2379 proteins (709130 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
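Protein-level bootstrapping, as mentioned in the note above, can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed on each resample, and the spread of the replicates serves as the error. The note only states the number of replicates and the resampling level, so reporting the error as a standard deviation, the overlapping-pair counting, and the names `pair_permille` and `bootstrap_error` are assumptions for illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy = ["MAAKLAA", "GAAVK", "MKKLE", "AAAGL"]
err = bootstrap_error(toy, "AA")
print(f"AA: {pair_permille(toy, 'AA'):.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs are never formed across protein boundaries.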

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski