Amino acid dipepetide frequency for Anthracocystis flocculosa PF-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.487AlaAla: 19.487 ± 0.184
0.926AlaCys: 0.926 ± 0.023
6.258AlaAsp: 6.258 ± 0.051
6.215AlaGlu: 6.215 ± 0.068
3.172AlaPhe: 3.172 ± 0.04
9.178AlaGly: 9.178 ± 0.082
2.338AlaHis: 2.338 ± 0.032
3.752AlaIle: 3.752 ± 0.037
4.843AlaLys: 4.843 ± 0.054
8.861AlaLeu: 8.861 ± 0.07
2.143AlaMet: 2.143 ± 0.03
3.057AlaAsn: 3.057 ± 0.037
7.383AlaPro: 7.383 ± 0.085
4.469AlaGln: 4.469 ± 0.054
7.193AlaArg: 7.193 ± 0.064
12.796AlaSer: 12.796 ± 0.117
7.273AlaThr: 7.273 ± 0.067
6.024AlaVal: 6.024 ± 0.058
1.147AlaTrp: 1.147 ± 0.022
2.078AlaTyr: 2.078 ± 0.032
0.001AlaXaa: 0.001 ± 0.001
Cys
0.822CysAla: 0.822 ± 0.019
0.171CysCys: 0.171 ± 0.008
0.55CysAsp: 0.55 ± 0.014
0.45CysGlu: 0.45 ± 0.016
0.387CysPhe: 0.387 ± 0.013
0.726CysGly: 0.726 ± 0.021
0.252CysHis: 0.252 ± 0.009
0.472CysIle: 0.472 ± 0.012
0.376CysLys: 0.376 ± 0.013
0.948CysLeu: 0.948 ± 0.023
0.168CysMet: 0.168 ± 0.008
0.279CysAsn: 0.279 ± 0.011
0.518CysPro: 0.518 ± 0.018
0.341CysGln: 0.341 ± 0.012
0.678CysArg: 0.678 ± 0.018
0.733CysSer: 0.733 ± 0.022
0.534CysThr: 0.534 ± 0.021
0.595CysVal: 0.595 ± 0.017
0.146CysTrp: 0.146 ± 0.007
0.25CysTyr: 0.25 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
7.416AspAla: 7.416 ± 0.069
0.486AspCys: 0.486 ± 0.017
7.079AspAsp: 7.079 ± 0.115
5.423AspGlu: 5.423 ± 0.062
1.79AspPhe: 1.79 ± 0.027
5.654AspGly: 5.654 ± 0.057
1.319AspHis: 1.319 ± 0.025
1.946AspIle: 1.946 ± 0.029
2.096AspLys: 2.096 ± 0.038
4.997AspLeu: 4.997 ± 0.047
0.969AspMet: 0.969 ± 0.018
1.354AspAsn: 1.354 ± 0.024
3.614AspPro: 3.614 ± 0.038
1.98AspGln: 1.98 ± 0.028
3.799AspArg: 3.799 ± 0.044
3.982AspSer: 3.982 ± 0.045
2.529AspThr: 2.529 ± 0.031
3.867AspVal: 3.867 ± 0.047
0.74AspTrp: 0.74 ± 0.018
1.185AspTyr: 1.185 ± 0.022
0.0AspXaa: 0.0 ± 0.0
Glu
7.112GluAla: 7.112 ± 0.078
0.453GluCys: 0.453 ± 0.013
3.904GluAsp: 3.904 ± 0.054
4.54GluGlu: 4.54 ± 0.078
1.303GluPhe: 1.303 ± 0.027
4.224GluGly: 4.224 ± 0.046
1.27GluHis: 1.27 ± 0.023
2.04GluIle: 2.04 ± 0.032
2.39GluLys: 2.39 ± 0.038
4.84GluLeu: 4.84 ± 0.055
1.211GluMet: 1.211 ± 0.02
1.16GluAsn: 1.16 ± 0.024
2.65GluPro: 2.65 ± 0.038
2.468GluGln: 2.468 ± 0.036
4.649GluArg: 4.649 ± 0.056
3.613GluSer: 3.613 ± 0.039
2.711GluThr: 2.711 ± 0.034
3.284GluVal: 3.284 ± 0.041
0.721GluTrp: 0.721 ± 0.018
1.086GluTyr: 1.086 ± 0.024
0.001GluXaa: 0.001 ± 0.0
Phe
3.107PheAla: 3.107 ± 0.033
0.409PheCys: 0.409 ± 0.013
2.162PheAsp: 2.162 ± 0.032
1.782PheGlu: 1.782 ± 0.029
1.197PhePhe: 1.197 ± 0.025
2.726PheGly: 2.726 ± 0.047
0.703PheHis: 0.703 ± 0.015
1.048PheIle: 1.048 ± 0.024
1.06PheLys: 1.06 ± 0.023
2.648PheLeu: 2.648 ± 0.034
0.502PheMet: 0.502 ± 0.015
0.933PheAsn: 0.933 ± 0.02
1.406PhePro: 1.406 ± 0.026
1.005PheGln: 1.005 ± 0.019
1.758PheArg: 1.758 ± 0.028
2.405PheSer: 2.405 ± 0.034
1.469PheThr: 1.469 ± 0.026
2.053PheVal: 2.053 ± 0.032
0.431PheTrp: 0.431 ± 0.014
0.798PheTyr: 0.798 ± 0.022
0.0PheXaa: 0.0 ± 0.0
Gly
8.624GlyAla: 8.624 ± 0.081
0.72GlyCys: 0.72 ± 0.02
4.698GlyAsp: 4.698 ± 0.048
3.871GlyGlu: 3.871 ± 0.045
2.419GlyPhe: 2.419 ± 0.04
10.173GlyGly: 10.173 ± 0.129
1.944GlyHis: 1.944 ± 0.032
2.791GlyIle: 2.791 ± 0.032
3.402GlyLys: 3.402 ± 0.043
6.048GlyLeu: 6.048 ± 0.059
1.464GlyMet: 1.464 ± 0.028
2.106GlyAsn: 2.106 ± 0.039
4.383GlyPro: 4.383 ± 0.058
3.176GlyGln: 3.176 ± 0.043
5.279GlyArg: 5.279 ± 0.055
8.11GlySer: 8.11 ± 0.089
4.175GlyThr: 4.175 ± 0.052
4.146GlyVal: 4.146 ± 0.044
1.008GlyTrp: 1.008 ± 0.019
1.742GlyTyr: 1.742 ± 0.03
0.0GlyXaa: 0.0 ± 0.0
His
2.507HisAla: 2.507 ± 0.036
0.241HisCys: 0.241 ± 0.009
1.503HisAsp: 1.503 ± 0.027
1.191HisGlu: 1.191 ± 0.023
0.8HisPhe: 0.8 ± 0.019
1.96HisGly: 1.96 ± 0.033
1.225HisHis: 1.225 ± 0.041
0.836HisIle: 0.836 ± 0.019
0.685HisLys: 0.685 ± 0.016
2.343HisLeu: 2.343 ± 0.029
0.325HisMet: 0.325 ± 0.01
0.579HisAsn: 0.579 ± 0.017
1.812HisPro: 1.812 ± 0.03
1.16HisGln: 1.16 ± 0.037
1.847HisArg: 1.847 ± 0.031
1.862HisSer: 1.862 ± 0.031
1.091HisThr: 1.091 ± 0.019
1.316HisVal: 1.316 ± 0.022
0.254HisTrp: 0.254 ± 0.01
0.536HisTyr: 0.536 ± 0.014
0.0HisXaa: 0.0 ± 0.0
Ile
3.592IleAla: 3.592 ± 0.04
0.479IleCys: 0.479 ± 0.014
2.614IleAsp: 2.614 ± 0.033
2.289IleGlu: 2.289 ± 0.029
1.242IlePhe: 1.242 ± 0.024
2.472IleGly: 2.472 ± 0.033
0.812IleHis: 0.812 ± 0.018
1.258IleIle: 1.258 ± 0.025
1.588IleLys: 1.588 ± 0.025
3.113IleLeu: 3.113 ± 0.04
0.58IleMet: 0.58 ± 0.015
1.115IleAsn: 1.115 ± 0.024
1.943IlePro: 1.943 ± 0.028
1.29IleGln: 1.29 ± 0.025
2.383IleArg: 2.383 ± 0.032
2.712IleSer: 2.712 ± 0.035
1.684IleThr: 1.684 ± 0.029
2.394IleVal: 2.394 ± 0.037
0.447IleTrp: 0.447 ± 0.012
0.863IleTyr: 0.863 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
4.785LysAla: 4.785 ± 0.047
0.326LysCys: 0.326 ± 0.012
2.226LysAsp: 2.226 ± 0.036
2.397LysGlu: 2.397 ± 0.037
0.893LysPhe: 0.893 ± 0.021
2.829LysGly: 2.829 ± 0.039
0.874LysHis: 0.874 ± 0.018
1.377LysIle: 1.377 ± 0.028
2.62LysLys: 2.62 ± 0.052
3.45LysLeu: 3.45 ± 0.04
0.811LysMet: 0.811 ± 0.018
0.994LysAsn: 0.994 ± 0.025
2.311LysPro: 2.311 ± 0.035
1.702LysGln: 1.702 ± 0.029
3.512LysArg: 3.512 ± 0.043
2.701LysSer: 2.701 ± 0.038
2.175LysThr: 2.175 ± 0.032
2.518LysVal: 2.518 ± 0.035
0.448LysTrp: 0.448 ± 0.013
0.73LysTyr: 0.73 ± 0.019
0.001LysXaa: 0.001 ± 0.001
Leu
9.36LeuAla: 9.36 ± 0.073
1.042LeuCys: 1.042 ± 0.023
5.658LeuAsp: 5.658 ± 0.05
4.998LeuGlu: 4.998 ± 0.05
2.813LeuPhe: 2.813 ± 0.041
6.071LeuGly: 6.071 ± 0.056
2.054LeuHis: 2.054 ± 0.03
2.875LeuIle: 2.875 ± 0.041
3.197LeuLys: 3.197 ± 0.041
7.916LeuLeu: 7.916 ± 0.093
1.411LeuMet: 1.411 ± 0.025
2.325LeuAsn: 2.325 ± 0.03
5.732LeuPro: 5.732 ± 0.05
3.473LeuGln: 3.473 ± 0.042
6.136LeuArg: 6.136 ± 0.054
7.463LeuSer: 7.463 ± 0.057
4.083LeuThr: 4.083 ± 0.042
5.49LeuVal: 5.49 ± 0.061
0.913LeuTrp: 0.913 ± 0.02
1.845LeuTyr: 1.845 ± 0.029
0.0LeuXaa: 0.0 ± 0.0
Met
2.236MetAla: 2.236 ± 0.031
0.159MetCys: 0.159 ± 0.008
1.037MetAsp: 1.037 ± 0.022
0.85MetGlu: 0.85 ± 0.018
0.473MetPhe: 0.473 ± 0.013
1.307MetGly: 1.307 ± 0.026
0.393MetHis: 0.393 ± 0.012
0.61MetIle: 0.61 ± 0.014
0.603MetLys: 0.603 ± 0.015
1.704MetLeu: 1.704 ± 0.025
0.448MetMet: 0.448 ± 0.013
0.434MetAsn: 0.434 ± 0.013
1.344MetPro: 1.344 ± 0.025
0.806MetGln: 0.806 ± 0.021
1.243MetArg: 1.243 ± 0.019
1.707MetSer: 1.707 ± 0.028
1.055MetThr: 1.055 ± 0.019
1.074MetVal: 1.074 ± 0.021
0.188MetTrp: 0.188 ± 0.008
0.322MetTyr: 0.322 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
2.999AsnAla: 2.999 ± 0.031
0.238AsnCys: 0.238 ± 0.011
1.602AsnAsp: 1.602 ± 0.025
1.348AsnGlu: 1.348 ± 0.022
0.847AsnPhe: 0.847 ± 0.018
2.725AsnGly: 2.725 ± 0.045
0.588AsnHis: 0.588 ± 0.017
1.001AsnIle: 1.001 ± 0.023
1.03AsnLys: 1.03 ± 0.02
2.507AsnLeu: 2.507 ± 0.034
0.505AsnMet: 0.505 ± 0.016
0.878AsnAsn: 0.878 ± 0.025
1.739AsnPro: 1.739 ± 0.031
0.902AsnGln: 0.902 ± 0.018
1.66AsnArg: 1.66 ± 0.024
1.92AsnSer: 1.92 ± 0.033
1.314AsnThr: 1.314 ± 0.021
1.769AsnVal: 1.769 ± 0.026
0.298AsnTrp: 0.298 ± 0.012
0.585AsnTyr: 0.585 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
8.173ProAla: 8.173 ± 0.087
0.431ProCys: 0.431 ± 0.015
3.185ProAsp: 3.185 ± 0.035
2.928ProGlu: 2.928 ± 0.033
1.827ProPhe: 1.827 ± 0.028
4.54ProGly: 4.54 ± 0.061
1.572ProHis: 1.572 ± 0.033
1.997ProIle: 1.997 ± 0.03
2.129ProLys: 2.129 ± 0.034
5.089ProLeu: 5.089 ± 0.046
1.024ProMet: 1.024 ± 0.025
1.69ProAsn: 1.69 ± 0.029
6.537ProPro: 6.537 ± 0.1
2.561ProGln: 2.561 ± 0.04
4.129ProArg: 4.129 ± 0.047
8.381ProSer: 8.381 ± 0.099
4.258ProThr: 4.258 ± 0.046
3.199ProVal: 3.199 ± 0.034
0.588ProTrp: 0.588 ± 0.016
1.246ProTyr: 1.246 ± 0.026
0.001ProXaa: 0.001 ± 0.001
Gln
4.582GlnAla: 4.582 ± 0.05
0.296GlnCys: 0.296 ± 0.012
2.098GlnAsp: 2.098 ± 0.028
1.989GlnGlu: 1.989 ± 0.03
0.868GlnPhe: 0.868 ± 0.017
2.823GlnGly: 2.823 ± 0.039
1.496GlnHis: 1.496 ± 0.04
1.374GlnIle: 1.374 ± 0.024
1.42GlnLys: 1.42 ± 0.024
3.597GlnLeu: 3.597 ± 0.045
0.808GlnMet: 0.808 ± 0.022
0.978GlnAsn: 0.978 ± 0.018
3.185GlnPro: 3.185 ± 0.05
4.688GlnGln: 4.688 ± 0.143
3.472GlnArg: 3.472 ± 0.04
3.004GlnSer: 3.004 ± 0.04
1.963GlnThr: 1.963 ± 0.029
2.046GlnVal: 2.046 ± 0.031
0.428GlnTrp: 0.428 ± 0.011
0.76GlnTyr: 0.76 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
6.737ArgAla: 6.737 ± 0.052
0.7ArgCys: 0.7 ± 0.018
3.994ArgAsp: 3.994 ± 0.049
3.796ArgGlu: 3.796 ± 0.046
2.162ArgPhe: 2.162 ± 0.031
4.784ArgGly: 4.784 ± 0.056
1.885ArgHis: 1.885 ± 0.033
2.787ArgIle: 2.787 ± 0.04
3.279ArgLys: 3.279 ± 0.043
6.18ArgLeu: 6.18 ± 0.053
1.337ArgMet: 1.337 ± 0.02
1.998ArgAsn: 1.998 ± 0.027
4.684ArgPro: 4.684 ± 0.055
3.193ArgGln: 3.193 ± 0.043
6.797ArgArg: 6.797 ± 0.089
6.56ArgSer: 6.56 ± 0.074
3.719ArgThr: 3.719 ± 0.043
3.194ArgVal: 3.194 ± 0.041
0.869ArgTrp: 0.869 ± 0.019
1.511ArgTyr: 1.511 ± 0.026
0.002ArgXaa: 0.002 ± 0.001
Ser
10.93SerAla: 10.93 ± 0.106
0.712SerCys: 0.712 ± 0.021
4.887SerAsp: 4.887 ± 0.056
3.753SerGlu: 3.753 ± 0.043
2.698SerPhe: 2.698 ± 0.034
7.325SerGly: 7.325 ± 0.086
2.106SerHis: 2.106 ± 0.032
3.284SerIle: 3.284 ± 0.04
3.269SerLys: 3.269 ± 0.041
7.67SerLeu: 7.67 ± 0.059
1.676SerMet: 1.676 ± 0.024
2.595SerAsn: 2.595 ± 0.042
6.574SerPro: 6.574 ± 0.087
3.428SerGln: 3.428 ± 0.042
6.175SerArg: 6.175 ± 0.067
14.29SerSer: 14.29 ± 0.166
6.557SerThr: 6.557 ± 0.069
4.418SerVal: 4.418 ± 0.043
0.881SerTrp: 0.881 ± 0.016
1.754SerTyr: 1.754 ± 0.027
0.001SerXaa: 0.001 ± 0.0
Thr
7.033ThrAla: 7.033 ± 0.066
0.534ThrCys: 0.534 ± 0.018
2.686ThrAsp: 2.686 ± 0.04
2.394ThrGlu: 2.394 ± 0.034
1.744ThrPhe: 1.744 ± 0.031
4.096ThrGly: 4.096 ± 0.048
1.103ThrHis: 1.103 ± 0.022
2.113ThrIle: 2.113 ± 0.03
2.016ThrLys: 2.016 ± 0.028
4.774ThrLeu: 4.774 ± 0.045
0.996ThrMet: 0.996 ± 0.021
1.397ThrAsn: 1.397 ± 0.026
4.426ThrPro: 4.426 ± 0.054
1.737ThrGln: 1.737 ± 0.028
3.271ThrArg: 3.271 ± 0.035
5.914ThrSer: 5.914 ± 0.061
4.237ThrThr: 4.237 ± 0.055
3.134ThrVal: 3.134 ± 0.035
0.612ThrTrp: 0.612 ± 0.018
1.139ThrTyr: 1.139 ± 0.024
0.001ThrXaa: 0.001 ± 0.001
Val
6.157ValAla: 6.157 ± 0.053
0.642ValCys: 0.642 ± 0.021
3.922ValAsp: 3.922 ± 0.041
3.764ValGlu: 3.764 ± 0.046
1.832ValPhe: 1.832 ± 0.031
4.222ValGly: 4.222 ± 0.044
1.293ValHis: 1.293 ± 0.024
1.967ValIle: 1.967 ± 0.036
2.406ValLys: 2.406 ± 0.039
5.071ValLeu: 5.071 ± 0.05
0.994ValMet: 0.994 ± 0.02
1.499ValAsn: 1.499 ± 0.027
3.533ValPro: 3.533 ± 0.041
2.197ValGln: 2.197 ± 0.03
3.845ValArg: 3.845 ± 0.042
4.305ValSer: 4.305 ± 0.045
2.629ValThr: 2.629 ± 0.034
4.114ValVal: 4.114 ± 0.049
0.737ValTrp: 0.737 ± 0.019
1.313ValTyr: 1.313 ± 0.028
0.001ValXaa: 0.001 ± 0.001
Trp
0.929TrpAla: 0.929 ± 0.018
0.173TrpCys: 0.173 ± 0.007
0.719TrpAsp: 0.719 ± 0.02
0.566TrpGlu: 0.566 ± 0.016
0.352TrpPhe: 0.352 ± 0.012
0.669TrpGly: 0.669 ± 0.017
0.287TrpHis: 0.287 ± 0.011
0.539TrpIle: 0.539 ± 0.017
0.553TrpLys: 0.553 ± 0.015
1.082TrpLeu: 1.082 ± 0.021
0.262TrpMet: 0.262 ± 0.01
0.425TrpAsn: 0.425 ± 0.013
0.539TrpPro: 0.539 ± 0.013
0.541TrpGln: 0.541 ± 0.014
0.856TrpArg: 0.856 ± 0.02
1.0TrpSer: 1.0 ± 0.022
0.77TrpThr: 0.77 ± 0.019
0.546TrpVal: 0.546 ± 0.016
0.218TrpTrp: 0.218 ± 0.01
0.307TrpTyr: 0.307 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.004TyrAla: 2.004 ± 0.034
0.259TyrCys: 0.259 ± 0.011
1.439TyrAsp: 1.439 ± 0.026
1.09TyrGlu: 1.09 ± 0.019
0.782TyrPhe: 0.782 ± 0.021
1.775TyrGly: 1.775 ± 0.035
0.561TyrHis: 0.561 ± 0.014
0.803TyrIle: 0.803 ± 0.02
0.718TyrLys: 0.718 ± 0.021
2.015TyrLeu: 2.015 ± 0.034
0.358TyrMet: 0.358 ± 0.012
0.658TyrAsn: 0.658 ± 0.015
1.171TyrPro: 1.171 ± 0.024
0.756TyrGln: 0.756 ± 0.018
1.483TyrArg: 1.483 ± 0.024
1.526TyrSer: 1.526 ± 0.031
1.134TyrThr: 1.134 ± 0.024
1.26TyrVal: 1.26 ± 0.02
0.259TyrTrp: 0.259 ± 0.01
0.576TyrTyr: 0.576 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.001
0.001XaaSer: 0.001 ± 0.001
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.06XaaXaa: 0.06 ± 0.013
Statistics based on 4425 proteins (2781444 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski