Amino acid dipepetide frequency for Pteropus alecto (Black flying fox)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.851AlaAla: 6.851 ± 0.047
1.483AlaCys: 1.483 ± 0.015
2.955AlaAsp: 2.955 ± 0.021
4.836AlaGlu: 4.836 ± 0.035
2.713AlaPhe: 2.713 ± 0.022
4.926AlaGly: 4.926 ± 0.039
1.634AlaHis: 1.634 ± 0.015
2.778AlaIle: 2.778 ± 0.018
3.373AlaLys: 3.373 ± 0.025
7.49AlaLeu: 7.49 ± 0.043
1.511AlaMet: 1.511 ± 0.016
2.0AlaAsn: 2.0 ± 0.019
4.411AlaPro: 4.411 ± 0.04
3.315AlaGln: 3.315 ± 0.026
3.908AlaArg: 3.908 ± 0.024
5.918AlaSer: 5.918 ± 0.032
3.652AlaThr: 3.652 ± 0.026
4.856AlaVal: 4.856 ± 0.026
0.866AlaTrp: 0.866 ± 0.012
1.545AlaTyr: 1.545 ± 0.012
0.009AlaXaa: 0.009 ± 0.001
Cys
1.338CysAla: 1.338 ± 0.013
0.609CysCys: 0.609 ± 0.011
1.02CysAsp: 1.02 ± 0.014
1.266CysGlu: 1.266 ± 0.016
0.841CysPhe: 0.841 ± 0.01
1.797CysGly: 1.797 ± 0.029
0.679CysHis: 0.679 ± 0.009
0.906CysIle: 0.906 ± 0.011
1.1CysLys: 1.1 ± 0.014
2.192CysLeu: 2.192 ± 0.019
0.418CysMet: 0.418 ± 0.007
0.775CysAsn: 0.775 ± 0.012
1.46CysPro: 1.46 ± 0.018
1.066CysGln: 1.066 ± 0.014
1.342CysArg: 1.342 ± 0.016
2.026CysSer: 2.026 ± 0.02
1.103CysThr: 1.103 ± 0.011
1.306CysVal: 1.306 ± 0.018
0.304CysTrp: 0.304 ± 0.005
0.576CysTyr: 0.576 ± 0.009
0.001CysXaa: 0.001 ± 0.0
Asp
2.882AspAla: 2.882 ± 0.019
1.052AspCys: 1.052 ± 0.014
2.564AspAsp: 2.564 ± 0.023
3.367AspGlu: 3.367 ± 0.026
2.096AspPhe: 2.096 ± 0.017
3.266AspGly: 3.266 ± 0.028
1.134AspHis: 1.134 ± 0.012
2.563AspIle: 2.563 ± 0.021
2.438AspLys: 2.438 ± 0.023
4.994AspLeu: 4.994 ± 0.028
1.109AspMet: 1.109 ± 0.012
1.642AspAsn: 1.642 ± 0.018
2.925AspPro: 2.925 ± 0.018
1.804AspGln: 1.804 ± 0.017
2.483AspArg: 2.483 ± 0.019
4.136AspSer: 4.136 ± 0.026
2.468AspThr: 2.468 ± 0.02
3.085AspVal: 3.085 ± 0.024
0.627AspTrp: 0.627 ± 0.01
1.419AspTyr: 1.419 ± 0.013
0.002AspXaa: 0.002 ± 0.0
Glu
5.385GluAla: 5.385 ± 0.035
1.409GluCys: 1.409 ± 0.023
4.359GluAsp: 4.359 ± 0.028
7.767GluGlu: 7.767 ± 0.066
2.006GluPhe: 2.006 ± 0.017
4.217GluGly: 4.217 ± 0.025
1.498GluHis: 1.498 ± 0.015
3.035GluIle: 3.035 ± 0.029
5.273GluLys: 5.273 ± 0.045
6.407GluLeu: 6.407 ± 0.03
1.623GluMet: 1.623 ± 0.015
3.027GluAsn: 3.027 ± 0.023
3.301GluPro: 3.301 ± 0.034
3.14GluGln: 3.14 ± 0.029
3.987GluArg: 3.987 ± 0.031
4.305GluSer: 4.305 ± 0.027
3.36GluThr: 3.36 ± 0.026
4.194GluVal: 4.194 ± 0.031
0.699GluTrp: 0.699 ± 0.008
1.509GluTyr: 1.509 ± 0.02
0.003GluXaa: 0.003 ± 0.001
Phe
2.013PheAla: 2.013 ± 0.017
0.958PheCys: 0.958 ± 0.012
1.641PheAsp: 1.641 ± 0.014
1.973PheGlu: 1.973 ± 0.017
1.614PhePhe: 1.614 ± 0.018
2.222PheGly: 2.222 ± 0.021
1.03PheHis: 1.03 ± 0.013
1.791PheIle: 1.791 ± 0.014
1.728PheLys: 1.728 ± 0.016
4.154PheLeu: 4.154 ± 0.032
0.759PheMet: 0.759 ± 0.01
1.288PheAsn: 1.288 ± 0.015
2.012PhePro: 2.012 ± 0.015
1.783PheGln: 1.783 ± 0.015
1.981PheArg: 1.981 ± 0.019
3.455PheSer: 3.455 ± 0.025
1.998PheThr: 1.998 ± 0.015
2.157PheVal: 2.157 ± 0.016
0.493PheTrp: 0.493 ± 0.008
1.191PheTyr: 1.191 ± 0.014
0.002PheXaa: 0.002 ± 0.0
Gly
4.686GlyAla: 4.686 ± 0.036
1.335GlyCys: 1.335 ± 0.015
3.144GlyAsp: 3.144 ± 0.025
4.108GlyGlu: 4.108 ± 0.028
2.425GlyPhe: 2.425 ± 0.024
5.136GlyGly: 5.136 ± 0.048
1.758GlyHis: 1.758 ± 0.018
2.664GlyIle: 2.664 ± 0.02
3.696GlyLys: 3.696 ± 0.029
6.133GlyLeu: 6.133 ± 0.042
1.258GlyMet: 1.258 ± 0.014
2.264GlyAsn: 2.264 ± 0.018
4.43GlyPro: 4.43 ± 0.055
2.805GlyGln: 2.805 ± 0.021
3.936GlyArg: 3.936 ± 0.03
5.817GlySer: 5.817 ± 0.036
3.634GlyThr: 3.634 ± 0.028
3.626GlyVal: 3.626 ± 0.024
0.83GlyTrp: 0.83 ± 0.011
1.671GlyTyr: 1.671 ± 0.018
0.007GlyXaa: 0.007 ± 0.001
His
1.388HisAla: 1.388 ± 0.012
0.736HisCys: 0.736 ± 0.011
0.88HisAsp: 0.88 ± 0.009
1.308HisGlu: 1.308 ± 0.014
1.096HisPhe: 1.096 ± 0.012
1.6HisGly: 1.6 ± 0.015
0.889HisHis: 0.889 ± 0.013
1.201HisIle: 1.201 ± 0.012
1.217HisLys: 1.217 ± 0.012
3.004HisLeu: 3.004 ± 0.024
0.584HisMet: 0.584 ± 0.008
0.825HisAsn: 0.825 ± 0.011
1.699HisPro: 1.699 ± 0.015
1.342HisGln: 1.342 ± 0.019
1.655HisArg: 1.655 ± 0.017
2.288HisSer: 2.288 ± 0.019
1.538HisThr: 1.538 ± 0.021
1.566HisVal: 1.566 ± 0.013
0.357HisTrp: 0.357 ± 0.007
0.798HisTyr: 0.798 ± 0.01
0.001HisXaa: 0.001 ± 0.0
Ile
2.517IleAla: 2.517 ± 0.018
1.072IleCys: 1.072 ± 0.013
1.947IleAsp: 1.947 ± 0.019
2.476IleGlu: 2.476 ± 0.022
1.865IlePhe: 1.865 ± 0.019
2.141IleGly: 2.141 ± 0.018
1.321IleHis: 1.321 ± 0.015
2.297IleIle: 2.297 ± 0.025
2.474IleLys: 2.474 ± 0.022
4.527IleLeu: 4.527 ± 0.031
0.964IleMet: 0.964 ± 0.011
1.743IleAsn: 1.743 ± 0.017
2.502IlePro: 2.502 ± 0.019
2.205IleGln: 2.205 ± 0.017
2.342IleArg: 2.342 ± 0.018
3.631IleSer: 3.631 ± 0.033
2.485IleThr: 2.485 ± 0.027
2.432IleVal: 2.432 ± 0.022
0.501IleTrp: 0.501 ± 0.008
1.36IleTyr: 1.36 ± 0.013
0.002IleXaa: 0.002 ± 0.0
Lys
4.05LysAla: 4.05 ± 0.029
1.101LysCys: 1.101 ± 0.014
3.028LysAsp: 3.028 ± 0.028
4.973LysGlu: 4.973 ± 0.041
1.665LysPhe: 1.665 ± 0.015
3.154LysGly: 3.154 ± 0.029
1.352LysHis: 1.352 ± 0.015
2.651LysIle: 2.651 ± 0.024
4.544LysLys: 4.544 ± 0.041
5.038LysLeu: 5.038 ± 0.034
1.424LysMet: 1.424 ± 0.016
2.285LysAsn: 2.285 ± 0.02
3.025LysPro: 3.025 ± 0.032
2.51LysGln: 2.51 ± 0.022
3.202LysArg: 3.202 ± 0.021
3.739LysSer: 3.739 ± 0.027
2.983LysThr: 2.983 ± 0.022
3.37LysVal: 3.37 ± 0.027
0.582LysTrp: 0.582 ± 0.009
1.496LysTyr: 1.496 ± 0.017
0.003LysXaa: 0.003 ± 0.001
Leu
7.12LeuAla: 7.12 ± 0.049
2.264LeuCys: 2.264 ± 0.019
4.774LeuAsp: 4.774 ± 0.026
7.142LeuGlu: 7.142 ± 0.042
3.395LeuPhe: 3.395 ± 0.028
6.193LeuGly: 6.193 ± 0.041
2.766LeuHis: 2.766 ± 0.021
3.83LeuIle: 3.83 ± 0.026
5.618LeuLys: 5.618 ± 0.032
10.961LeuLeu: 10.961 ± 0.068
2.015LeuMet: 2.015 ± 0.023
3.414LeuAsn: 3.414 ± 0.024
6.166LeuPro: 6.166 ± 0.035
5.764LeuGln: 5.764 ± 0.039
6.16LeuArg: 6.16 ± 0.037
8.2LeuSer: 8.2 ± 0.039
5.132LeuThr: 5.132 ± 0.028
5.69LeuVal: 5.69 ± 0.034
1.182LeuTrp: 1.182 ± 0.013
2.537LeuTyr: 2.537 ± 0.022
0.007LeuXaa: 0.007 ± 0.001
Met
1.951MetAla: 1.951 ± 0.016
0.39MetCys: 0.39 ± 0.006
1.223MetAsp: 1.223 ± 0.013
1.799MetGlu: 1.799 ± 0.015
0.71MetPhe: 0.71 ± 0.009
1.309MetGly: 1.309 ± 0.021
0.461MetHis: 0.461 ± 0.007
0.815MetIle: 0.815 ± 0.009
1.382MetLys: 1.382 ± 0.012
1.994MetLeu: 1.994 ± 0.017
0.538MetMet: 0.538 ± 0.007
0.851MetAsn: 0.851 ± 0.013
1.101MetPro: 1.101 ± 0.019
0.906MetGln: 0.906 ± 0.011
1.049MetArg: 1.049 ± 0.01
1.525MetSer: 1.525 ± 0.013
1.105MetThr: 1.105 ± 0.012
1.382MetVal: 1.382 ± 0.014
0.237MetTrp: 0.237 ± 0.005
0.581MetTyr: 0.581 ± 0.009
0.001MetXaa: 0.001 ± 0.0
Asn
2.014AsnAla: 2.014 ± 0.018
0.809AsnCys: 0.809 ± 0.011
1.455AsnAsp: 1.455 ± 0.016
2.149AsnGlu: 2.149 ± 0.019
1.419AsnPhe: 1.419 ± 0.014
2.312AsnGly: 2.312 ± 0.022
0.928AsnHis: 0.928 ± 0.013
2.019AsnIle: 2.019 ± 0.02
2.11AsnLys: 2.11 ± 0.018
3.647AsnLeu: 3.647 ± 0.024
0.886AsnMet: 0.886 ± 0.015
1.439AsnAsn: 1.439 ± 0.017
2.139AsnPro: 2.139 ± 0.017
1.586AsnGln: 1.586 ± 0.016
1.792AsnArg: 1.792 ± 0.015
2.982AsnSer: 2.982 ± 0.024
1.848AsnThr: 1.848 ± 0.017
2.144AsnVal: 2.144 ± 0.017
0.433AsnTrp: 0.433 ± 0.006
1.054AsnTyr: 1.054 ± 0.013
0.001AsnXaa: 0.001 ± 0.0
Pro
5.051ProAla: 5.051 ± 0.044
1.201ProCys: 1.201 ± 0.016
2.817ProAsp: 2.817 ± 0.025
4.511ProGlu: 4.511 ± 0.037
1.959ProPhe: 1.959 ± 0.015
5.264ProGly: 5.264 ± 0.061
1.492ProHis: 1.492 ± 0.016
1.839ProIle: 1.839 ± 0.021
2.772ProLys: 2.772 ± 0.031
5.476ProLeu: 5.476 ± 0.032
1.087ProMet: 1.087 ± 0.012
1.776ProAsn: 1.776 ± 0.015
6.074ProPro: 6.074 ± 0.071
2.883ProGln: 2.883 ± 0.023
3.594ProArg: 3.594 ± 0.032
5.78ProSer: 5.78 ± 0.038
3.153ProThr: 3.153 ± 0.06
3.871ProVal: 3.871 ± 0.034
0.733ProTrp: 0.733 ± 0.009
1.507ProTyr: 1.507 ± 0.019
0.006ProXaa: 0.006 ± 0.001
Gln
3.615GlnAla: 3.615 ± 0.032
0.929GlnCys: 0.929 ± 0.013
2.339GlnAsp: 2.339 ± 0.017
3.864GlnGlu: 3.864 ± 0.033
1.337GlnPhe: 1.337 ± 0.013
2.885GlnGly: 2.885 ± 0.023
1.302GlnHis: 1.302 ± 0.015
1.931GlnIle: 1.931 ± 0.014
2.865GlnLys: 2.865 ± 0.023
4.788GlnLeu: 4.788 ± 0.034
1.072GlnMet: 1.072 ± 0.011
1.781GlnAsn: 1.781 ± 0.016
2.804GlnPro: 2.804 ± 0.027
2.893GlnGln: 2.893 ± 0.039
3.029GlnArg: 3.029 ± 0.026
3.063GlnSer: 3.063 ± 0.022
2.263GlnThr: 2.263 ± 0.019
2.867GlnVal: 2.867 ± 0.021
0.535GlnTrp: 0.535 ± 0.007
1.099GlnTyr: 1.099 ± 0.012
0.003GlnXaa: 0.003 ± 0.001
Arg
4.185ArgAla: 4.185 ± 0.025
1.244ArgCys: 1.244 ± 0.016
2.803ArgAsp: 2.803 ± 0.022
4.048ArgGlu: 4.048 ± 0.028
1.875ArgPhe: 1.875 ± 0.016
3.791ArgGly: 3.791 ± 0.031
1.595ArgHis: 1.595 ± 0.016
2.364ArgIle: 2.364 ± 0.018
3.593ArgLys: 3.593 ± 0.022
5.591ArgLeu: 5.591 ± 0.031
1.176ArgMet: 1.176 ± 0.014
2.063ArgAsn: 2.063 ± 0.015
3.449ArgPro: 3.449 ± 0.027
2.65ArgGln: 2.65 ± 0.023
4.471ArgArg: 4.471 ± 0.033
4.328ArgSer: 4.328 ± 0.038
2.83ArgThr: 2.83 ± 0.02
3.31ArgVal: 3.31 ± 0.024
0.729ArgTrp: 0.729 ± 0.01
1.454ArgTyr: 1.454 ± 0.013
0.007ArgXaa: 0.007 ± 0.001
Ser
5.452SerAla: 5.452 ± 0.031
1.846SerCys: 1.846 ± 0.017
3.824SerAsp: 3.824 ± 0.027
5.166SerGlu: 5.166 ± 0.031
3.046SerPhe: 3.046 ± 0.019
5.679SerGly: 5.679 ± 0.032
2.18SerHis: 2.18 ± 0.019
3.124SerIle: 3.124 ± 0.021
3.944SerLys: 3.944 ± 0.027
8.311SerLeu: 8.311 ± 0.035
1.567SerMet: 1.567 ± 0.015
2.561SerAsn: 2.561 ± 0.023
6.064SerPro: 6.064 ± 0.048
3.83SerGln: 3.83 ± 0.025
4.624SerArg: 4.624 ± 0.031
9.051SerSer: 9.051 ± 0.057
4.35SerThr: 4.35 ± 0.036
4.96SerVal: 4.96 ± 0.024
1.09SerTrp: 1.09 ± 0.013
2.058SerTyr: 2.058 ± 0.016
0.005SerXaa: 0.005 ± 0.001
Thr
3.804ThrAla: 3.804 ± 0.026
1.313ThrCys: 1.313 ± 0.018
2.412ThrAsp: 2.412 ± 0.017
3.492ThrGlu: 3.492 ± 0.029
2.097ThrPhe: 2.097 ± 0.016
3.557ThrGly: 3.557 ± 0.032
1.314ThrHis: 1.314 ± 0.015
2.298ThrIle: 2.298 ± 0.024
2.559ThrLys: 2.559 ± 0.024
5.349ThrLeu: 5.349 ± 0.029
1.101ThrMet: 1.101 ± 0.014
1.65ThrAsn: 1.65 ± 0.013
3.605ThrPro: 3.605 ± 0.057
2.278ThrGln: 2.278 ± 0.017
2.519ThrArg: 2.519 ± 0.017
4.531ThrSer: 4.531 ± 0.035
2.87ThrThr: 2.87 ± 0.051
3.88ThrVal: 3.88 ± 0.031
0.684ThrTrp: 0.684 ± 0.01
1.396ThrTyr: 1.396 ± 0.015
0.004ThrXaa: 0.004 ± 0.001
Val
4.455ValAla: 4.455 ± 0.025
1.466ValCys: 1.466 ± 0.015
2.946ValAsp: 2.946 ± 0.022
3.785ValGlu: 3.785 ± 0.028
2.412ValPhe: 2.412 ± 0.02
3.498ValGly: 3.498 ± 0.023
1.585ValHis: 1.585 ± 0.014
2.866ValIle: 2.866 ± 0.021
3.269ValLys: 3.269 ± 0.025
6.388ValLeu: 6.388 ± 0.036
1.329ValMet: 1.329 ± 0.012
2.21ValAsn: 2.21 ± 0.02
3.821ValPro: 3.821 ± 0.035
2.73ValGln: 2.73 ± 0.019
3.162ValArg: 3.162 ± 0.023
4.942ValSer: 4.942 ± 0.03
3.829ValThr: 3.829 ± 0.032
4.039ValVal: 4.039 ± 0.03
0.73ValTrp: 0.73 ± 0.01
1.6ValTyr: 1.6 ± 0.013
0.003ValXaa: 0.003 ± 0.0
Trp
0.872TrpAla: 0.872 ± 0.012
0.241TrpCys: 0.241 ± 0.005
0.666TrpAsp: 0.666 ± 0.009
0.819TrpGlu: 0.819 ± 0.01
0.448TrpPhe: 0.448 ± 0.009
0.791TrpGly: 0.791 ± 0.01
0.312TrpHis: 0.312 ± 0.006
0.522TrpIle: 0.522 ± 0.007
0.789TrpLys: 0.789 ± 0.009
1.221TrpLeu: 1.221 ± 0.012
0.312TrpMet: 0.312 ± 0.005
0.518TrpAsn: 0.518 ± 0.008
0.563TrpPro: 0.563 ± 0.008
0.542TrpGln: 0.542 ± 0.007
0.765TrpArg: 0.765 ± 0.009
0.86TrpSer: 0.86 ± 0.012
0.668TrpThr: 0.668 ± 0.01
0.73TrpVal: 0.73 ± 0.01
0.191TrpTrp: 0.191 ± 0.005
0.324TrpTyr: 0.324 ± 0.006
0.001TrpXaa: 0.001 ± 0.0
Tyr
1.392TyrAla: 1.392 ± 0.014
0.666TyrCys: 0.666 ± 0.01
1.258TyrAsp: 1.258 ± 0.014
1.653TyrGlu: 1.653 ± 0.017
1.221TyrPhe: 1.221 ± 0.013
1.618TyrGly: 1.618 ± 0.015
0.741TyrHis: 0.741 ± 0.009
1.306TyrIle: 1.306 ± 0.015
1.429TyrLys: 1.429 ± 0.02
2.661TyrLeu: 2.661 ± 0.021
0.58TyrMet: 0.58 ± 0.009
1.054TyrAsn: 1.054 ± 0.012
1.286TyrPro: 1.286 ± 0.014
1.23TyrGln: 1.23 ± 0.011
1.544TyrArg: 1.544 ± 0.015
2.151TyrSer: 2.151 ± 0.018
1.442TyrThr: 1.442 ± 0.016
1.595TyrVal: 1.595 ± 0.013
0.349TyrTrp: 0.349 ± 0.007
0.908TyrTyr: 0.908 ± 0.011
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.009XaaAla: 0.009 ± 0.001
0.002XaaCys: 0.002 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.004XaaGlu: 0.004 ± 0.001
0.002XaaPhe: 0.002 ± 0.0
0.007XaaGly: 0.007 ± 0.001
0.002XaaHis: 0.002 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.007XaaLeu: 0.007 ± 0.001
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.006XaaPro: 0.006 ± 0.001
0.003XaaGln: 0.003 ± 0.001
0.007XaaArg: 0.007 ± 0.001
0.005XaaSer: 0.005 ± 0.001
0.002XaaThr: 0.002 ± 0.0
0.003XaaVal: 0.003 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
2.437XaaXaa: 2.437 ± 0.199
Statistics based on 19521 proteins (9971383 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski