Amino acid dipeptide frequency for Abisko virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
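The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by the database: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, any non-standard letter is mapped to X (Xaa), and pairs are assumed not to be counted across protein boundaries (the page does not say how this is handled).

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard residues (one-letter codes)
ALPHABET = AMINO_ACIDS + "X"           # X stands for Xaa (non-standard or unknown)
EXPECTED_PER_MILLE = 1000.0 / 400      # uniform expectation: 1/400 = 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Map anything outside the 20 standard residues to X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"    # shown in red on the page
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"   # shown in blue on the page
    return "expected"


if __name__ == "__main__":
    toy = ["ACAC", "AAB"]  # toy sequences, not real viral proteins
    freqs = dipeptide_per_mille(toy)
    for pair in ("AC", "CA", "AA", "AX"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With the table values, `classify(4.645)` (AlaAla) yields "over" and `classify(0.995)` (AlaCys) yields "under".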

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 4.645‰ = 0.004645).

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Each cell gives frequency ± error in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.645 ± 0.676 | 0.995 ± 0.47 | 3.981 ± 1.655 | 4.313 ± 2.038 | 2.986 ± 0.737 | 4.313 ± 1.518 | 0.995 ± 1.126 | 2.654 ± 0.736 | 1.991 ± 2.961 | 6.636 ± 3.693 | 1.659 ± 0.784 | 1.659 ± 1.874 | 2.322 ± 2.136 | 0.664 ± 0.313 | 3.65 ± 1.734 | 2.322 ± 1.303 | 2.322 ± 1.097 | 3.65 ± 4.373 | 0.664 ± 1.595 | 2.322 ± 1.097 | 0.0 ± 0.0
Cys | 0.332 ± 0.157 | 0.664 ± 0.313 | 0.995 ± 0.47 | 1.991 ± 0.94 | 1.327 ± 0.627 | 0.664 ± 0.313 | 0.332 ± 0.157 | 1.327 ± 0.627 | 0.995 ± 1.509 | 2.654 ± 0.736 | 0.664 ± 0.313 | 0.664 ± 0.313 | 0.332 ± 0.157 | 0.332 ± 0.157 | 0.995 ± 0.47 | 1.991 ± 0.94 | 0.664 ± 0.313 | 0.995 ± 0.47 | 0.0 ± 0.0 | 0.995 ± 0.47 | 0.0 ± 0.0
Asp | 2.986 ± 0.737 | 0.664 ± 0.313 | 3.318 ± 0.771 | 3.65 ± 1.734 | 3.318 ± 1.15 | 2.654 ± 1.254 | 1.991 ± 0.94 | 4.977 ± 1.496 | 3.981 ± 0.89 | 7.963 ± 4.296 | 0.995 ± 0.436 | 0.995 ± 0.47 | 3.318 ± 1.334 | 1.991 ± 0.828 | 2.654 ± 2.025 | 4.977 ± 1.26 | 1.659 ± 0.911 | 7.963 ± 3.762 | 0.332 ± 0.157 | 2.654 ± 1.254 | 0.0 ± 0.0
Glu | 3.318 ± 1.823 | 1.327 ± 0.627 | 4.645 ± 2.606 | 5.972 ± 2.821 | 2.322 ± 1.097 | 1.991 ± 0.94 | 1.991 ± 2.253 | 4.645 ± 2.194 | 3.318 ± 1.334 | 6.636 ± 3.135 | 0.664 ± 0.313 | 3.318 ± 0.771 | 1.991 ± 0.828 | 2.322 ± 0.767 | 3.65 ± 2.179 | 5.309 ± 1.472 | 3.981 ± 0.919 | 5.64 ± 1.921 | 0.332 ± 0.157 | 1.327 ± 0.627 | 0.0 ± 0.0
Phe | 2.654 ± 1.432 | 1.659 ± 1.374 | 2.986 ± 0.737 | 3.65 ± 0.834 | 0.995 ± 0.47 | 2.654 ± 0.736 | 1.659 ± 0.784 | 1.991 ± 0.94 | 0.995 ± 0.47 | 2.654 ± 1.254 | 0.995 ± 0.47 | 1.991 ± 1.33 | 1.991 ± 1.725 | 1.327 ± 1.434 | 2.654 ± 1.294 | 4.977 ± 1.26 | 3.65 ± 1.38 | 2.654 ± 0.736 | 0.0 ± 0.0 | 1.659 ± 0.911 | 0.0 ± 0.0
Gly | 0.995 ± 0.47 | 0.995 ± 0.47 | 3.318 ± 1.334 | 2.986 ± 0.737 | 3.65 ± 1.724 | 3.318 ± 1.334 | 0.0 ± 0.0 | 3.318 ± 1.823 | 2.322 ± 1.097 | 1.659 ± 3.556 | 1.991 ± 0.94 | 2.654 ± 1.254 | 1.327 ± 0.627 | 1.327 ± 0.627 | 1.659 ± 0.911 | 4.977 ± 2.365 | 0.995 ± 0.47 | 3.981 ± 0.89 | 0.332 ± 0.157 | 2.654 ± 2.869 | 0.0 ± 0.0
His | 2.322 ± 1.097 | 0.995 ± 0.47 | 1.659 ± 0.911 | 0.332 ± 0.157 | 1.327 ± 0.627 | 0.664 ± 0.313 | 0.0 ± 0.0 | 1.991 ± 0.828 | 0.995 ± 0.47 | 0.995 ± 0.47 | 0.0 ± 0.0 | 2.654 ± 0.736 | 0.0 ± 0.0 | 0.332 ± 0.157 | 1.327 ± 1.012 | 0.664 ± 0.313 | 0.0 ± 0.0 | 1.327 ± 0.627 | 0.0 ± 0.0 | 0.664 ± 0.313 | 0.0 ± 0.0
Ile | 3.318 ± 1.823 | 0.995 ± 0.47 | 3.318 ± 1.567 | 2.986 ± 1.289 | 2.322 ± 1.097 | 1.991 ± 0.828 | 0.664 ± 0.313 | 3.318 ± 2.336 | 4.313 ± 1.021 | 7.631 ± 2.229 | 0.995 ± 3.284 | 4.313 ± 2.038 | 3.65 ± 0.834 | 1.659 ± 0.911 | 4.977 ± 2.351 | 3.981 ± 2.023 | 2.986 ± 1.411 | 4.645 ± 2.194 | 0.332 ± 0.157 | 1.327 ± 0.627 | 0.0 ± 0.0
Lys | 2.986 ± 0.737 | 1.659 ± 0.784 | 3.318 ± 1.334 | 3.65 ± 1.016 | 4.645 ± 1.605 | 1.327 ± 1.434 | 0.664 ± 0.313 | 3.65 ± 1.724 | 6.636 ± 3.135 | 5.972 ± 0.615 | 1.991 ± 1.33 | 1.327 ± 0.627 | 3.981 ± 1.442 | 2.986 ± 1.411 | 1.991 ± 0.828 | 5.309 ± 0.565 | 3.318 ± 0.771 | 2.654 ± 1.294 | 0.332 ± 0.157 | 4.313 ± 1.867 | 0.0 ± 0.0
Leu | 4.977 ± 2.365 | 0.995 ± 0.47 | 6.636 ± 3.785 | 5.64 ± 1.921 | 3.65 ± 2.7 | 4.645 ± 1.605 | 2.322 ± 1.097 | 6.304 ± 3.777 | 5.972 ± 2.821 | 9.954 ± 2.991 | 4.313 ± 1.92 | 3.318 ± 1.567 | 3.65 ± 2.797 | 2.986 ± 0.737 | 5.309 ± 1.391 | 8.958 ± 0.376 | 5.64 ± 0.57 | 7.299 ± 3.505 | 0.664 ± 0.313 | 4.977 ± 0.603 | 0.0 ± 0.0
Met | 1.659 ± 0.784 | 0.664 ± 0.313 | 0.995 ± 0.47 | 0.995 ± 0.47 | 0.995 ± 0.47 | 1.327 ± 2.024 | 0.0 ± 0.0 | 1.327 ± 0.627 | 1.991 ± 1.33 | 3.318 ± 1.334 | 0.664 ± 0.313 | 0.995 ± 0.47 | 0.0 ± 0.0 | 1.327 ± 0.627 | 2.322 ± 2.136 | 2.322 ± 1.303 | 1.991 ± 1.725 | 0.995 ± 0.47 | 0.0 ± 0.0 | 0.664 ± 0.313 | 0.0 ± 0.0
Asn | 1.991 ± 0.94 | 0.664 ± 0.313 | 1.659 ± 0.784 | 1.991 ± 0.94 | 3.65 ± 0.834 | 1.327 ± 1.434 | 0.664 ± 0.313 | 3.318 ± 1.567 | 1.327 ± 0.627 | 3.318 ± 1.567 | 1.659 ± 0.911 | 1.659 ± 0.784 | 2.322 ± 2.136 | 0.995 ± 0.47 | 3.65 ± 1.734 | 4.313 ± 1.021 | 1.327 ± 1.434 | 4.313 ± 1.021 | 0.995 ± 1.126 | 2.322 ± 1.577 | 0.0 ± 0.0
Pro | 1.327 ± 2.024 | 0.0 ± 0.0 | 3.65 ± 0.834 | 4.313 ± 1.518 | 1.659 ± 0.911 | 1.991 ± 0.94 | 0.664 ± 0.313 | 3.318 ± 0.771 | 1.659 ± 1.374 | 4.977 ± 3.01 | 0.995 ± 0.47 | 2.322 ± 1.097 | 1.991 ± 0.94 | 0.995 ± 1.509 | 1.659 ± 0.911 | 3.981 ± 1.655 | 1.659 ± 0.784 | 4.977 ± 1.496 | 0.0 ± 0.0 | 1.991 ± 0.94 | 0.0 ± 0.0
Gln | 1.659 ± 1.374 | 0.995 ± 0.47 | 2.322 ± 1.097 | 0.995 ± 0.47 | 0.664 ± 0.313 | 1.991 ± 0.94 | 0.664 ± 0.313 | 1.659 ± 0.784 | 0.664 ± 1.595 | 3.318 ± 1.567 | 0.0 ± 0.0 | 1.327 ± 1.012 | 1.659 ± 2.374 | 0.664 ± 0.313 | 2.654 ± 1.432 | 3.318 ± 1.823 | 1.991 ± 0.94 | 1.659 ± 1.374 | 0.664 ± 0.313 | 1.327 ± 0.627 | 0.0 ± 0.0
Arg | 1.991 ± 1.33 | 0.995 ± 1.126 | 2.986 ± 3.379 | 6.304 ± 3.74 | 1.991 ± 0.828 | 2.654 ± 2.025 | 1.659 ± 0.784 | 3.318 ± 1.823 | 3.65 ± 1.724 | 6.304 ± 2.037 | 0.332 ± 0.157 | 2.322 ± 1.097 | 2.986 ± 0.737 | 1.659 ± 0.784 | 4.645 ± 2.83 | 5.972 ± 5.283 | 2.322 ± 1.097 | 3.65 ± 1.016 | 0.332 ± 1.38 | 3.318 ± 0.771 | 0.0 ± 0.0
Ser | 5.309 ± 2.271 | 1.991 ± 0.94 | 5.64 ± 1.526 | 5.64 ± 1.526 | 2.654 ± 2.648 | 4.313 ± 2.628 | 0.995 ± 0.47 | 2.986 ± 0.737 | 6.304 ± 0.934 | 9.954 ± 2.947 | 2.654 ± 0.736 | 4.645 ± 4.272 | 2.654 ± 1.254 | 3.318 ± 1.15 | 5.64 ± 2.56 | 7.631 ± 4.749 | 5.309 ± 4.359 | 6.304 ± 2.414 | 0.995 ± 0.47 | 3.318 ± 1.334 | 0.0 ± 0.0
Thr | 3.318 ± 2.916 | 0.0 ± 0.0 | 1.991 ± 0.828 | 2.986 ± 1.411 | 2.322 ± 2.94 | 2.654 ± 1.294 | 0.995 ± 0.47 | 4.645 ± 0.676 | 4.313 ± 1.518 | 4.645 ± 1.136 | 1.659 ± 0.784 | 1.659 ± 0.911 | 1.327 ± 0.627 | 1.991 ± 0.828 | 2.986 ± 1.92 | 5.309 ± 4.571 | 2.986 ± 0.737 | 3.981 ± 1.655 | 0.0 ± 0.0 | 2.322 ± 1.097 | 0.0 ± 0.0
Val | 6.304 ± 0.934 | 0.995 ± 0.47 | 2.986 ± 0.737 | 3.65 ± 1.016 | 1.659 ± 0.911 | 2.322 ± 1.097 | 1.327 ± 1.012 | 3.981 ± 1.881 | 7.963 ± 0.218 | 5.309 ± 1.808 | 1.327 ± 0.783 | 3.318 ± 0.771 | 6.967 ± 2.098 | 2.654 ± 1.294 | 3.318 ± 0.771 | 7.299 ± 0.48 | 4.977 ± 4.705 | 7.963 ± 2.544 | 0.0 ± 0.0 | 3.318 ± 1.567 | 0.0 ± 0.0
Trp | 0.332 ± 0.157 | 0.0 ± 0.0 | 0.332 ± 0.157 | 0.995 ± 0.47 | 0.664 ± 0.313 | 0.332 ± 0.157 | 0.0 ± 0.0 | 0.332 ± 0.157 | 0.664 ± 0.313 | 0.332 ± 0.157 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.995 ± 1.509 | 0.332 ± 0.157 | 0.664 ± 1.25 | 0.995 ± 1.126 | 0.0 ± 0.0 | 0.332 ± 0.157 | 0.0 ± 0.0
Tyr | 2.322 ± 1.097 | 1.659 ± 0.784 | 6.304 ± 2.163 | 1.659 ± 0.784 | 0.995 ± 0.47 | 1.327 ± 2.499 | 0.664 ± 0.313 | 0.664 ± 0.313 | 2.654 ± 1.254 | 4.313 ± 0.775 | 0.664 ± 0.313 | 2.322 ± 1.303 | 1.659 ± 0.784 | 0.664 ± 0.313 | 2.654 ± 0.736 | 4.313 ± 1.867 | 3.65 ± 2.7 | 1.991 ± 0.94 | 1.327 ± 0.627 | 2.654 ± 1.254 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 3 proteins (3015 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
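To illustrate what a protein-level bootstrap could look like, here is a minimal Python sketch. It rests on assumptions that the page does not confirm: whole proteins are resampled with replacement, the statistic is recomputed for each of 100 replicates, and the reported error is taken to be the standard deviation across replicates. The function names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    count = total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each replicate draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC", "ACAC"]  # toy sequences, not real viral proteins
    print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 3 proteins, as in this proteome, each replicate can contain very few distinct proteins, which is consistent with the large errors relative to the means seen for many cells in the table.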

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski