Amino acid dipeptide frequency for Sanxia picorna-like virus 12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
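
As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It assumes one-letter amino acid codes and that pairs are not counted across protein boundaries. The function name dipeptide_per_mille is hypothetical, and this is not necessarily the exact procedure used to build the table on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: sequences use one-letter amino acid codes, and dipeptides are
    overlapping windows of length 2 taken inside each protein separately.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of two residues along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1

    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Two toy "proteins"; real input would be the proteome sequences.
    for pair, value in sorted(dipeptide_per_mille(["AAC", "CA"]).items()):
        print(pair, round(value, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary, so a caller that needs the full grid would fill in 0.0 for the missing pairs.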

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
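
The uniform expectation described above can be sketched as follows. The code assumes the baseline is simply 1 divided by the number of possible dipeptides, and it labels a value as over- or underrepresented by a plain comparison with that baseline. The function names and the exact threshold rule are assumptions made for illustration; the page does not state how the red and blue marking is decided.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    print(uniform_expectation_per_mille(20))  # 400 dipeptides
    print(uniform_expectation_per_mille(21))  # 441 dipeptides, Xaa included
    print(classify(10.43))  # AlaAla value from the table
    print(classify(0.834))  # AlaGln value from the table
```

With 20 amino acids the baseline is 2.5‰, and with Xaa included it drops to roughly 2.27‰. Under this simple rule AlaAla (10.43‰) counts as overrepresented and AlaGln (0.834‰) as underrepresented.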

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
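
A small sketch of the unit conversion, using a value from the table as an example. The helper names are hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


if __name__ == "__main__":
    ala_ala = 10.43  # AlaAla frequency from the table, in per mille
    print(per_mille_to_fraction(ala_ala))
    print(per_mille_to_percent(ala_ala))
```

For AlaAla, 10.43‰ corresponds to a fraction of about 0.01043, or about 1.043%.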

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.43 ± 2.25
AlaCys: 2.503 ± 0.562
AlaAsp: 4.589 ± 0.826
AlaGlu: 5.006 ± 0.538
AlaPhe: 5.006 ± 1.221
AlaGly: 8.761 ± 3.016
AlaHis: 2.086 ± 0.957
AlaIle: 4.172 ± 0.155
AlaLys: 4.172 ± 1.328
AlaLeu: 6.675 ± 0.131
AlaMet: 2.086 ± 0.216
AlaAsn: 5.006 ± 0.048
AlaPro: 4.589 ± 0.826
AlaGln: 0.834 ± 0.383
AlaArg: 5.423 ± 1.616
AlaSer: 7.509 ± 0.514
AlaThr: 4.589 ± 0.346
AlaVal: 6.258 ± 1.112
AlaTrp: 1.669 ± 0.179
AlaTyr: 2.086 ± 0.371
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.252 ± 0.012
CysCys: 0.0 ± 0.0
CysAsp: 0.417 ± 0.191
CysGlu: 0.834 ± 0.203
CysPhe: 0.834 ± 0.383
CysGly: 0.417 ± 0.191
CysHis: 0.417 ± 0.191
CysIle: 0.834 ± 0.383
CysLys: 0.417 ± 0.191
CysLeu: 0.417 ± 0.191
CysMet: 0.834 ± 0.203
CysAsn: 0.0 ± 0.0
CysPro: 1.252 ± 1.185
CysGln: 0.834 ± 0.79
CysArg: 0.0 ± 0.0
CysSer: 0.417 ± 0.191
CysThr: 1.252 ± 0.574
CysVal: 1.669 ± 0.179
CysTrp: 0.0 ± 0.0
CysTyr: 2.086 ± 0.216
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.841 ± 0.252
AspCys: 0.834 ± 0.383
AspAsp: 1.252 ± 0.012
AspGlu: 4.589 ± 0.933
AspPhe: 4.172 ± 0.155
AspGly: 3.338 ± 0.359
AspHis: 0.417 ± 0.191
AspIle: 2.92 ± 0.167
AspLys: 2.92 ± 0.753
AspLeu: 4.172 ± 0.741
AspMet: 1.252 ± 0.574
AspAsn: 2.92 ± 0.419
AspPro: 2.086 ± 0.216
AspGln: 0.417 ± 0.395
AspArg: 1.669 ± 0.179
AspSer: 2.503 ± 1.197
AspThr: 2.086 ± 0.216
AspVal: 2.086 ± 0.216
AspTrp: 0.0 ± 0.0
AspTyr: 2.92 ± 1.005
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.006 ± 0.538
GluCys: 0.417 ± 0.191
GluAsp: 3.338 ± 0.359
GluGlu: 3.755 ± 1.722
GluPhe: 1.669 ± 0.179
GluGly: 3.755 ± 1.722
GluHis: 0.834 ± 0.203
GluIle: 2.086 ± 0.957
GluLys: 2.92 ± 0.167
GluLeu: 3.755 ± 0.55
GluMet: 0.834 ± 0.203
GluAsn: 1.669 ± 0.179
GluPro: 1.669 ± 0.179
GluGln: 4.172 ± 1.914
GluArg: 5.423 ± 1.902
GluSer: 4.172 ± 1.328
GluThr: 2.92 ± 0.167
GluVal: 5.841 ± 0.334
GluTrp: 0.417 ± 0.191
GluTyr: 2.503 ± 0.562
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.006 ± 0.048
PheCys: 0.834 ± 0.383
PheAsp: 2.503 ± 0.61
PheGlu: 2.503 ± 0.562
PhePhe: 1.669 ± 0.179
PheGly: 3.755 ± 0.036
PheHis: 1.252 ± 0.598
PheIle: 2.086 ± 0.957
PheLys: 2.92 ± 0.753
PheLeu: 2.92 ± 0.753
PheMet: 1.669 ± 0.472
PheAsn: 1.669 ± 0.407
PhePro: 1.669 ± 0.179
PheGln: 2.086 ± 0.802
PheArg: 3.755 ± 0.036
PheSer: 4.172 ± 1.017
PheThr: 3.338 ± 1.4
PheVal: 5.006 ± 1.221
PheTrp: 0.834 ± 0.203
PheTyr: 1.252 ± 0.012
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.927 ± 0.119
GlyCys: 0.834 ± 0.203
GlyAsp: 4.589 ± 0.346
GlyGlu: 6.675 ± 1.89
GlyPhe: 2.086 ± 0.957
GlyGly: 8.344 ± 2.035
GlyHis: 1.252 ± 0.574
GlyIle: 3.755 ± 1.136
GlyLys: 2.503 ± 0.562
GlyLeu: 7.092 ± 0.322
GlyMet: 1.252 ± 0.574
GlyAsn: 3.338 ± 1.4
GlyPro: 2.503 ± 0.61
GlyGln: 0.834 ± 0.203
GlyArg: 2.503 ± 0.61
GlySer: 5.006 ± 1.807
GlyThr: 5.423 ± 2.202
GlyVal: 6.258 ± 0.647
GlyTrp: 1.252 ± 0.012
GlyTyr: 2.503 ± 0.562
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.669 ± 0.766
HisCys: 0.417 ± 0.191
HisAsp: 0.417 ± 0.191
HisGlu: 0.834 ± 0.203
HisPhe: 1.252 ± 0.574
HisGly: 1.252 ± 0.598
HisHis: 0.834 ± 0.203
HisIle: 0.834 ± 0.383
HisLys: 0.0 ± 0.0
HisLeu: 2.086 ± 0.371
HisMet: 0.417 ± 0.191
HisAsn: 1.669 ± 0.179
HisPro: 0.417 ± 0.191
HisGln: 0.417 ± 0.191
HisArg: 0.0 ± 0.0
HisSer: 0.417 ± 0.191
HisThr: 0.0 ± 0.0
HisVal: 2.086 ± 0.802
HisTrp: 0.417 ± 0.191
HisTyr: 0.834 ± 0.383
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.006 ± 0.048
IleCys: 0.417 ± 0.191
IleAsp: 0.834 ± 0.383
IleGlu: 3.338 ± 0.945
IlePhe: 1.669 ± 0.179
IleGly: 5.006 ± 0.048
IleHis: 0.417 ± 0.395
IleIle: 0.417 ± 0.191
IleLys: 1.252 ± 0.574
IleLeu: 4.172 ± 0.741
IleMet: 0.834 ± 0.383
IleAsn: 2.086 ± 0.957
IlePro: 3.338 ± 0.814
IleGln: 1.669 ± 0.766
IleArg: 1.669 ± 0.766
IleSer: 3.338 ± 0.228
IleThr: 3.755 ± 0.036
IleVal: 4.589 ± 0.346
IleTrp: 0.417 ± 0.191
IleTyr: 1.669 ± 0.407
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.086 ± 0.957
LysCys: 0.417 ± 0.191
LysAsp: 1.252 ± 0.012
LysGlu: 4.172 ± 0.741
LysPhe: 3.338 ± 0.359
LysGly: 2.086 ± 0.957
LysHis: 0.417 ± 0.191
LysIle: 0.834 ± 0.383
LysLys: 1.669 ± 0.179
LysLeu: 4.172 ± 0.741
LysMet: 1.252 ± 0.574
LysAsn: 0.417 ± 0.191
LysPro: 2.92 ± 0.753
LysGln: 1.252 ± 0.012
LysArg: 4.589 ± 1.519
LysSer: 4.589 ± 1.519
LysThr: 2.92 ± 1.005
LysVal: 3.338 ± 0.945
LysTrp: 0.834 ± 0.383
LysTyr: 0.834 ± 0.203
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.178 ± 1.865
LeuCys: 2.086 ± 0.216
LeuAsp: 4.589 ± 1.519
LeuGlu: 3.755 ± 0.55
LeuPhe: 3.338 ± 0.228
LeuGly: 5.423 ± 0.443
LeuHis: 1.252 ± 0.012
LeuIle: 3.338 ± 0.359
LeuLys: 3.755 ± 0.55
LeuLeu: 5.423 ± 1.315
LeuMet: 0.834 ± 0.203
LeuAsn: 2.503 ± 1.148
LeuPro: 4.172 ± 2.19
LeuGln: 1.252 ± 0.574
LeuArg: 4.589 ± 0.24
LeuSer: 5.006 ± 0.048
LeuThr: 5.841 ± 0.838
LeuVal: 6.258 ± 1.112
LeuTrp: 1.669 ± 0.407
LeuTyr: 2.086 ± 1.388
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.172 ± 0.431
MetCys: 0.417 ± 0.191
MetAsp: 2.503 ± 1.148
MetGlu: 0.0 ± 0.0
MetPhe: 1.252 ± 0.598
MetGly: 0.417 ± 0.191
MetHis: 0.417 ± 0.191
MetIle: 0.834 ± 0.203
MetLys: 2.086 ± 0.216
MetLeu: 1.252 ± 0.574
MetMet: 1.252 ± 0.012
MetAsn: 0.417 ± 0.191
MetPro: 1.669 ± 0.179
MetGln: 0.417 ± 0.395
MetArg: 2.086 ± 0.957
MetSer: 2.086 ± 0.957
MetThr: 1.669 ± 0.407
MetVal: 1.252 ± 0.012
MetTrp: 0.417 ± 0.191
MetTyr: 2.086 ± 0.957
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.755 ± 0.623
AsnCys: 0.417 ± 0.191
AsnAsp: 0.834 ± 0.383
AsnGlu: 2.086 ± 0.216
AsnPhe: 2.503 ± 0.024
AsnGly: 2.92 ± 0.419
AsnHis: 0.417 ± 0.191
AsnIle: 2.92 ± 0.419
AsnLys: 0.834 ± 0.383
AsnLeu: 2.92 ± 0.753
AsnMet: 0.834 ± 0.383
AsnAsn: 1.252 ± 0.598
AsnPro: 2.503 ± 0.024
AsnGln: 1.669 ± 0.179
AsnArg: 2.086 ± 0.957
AsnSer: 4.589 ± 1.999
AsnThr: 3.755 ± 0.55
AsnVal: 3.338 ± 0.359
AsnTrp: 0.834 ± 0.383
AsnTyr: 0.834 ± 0.79
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.503 ± 0.61
ProCys: 0.834 ± 0.79
ProAsp: 1.669 ± 0.993
ProGlu: 1.669 ± 0.766
ProPhe: 3.755 ± 0.623
ProGly: 2.92 ± 1.005
ProHis: 0.417 ± 0.395
ProIle: 1.669 ± 0.766
ProLys: 1.669 ± 0.179
ProLeu: 5.006 ± 0.048
ProMet: 1.252 ± 0.598
ProAsn: 2.086 ± 0.957
ProPro: 0.834 ± 0.79
ProGln: 2.92 ± 0.419
ProArg: 2.503 ± 0.024
ProSer: 3.338 ± 0.814
ProThr: 5.841 ± 3.183
ProVal: 3.755 ± 0.623
ProTrp: 0.417 ± 0.191
ProTyr: 3.755 ± 0.623
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.92 ± 0.419
GlnCys: 0.417 ± 0.191
GlnAsp: 0.834 ± 0.383
GlnGlu: 2.086 ± 0.957
GlnPhe: 0.417 ± 0.191
GlnGly: 2.086 ± 0.957
GlnHis: 0.417 ± 0.191
GlnIle: 1.669 ± 0.407
GlnLys: 0.834 ± 0.383
GlnLeu: 1.669 ± 0.766
GlnMet: 0.417 ± 0.191
GlnAsn: 0.0 ± 0.0
GlnPro: 2.086 ± 0.216
GlnGln: 1.669 ± 0.407
GlnArg: 2.086 ± 0.216
GlnSer: 3.755 ± 0.623
GlnThr: 0.417 ± 0.191
GlnVal: 2.086 ± 0.216
GlnTrp: 0.417 ± 0.191
GlnTyr: 2.503 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.338 ± 0.228
ArgCys: 0.417 ± 0.395
ArgAsp: 4.172 ± 0.155
ArgGlu: 3.338 ± 1.531
ArgPhe: 2.086 ± 0.216
ArgGly: 3.755 ± 1.136
ArgHis: 1.669 ± 0.179
ArgIle: 3.338 ± 0.945
ArgLys: 2.503 ± 0.562
ArgLeu: 6.675 ± 0.131
ArgMet: 2.503 ± 1.148
ArgAsn: 3.755 ± 0.036
ArgPro: 4.172 ± 0.155
ArgGln: 0.834 ± 0.383
ArgArg: 1.669 ± 0.766
ArgSer: 4.172 ± 1.017
ArgThr: 1.252 ± 0.012
ArgVal: 7.092 ± 0.85
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.503 ± 0.562
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.092 ± 1.437
SerCys: 0.834 ± 0.383
SerAsp: 2.92 ± 0.167
SerGlu: 3.338 ± 0.945
SerPhe: 4.589 ± 0.24
SerGly: 7.927 ± 0.119
SerHis: 0.417 ± 0.191
SerIle: 2.92 ± 0.167
SerLys: 4.172 ± 0.155
SerLeu: 7.509 ± 1.245
SerMet: 3.755 ± 0.55
SerAsn: 3.755 ± 1.209
SerPro: 2.503 ± 0.024
SerGln: 0.834 ± 0.203
SerArg: 4.172 ± 0.155
SerSer: 5.841 ± 1.424
SerThr: 5.841 ± 2.011
SerVal: 5.006 ± 0.048
SerTrp: 0.417 ± 0.191
SerTyr: 1.669 ± 0.407
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.589 ± 0.24
ThrCys: 0.0 ± 0.0
ThrAsp: 4.589 ± 1.999
ThrGlu: 3.338 ± 0.359
ThrPhe: 5.006 ± 1.807
ThrGly: 4.172 ± 1.017
ThrHis: 0.417 ± 0.191
ThrIle: 3.338 ± 1.4
ThrLys: 2.086 ± 0.371
ThrLeu: 4.589 ± 1.412
ThrMet: 2.086 ± 0.802
ThrAsn: 3.338 ± 1.4
ThrPro: 4.589 ± 0.826
ThrGln: 1.669 ± 0.766
ThrArg: 4.172 ± 0.431
ThrSer: 4.172 ± 0.431
ThrThr: 6.675 ± 3.973
ThrVal: 5.423 ± 1.616
ThrTrp: 0.417 ± 0.395
ThrTyr: 0.834 ± 0.203
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.092 ± 1.437
ValCys: 0.834 ± 0.203
ValAsp: 6.258 ± 0.061
ValGlu: 3.338 ± 0.359
ValPhe: 2.92 ± 0.419
ValGly: 6.258 ± 0.526
ValHis: 2.92 ± 0.753
ValIle: 4.172 ± 0.741
ValLys: 4.589 ± 2.105
ValLeu: 3.755 ± 2.381
ValMet: 1.669 ± 0.179
ValAsn: 3.338 ± 0.945
ValPro: 4.172 ± 2.19
ValGln: 2.503 ± 0.024
ValArg: 7.509 ± 1.1
ValSer: 6.258 ± 0.061
ValThr: 5.006 ± 1.807
ValVal: 7.092 ± 0.264
ValTrp: 0.417 ± 0.191
ValTyr: 2.086 ± 0.802
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.834 ± 0.383
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.669 ± 0.179
TrpPhe: 2.086 ± 0.216
TrpGly: 0.417 ± 0.191
TrpHis: 0.0 ± 0.0
TrpIle: 1.252 ± 0.574
TrpLys: 0.417 ± 0.191
TrpLeu: 0.834 ± 0.203
TrpMet: 0.417 ± 0.191
TrpAsn: 0.834 ± 0.383
TrpPro: 0.417 ± 0.395
TrpGln: 0.834 ± 0.383
TrpArg: 0.417 ± 0.395
TrpSer: 0.417 ± 0.191
TrpThr: 0.834 ± 0.79
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.834 ± 0.383
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.755 ± 0.036
TyrCys: 1.252 ± 1.185
TyrAsp: 0.834 ± 0.203
TyrGlu: 0.834 ± 0.383
TyrPhe: 1.669 ± 0.179
TyrGly: 2.92 ± 0.419
TyrHis: 0.0 ± 0.0
TyrIle: 2.503 ± 1.197
TyrLys: 1.669 ± 0.179
TyrLeu: 1.252 ± 0.012
TyrMet: 0.834 ± 0.383
TyrAsn: 1.252 ± 0.012
TyrPro: 1.252 ± 0.012
TyrGln: 1.669 ± 0.179
TyrArg: 2.92 ± 0.167
TyrSer: 3.755 ± 0.55
TyrThr: 2.086 ± 0.216
TyrVal: 3.755 ± 0.623
TyrTrp: 1.669 ± 0.407
TyrTyr: 1.669 ± 0.179
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2398 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
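
A minimal sketch of what protein-level bootstrapping can look like: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. It assumes one-letter codes, overlapping pairs counted within each protein, and the population standard deviation as the error measure. The function names, the fixed seed, and the choice of standard deviation are assumptions for illustration; the page does not spell out these details.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide over a list of protein sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency estimate under resampling of whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AACAAG", "GCAGT"]  # toy data, not the real proteome
    print(pair_per_mille(toy_proteome, "AA"))
    print(bootstrap_error(toy_proteome, "AA"))
```

With only two proteins, each resample is one of just three possible combinations (the first protein twice, the second twice, or one of each), so the error estimates can take only a limited set of values.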

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski