Amino acid dipepetide frequency for Xingshan nematode virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.704AlaAla: 2.704 ± 1.09
0.811AlaCys: 0.811 ± 0.417
4.597AlaAsp: 4.597 ± 1.889
2.975AlaGlu: 2.975 ± 0.134
3.245AlaPhe: 3.245 ± 0.707
1.622AlaGly: 1.622 ± 0.703
1.082AlaHis: 1.082 ± 0.66
4.327AlaIle: 4.327 ± 2.454
2.163AlaLys: 2.163 ± 0.676
6.76AlaLeu: 6.76 ± 1.291
1.352AlaMet: 1.352 ± 0.694
1.622AlaAsn: 1.622 ± 0.746
0.541AlaPro: 0.541 ± 0.339
1.082AlaGln: 1.082 ± 0.66
1.622AlaArg: 1.622 ± 0.435
1.352AlaSer: 1.352 ± 0.668
2.975AlaThr: 2.975 ± 1.528
5.138AlaVal: 5.138 ± 1.342
0.27AlaTrp: 0.27 ± 0.799
1.893AlaTyr: 1.893 ± 0.762
0.0AlaXaa: 0.0 ± 0.0
Cys
1.352CysAla: 1.352 ± 0.336
1.622CysCys: 1.622 ± 0.833
1.352CysAsp: 1.352 ± 0.694
1.622CysGlu: 1.622 ± 0.833
1.352CysPhe: 1.352 ± 0.694
2.163CysGly: 2.163 ± 1.111
0.0CysHis: 0.0 ± 0.0
0.541CysIle: 0.541 ± 0.278
0.811CysLys: 0.811 ± 0.417
2.704CysLeu: 2.704 ± 0.801
0.541CysMet: 0.541 ± 0.278
0.541CysAsn: 0.541 ± 0.879
1.622CysPro: 1.622 ± 0.435
0.27CysGln: 0.27 ± 0.139
1.893CysArg: 1.893 ± 0.531
2.163CysSer: 2.163 ± 0.676
0.811CysThr: 0.811 ± 0.417
2.975CysVal: 2.975 ± 1.072
0.0CysTrp: 0.0 ± 0.0
0.541CysTyr: 0.541 ± 0.278
0.0CysXaa: 0.0 ± 0.0
Asp
3.515AspAla: 3.515 ± 1.805
1.893AspCys: 1.893 ± 0.972
5.949AspAsp: 5.949 ± 2.015
2.975AspGlu: 2.975 ± 0.597
4.867AspPhe: 4.867 ± 0.406
4.056AspGly: 4.056 ± 1.616
2.163AspHis: 2.163 ± 0.395
2.975AspIle: 2.975 ± 1.072
4.327AspLys: 4.327 ± 1.753
8.383AspLeu: 8.383 ± 2.125
2.163AspMet: 2.163 ± 0.676
4.056AspAsn: 4.056 ± 1.07
1.622AspPro: 1.622 ± 0.746
0.811AspGln: 0.811 ± 0.275
5.408AspArg: 5.408 ± 1.365
3.786AspSer: 3.786 ± 1.103
2.163AspThr: 2.163 ± 0.873
6.76AspVal: 6.76 ± 0.154
0.541AspTrp: 0.541 ± 0.339
3.245AspTyr: 3.245 ± 1.101
0.0AspXaa: 0.0 ± 0.0
Glu
1.082GluAla: 1.082 ± 0.274
2.163GluCys: 2.163 ± 1.111
2.163GluAsp: 2.163 ± 0.676
2.163GluGlu: 2.163 ± 1.111
4.056GluPhe: 4.056 ± 0.531
3.515GluGly: 3.515 ± 0.287
0.811GluHis: 0.811 ± 0.417
4.597GluIle: 4.597 ± 0.803
4.327GluLys: 4.327 ± 1.098
4.327GluLeu: 4.327 ± 1.753
1.622GluMet: 1.622 ± 1.364
4.056GluAsn: 4.056 ± 0.531
0.811GluPro: 0.811 ± 0.417
0.811GluGln: 0.811 ± 0.417
2.975GluArg: 2.975 ± 0.799
2.704GluSer: 2.704 ± 0.672
1.893GluThr: 1.893 ± 0.531
4.327GluVal: 4.327 ± 1.098
0.541GluTrp: 0.541 ± 0.278
1.622GluTyr: 1.622 ± 0.464
0.0GluXaa: 0.0 ± 0.0
Phe
1.082PheAla: 1.082 ± 0.555
2.434PheCys: 2.434 ± 0.597
2.975PheAsp: 2.975 ± 1.072
3.245PheGlu: 3.245 ± 1.539
3.786PhePhe: 3.786 ± 3.249
4.327PheGly: 4.327 ± 1.189
0.541PheHis: 0.541 ± 0.339
3.245PheIle: 3.245 ± 0.162
4.327PheLys: 4.327 ± 1.54
5.408PheLeu: 5.408 ± 2.415
1.622PheMet: 1.622 ± 1.037
3.245PheAsn: 3.245 ± 2.079
1.082PhePro: 1.082 ± 0.274
1.082PheGln: 1.082 ± 0.555
3.786PheArg: 3.786 ± 1.419
6.49PheSer: 6.49 ± 1.935
5.679PheThr: 5.679 ± 2.32
8.924PheVal: 8.924 ± 2.185
0.811PheTrp: 0.811 ± 1.159
3.515PheTyr: 3.515 ± 2.129
0.0PheXaa: 0.0 ± 0.0
Gly
1.622GlyAla: 1.622 ± 0.464
1.622GlyCys: 1.622 ± 0.551
2.975GlyAsp: 2.975 ± 1.528
2.975GlyGlu: 2.975 ± 1.146
3.515GlyPhe: 3.515 ± 0.991
3.245GlyGly: 3.245 ± 1.262
0.811GlyHis: 0.811 ± 0.417
2.975GlyIle: 2.975 ± 1.146
2.434GlyLys: 2.434 ± 1.25
3.245GlyLeu: 3.245 ± 1.053
0.811GlyMet: 0.811 ± 0.275
2.434GlyAsn: 2.434 ± 0.597
1.082GlyPro: 1.082 ± 1.46
0.27GlyGln: 0.27 ± 0.799
3.515GlyArg: 3.515 ± 0.27
4.056GlySer: 4.056 ± 1.616
2.704GlyThr: 2.704 ± 1.363
5.138GlyVal: 5.138 ± 1.608
0.541GlyTrp: 0.541 ± 0.278
2.434GlyTyr: 2.434 ± 0.343
0.0GlyXaa: 0.0 ± 0.0
His
1.082HisAla: 1.082 ± 0.274
1.082HisCys: 1.082 ± 0.555
0.811HisAsp: 0.811 ± 1.159
1.622HisGlu: 1.622 ± 0.435
0.811HisPhe: 0.811 ± 0.682
0.27HisGly: 0.27 ± 0.439
0.541HisHis: 0.541 ± 0.339
1.352HisIle: 1.352 ± 0.336
0.27HisLys: 0.27 ± 0.799
2.704HisLeu: 2.704 ± 1.225
0.0HisMet: 0.0 ± 0.0
0.811HisAsn: 0.811 ± 0.275
0.0HisPro: 0.0 ± 0.0
0.27HisGln: 0.27 ± 0.139
1.352HisArg: 1.352 ± 0.694
1.082HisSer: 1.082 ± 0.555
0.27HisThr: 0.27 ± 0.139
1.622HisVal: 1.622 ± 0.435
0.541HisTrp: 0.541 ± 0.339
2.434HisTyr: 2.434 ± 0.343
0.0HisXaa: 0.0 ± 0.0
Ile
2.704IleAla: 2.704 ± 0.938
0.811IleCys: 0.811 ± 0.275
7.842IleAsp: 7.842 ± 2.42
3.245IleGlu: 3.245 ± 1.666
4.597IlePhe: 4.597 ± 2.042
1.893IleGly: 1.893 ± 0.531
0.541IleHis: 0.541 ± 0.278
2.434IleIle: 2.434 ± 0.429
3.786IleLys: 3.786 ± 1.088
7.572IleLeu: 7.572 ± 2.177
2.704IleMet: 2.704 ± 1.335
2.434IleAsn: 2.434 ± 1.25
1.622IlePro: 1.622 ± 0.435
1.622IleGln: 1.622 ± 1.542
2.975IleArg: 2.975 ± 0.134
3.786IleSer: 3.786 ± 1.479
1.622IleThr: 1.622 ± 0.464
4.327IleVal: 4.327 ± 0.215
0.811IleTrp: 0.811 ± 0.417
4.597IleTyr: 4.597 ± 0.333
0.0IleXaa: 0.0 ± 0.0
Lys
3.245LysAla: 3.245 ± 0.707
1.082LysCys: 1.082 ± 0.555
3.786LysAsp: 3.786 ± 1.524
3.245LysGlu: 3.245 ± 0.707
5.408LysPhe: 5.408 ± 1.447
2.434LysGly: 2.434 ± 0.597
0.541LysHis: 0.541 ± 0.278
4.056LysIle: 4.056 ± 1.008
4.056LysLys: 4.056 ± 1.226
4.867LysLeu: 4.867 ± 2.02
0.811LysMet: 0.811 ± 0.485
3.515LysAsn: 3.515 ± 1.343
2.434LysPro: 2.434 ± 1.134
1.352LysGln: 1.352 ± 0.668
2.975LysArg: 2.975 ± 0.134
4.327LysSer: 4.327 ± 0.666
2.704LysThr: 2.704 ± 0.501
4.597LysVal: 4.597 ± 1.482
0.541LysTrp: 0.541 ± 0.278
3.245LysTyr: 3.245 ± 1.053
0.0LysXaa: 0.0 ± 0.0
Leu
7.301LeuAla: 7.301 ± 2.517
1.352LeuCys: 1.352 ± 0.694
5.408LeuAsp: 5.408 ± 1.343
6.22LeuGlu: 6.22 ± 1.145
4.327LeuPhe: 4.327 ± 4.394
2.975LeuGly: 2.975 ± 0.597
1.893LeuHis: 1.893 ± 0.61
7.842LeuIle: 7.842 ± 1.2
4.867LeuLys: 4.867 ± 0.858
11.628LeuLeu: 11.628 ± 7.979
3.786LeuMet: 3.786 ± 1.419
4.327LeuAsn: 4.327 ± 1.602
3.515LeuPro: 3.515 ± 0.287
2.434LeuGln: 2.434 ± 1.321
7.031LeuArg: 7.031 ± 1.139
9.735LeuSer: 9.735 ± 3.67
3.515LeuThr: 3.515 ± 1.355
7.572LeuVal: 7.572 ± 1.599
1.082LeuTrp: 1.082 ± 0.555
2.975LeuTyr: 2.975 ± 1.072
0.0LeuXaa: 0.0 ± 0.0
Met
1.893MetAla: 1.893 ± 0.408
0.27MetCys: 0.27 ± 0.139
1.352MetAsp: 1.352 ± 0.602
1.082MetGlu: 1.082 ± 0.274
0.811MetPhe: 0.811 ± 0.773
1.352MetGly: 1.352 ± 0.336
0.811MetHis: 0.811 ± 0.275
2.434MetIle: 2.434 ± 0.806
1.352MetLys: 1.352 ± 0.668
4.056MetLeu: 4.056 ± 1.882
0.27MetMet: 0.27 ± 0.139
1.893MetAsn: 1.893 ± 0.551
0.811MetPro: 0.811 ± 0.275
0.0MetGln: 0.0 ± 0.0
1.082MetArg: 1.082 ± 0.274
1.622MetSer: 1.622 ± 0.551
1.622MetThr: 1.622 ± 0.435
2.163MetVal: 2.163 ± 1.192
0.27MetTrp: 0.27 ± 0.139
0.811MetTyr: 0.811 ± 0.417
0.0MetXaa: 0.0 ± 0.0
Asn
2.975AsnAla: 2.975 ± 0.134
1.893AsnCys: 1.893 ± 0.551
3.515AsnAsp: 3.515 ± 0.983
2.434AsnGlu: 2.434 ± 0.806
4.867AsnPhe: 4.867 ± 1.194
2.434AsnGly: 2.434 ± 0.933
0.27AsnHis: 0.27 ± 0.139
2.163AsnIle: 2.163 ± 1.311
4.056AsnLys: 4.056 ± 0.531
3.245AsnLeu: 3.245 ± 0.87
0.541AsnMet: 0.541 ± 0.265
2.434AsnAsn: 2.434 ± 1.25
1.352AsnPro: 1.352 ± 0.602
0.541AsnGln: 0.541 ± 0.339
3.515AsnArg: 3.515 ± 0.866
2.163AsnSer: 2.163 ± 1.623
1.352AsnThr: 1.352 ± 0.336
5.138AsnVal: 5.138 ± 2.018
0.541AsnTrp: 0.541 ± 0.339
1.352AsnTyr: 1.352 ± 0.602
0.0AsnXaa: 0.0 ± 0.0
Pro
0.811ProAla: 0.811 ± 2.398
0.0ProCys: 0.0 ± 0.0
2.163ProAsp: 2.163 ± 1.192
1.622ProGlu: 1.622 ± 0.435
2.975ProPhe: 2.975 ± 1.095
1.352ProGly: 1.352 ± 0.694
0.27ProHis: 0.27 ± 0.139
1.622ProIle: 1.622 ± 0.551
1.622ProLys: 1.622 ± 0.703
4.056ProLeu: 4.056 ± 2.05
0.27ProMet: 0.27 ± 0.139
0.541ProAsn: 0.541 ± 0.339
0.541ProPro: 0.541 ± 0.893
1.082ProGln: 1.082 ± 0.655
1.893ProArg: 1.893 ± 0.972
2.163ProSer: 2.163 ± 1.311
0.27ProThr: 0.27 ± 0.799
3.515ProVal: 3.515 ± 1.459
0.27ProTrp: 0.27 ± 0.139
1.622ProTyr: 1.622 ± 0.551
0.0ProXaa: 0.0 ± 0.0
Gln
0.27GlnAla: 0.27 ± 0.139
0.27GlnCys: 0.27 ± 0.139
0.0GlnAsp: 0.0 ± 0.0
1.893GlnGlu: 1.893 ± 0.551
1.082GlnPhe: 1.082 ± 0.274
1.082GlnGly: 1.082 ± 0.66
0.27GlnHis: 0.27 ± 0.139
1.893GlnIle: 1.893 ± 0.408
1.082GlnLys: 1.082 ± 0.66
1.082GlnLeu: 1.082 ± 0.555
0.541GlnMet: 0.541 ± 0.278
1.622GlnAsn: 1.622 ± 0.435
0.541GlnPro: 0.541 ± 0.339
0.541GlnGln: 0.541 ± 0.73
1.893GlnArg: 1.893 ± 0.408
0.27GlnSer: 0.27 ± 0.139
1.082GlnThr: 1.082 ± 0.66
1.622GlnVal: 1.622 ± 2.401
0.0GlnTrp: 0.0 ± 0.0
0.811GlnTyr: 0.811 ± 0.275
0.0GlnXaa: 0.0 ± 0.0
Arg
2.434ArgAla: 2.434 ± 0.597
1.622ArgCys: 1.622 ± 0.435
5.138ArgAsp: 5.138 ± 1.342
2.434ArgGlu: 2.434 ± 0.806
2.975ArgPhe: 2.975 ± 0.765
2.163ArgGly: 2.163 ± 1.311
2.975ArgHis: 2.975 ± 1.095
5.138ArgIle: 5.138 ± 1.342
4.597ArgLys: 4.597 ± 2.361
5.408ArgLeu: 5.408 ± 0.441
1.622ArgMet: 1.622 ± 0.435
3.515ArgAsn: 3.515 ± 0.826
1.622ArgPro: 1.622 ± 1.542
0.811ArgGln: 0.811 ± 0.417
3.786ArgArg: 3.786 ± 0.929
3.245ArgSer: 3.245 ± 1.539
3.515ArgThr: 3.515 ± 0.863
5.949ArgVal: 5.949 ± 0.669
0.541ArgTrp: 0.541 ± 0.879
4.056ArgTyr: 4.056 ± 1.377
0.0ArgXaa: 0.0 ± 0.0
Ser
4.327SerAla: 4.327 ± 1.54
2.163SerCys: 2.163 ± 1.111
5.408SerAsp: 5.408 ± 0.733
2.975SerGlu: 2.975 ± 2.098
4.597SerPhe: 4.597 ± 2.306
3.786SerGly: 3.786 ± 0.95
1.082SerHis: 1.082 ± 0.555
4.327SerIle: 4.327 ± 1.096
2.975SerLys: 2.975 ± 0.799
5.949SerLeu: 5.949 ± 0.669
1.622SerMet: 1.622 ± 1.018
2.163SerAsn: 2.163 ± 0.548
2.163SerPro: 2.163 ± 0.475
1.352SerGln: 1.352 ± 0.602
5.138SerArg: 5.138 ± 1.706
5.408SerSer: 5.408 ± 2.263
5.138SerThr: 5.138 ± 2.37
6.76SerVal: 6.76 ± 0.976
0.541SerTrp: 0.541 ± 0.339
2.975SerTyr: 2.975 ± 0.765
0.0SerXaa: 0.0 ± 0.0
Thr
2.975ThrAla: 2.975 ± 1.072
0.811ThrCys: 0.811 ± 0.275
2.704ThrAsp: 2.704 ± 0.221
1.622ThrGlu: 1.622 ± 0.435
5.408ThrPhe: 5.408 ± 1.968
2.434ThrGly: 2.434 ± 1.203
0.811ThrHis: 0.811 ± 0.773
3.245ThrIle: 3.245 ± 1.111
2.975ThrLys: 2.975 ± 1.146
3.515ThrLeu: 3.515 ± 1.4
1.082ThrMet: 1.082 ± 0.679
1.082ThrAsn: 1.082 ± 0.274
1.893ThrPro: 1.893 ± 3.026
0.811ThrGln: 0.811 ± 0.417
2.163ThrArg: 2.163 ± 0.475
3.515ThrSer: 3.515 ± 1.4
2.704ThrThr: 2.704 ± 2.812
3.786ThrVal: 3.786 ± 0.397
0.27ThrTrp: 0.27 ± 0.139
2.163ThrTyr: 2.163 ± 0.548
0.0ThrXaa: 0.0 ± 0.0
Val
4.867ValAla: 4.867 ± 1.341
1.622ValCys: 1.622 ± 1.018
8.383ValAsp: 8.383 ± 1.738
5.408ValGlu: 5.408 ± 2.304
6.49ValPhe: 6.49 ± 3.918
3.515ValGly: 3.515 ± 1.382
1.893ValHis: 1.893 ± 0.938
3.245ValIle: 3.245 ± 1.262
6.49ValLys: 6.49 ± 1.56
8.383ValLeu: 8.383 ± 2.996
2.434ValMet: 2.434 ± 0.933
2.975ValAsn: 2.975 ± 0.597
3.786ValPro: 3.786 ± 1.524
1.352ValGln: 1.352 ± 0.694
4.867ValArg: 4.867 ± 1.652
8.924ValSer: 8.924 ± 1.468
4.597ValThr: 4.597 ± 0.444
8.112ValVal: 8.112 ± 2.168
0.27ValTrp: 0.27 ± 0.139
4.327ValTyr: 4.327 ± 1.745
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.27TrpCys: 0.27 ± 0.139
0.811TrpAsp: 0.811 ± 0.275
0.27TrpGlu: 0.27 ± 0.139
0.27TrpPhe: 0.27 ± 0.139
0.811TrpGly: 0.811 ± 0.682
0.27TrpHis: 0.27 ± 0.139
0.541TrpIle: 0.541 ± 0.278
1.082TrpLys: 1.082 ± 0.274
1.352TrpLeu: 1.352 ± 1.37
0.0TrpMet: 0.0 ± 0.0
0.27TrpAsn: 0.27 ± 0.439
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.811TrpArg: 0.811 ± 0.275
0.541TrpSer: 0.541 ± 0.278
0.27TrpThr: 0.27 ± 0.139
0.27TrpVal: 0.27 ± 0.139
0.0TrpTrp: 0.0 ± 0.0
0.541TrpTyr: 0.541 ± 0.339
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.893TyrAla: 1.893 ± 0.551
1.082TyrCys: 1.082 ± 0.555
4.327TyrAsp: 4.327 ± 0.517
0.541TyrGlu: 0.541 ± 0.339
2.163TyrPhe: 2.163 ± 0.873
2.975TyrGly: 2.975 ± 0.134
1.352TyrHis: 1.352 ± 0.551
2.975TyrIle: 2.975 ± 0.597
1.893TyrLys: 1.893 ± 0.408
5.138TyrLeu: 5.138 ± 0.56
1.893TyrMet: 1.893 ± 1.094
3.245TyrAsn: 3.245 ± 0.162
1.622TyrPro: 1.622 ± 0.435
1.352TyrGln: 1.352 ± 0.602
4.867TyrArg: 4.867 ± 1.305
3.515TyrSer: 3.515 ± 1.956
1.082TyrThr: 1.082 ± 0.274
3.245TyrVal: 3.245 ± 0.822
0.0TyrTrp: 0.0 ± 0.0
3.786TyrTyr: 3.786 ± 1.419
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3699 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski