Amino acid dipepetide frequency for Shuangao Bedbug Virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.887AlaAla: 2.887 ± 1.545
0.962AlaCys: 0.962 ± 0.599
1.283AlaAsp: 1.283 ± 0.687
2.887AlaGlu: 2.887 ± 0.985
1.604AlaPhe: 1.604 ± 1.011
2.246AlaGly: 2.246 ± 0.804
1.283AlaHis: 1.283 ± 0.687
3.529AlaIle: 3.529 ± 1.282
1.604AlaLys: 1.604 ± 0.859
4.171AlaLeu: 4.171 ± 0.454
0.642AlaMet: 0.642 ± 0.343
3.208AlaAsn: 3.208 ± 1.142
1.604AlaPro: 1.604 ± 1.435
1.604AlaGln: 1.604 ± 0.859
2.246AlaArg: 2.246 ± 0.59
2.567AlaSer: 2.567 ± 0.34
1.604AlaThr: 1.604 ± 0.409
1.925AlaVal: 1.925 ± 0.76
0.321AlaTrp: 0.321 ± 0.792
0.321AlaTyr: 0.321 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.962CysAla: 0.962 ± 0.515
0.0CysCys: 0.0 ± 0.0
0.642CysAsp: 0.642 ± 0.343
0.962CysGlu: 0.962 ± 0.716
0.642CysPhe: 0.642 ± 0.343
0.321CysGly: 0.321 ± 0.426
1.604CysHis: 1.604 ± 0.409
1.925CysIle: 1.925 ± 0.631
1.604CysLys: 1.604 ± 0.512
2.887CysLeu: 2.887 ± 0.429
0.321CysMet: 0.321 ± 0.172
1.283CysAsn: 1.283 ± 0.598
0.642CysPro: 0.642 ± 0.343
0.962CysGln: 0.962 ± 0.237
0.962CysArg: 0.962 ± 0.515
0.642CysSer: 0.642 ± 0.343
0.642CysThr: 0.642 ± 0.971
0.321CysVal: 0.321 ± 0.426
0.321CysTrp: 0.321 ± 0.172
0.962CysTyr: 0.962 ± 0.515
0.0CysXaa: 0.0 ± 0.0
Asp
1.604AspAla: 1.604 ± 0.513
0.0AspCys: 0.0 ± 0.0
1.604AspAsp: 1.604 ± 0.409
1.604AspGlu: 1.604 ± 0.859
2.887AspPhe: 2.887 ± 1.047
2.246AspGly: 2.246 ± 0.727
0.321AspHis: 0.321 ± 0.172
3.85AspIle: 3.85 ± 0.788
2.246AspLys: 2.246 ± 0.325
7.379AspLeu: 7.379 ± 0.859
0.0AspMet: 0.0 ± 0.0
1.604AspAsn: 1.604 ± 0.859
4.812AspPro: 4.812 ± 1.059
0.642AspGln: 0.642 ± 0.681
1.283AspArg: 1.283 ± 0.687
2.246AspSer: 2.246 ± 1.308
1.604AspThr: 1.604 ± 0.409
0.642AspVal: 0.642 ± 0.343
1.283AspTrp: 1.283 ± 0.598
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.887GluAla: 2.887 ± 0.254
0.321GluCys: 0.321 ± 0.426
2.887GluAsp: 2.887 ± 0.429
2.887GluGlu: 2.887 ± 0.685
4.171GluPhe: 4.171 ± 0.173
2.567GluGly: 2.567 ± 0.573
2.246GluHis: 2.246 ± 1.12
4.171GluIle: 4.171 ± 0.72
3.85GluLys: 3.85 ± 1.446
6.096GluLeu: 6.096 ± 1.829
0.962GluMet: 0.962 ± 0.515
0.962GluAsn: 0.962 ± 0.515
1.604GluPro: 1.604 ± 0.859
3.208GluGln: 3.208 ± 2.648
1.283GluArg: 1.283 ± 0.687
3.529GluSer: 3.529 ± 1.816
3.208GluThr: 3.208 ± 0.558
0.962GluVal: 0.962 ± 0.599
1.604GluTrp: 1.604 ± 1.011
1.604GluTyr: 1.604 ± 1.27
0.0GluXaa: 0.0 ± 0.0
Phe
1.283PheAla: 1.283 ± 0.687
0.321PheCys: 0.321 ± 0.172
1.604PheAsp: 1.604 ± 1.011
3.529PheGlu: 3.529 ± 0.776
2.567PhePhe: 2.567 ± 0.573
2.246PheGly: 2.246 ± 0.717
2.887PheHis: 2.887 ± 0.984
4.812PheIle: 4.812 ± 1.572
4.171PheLys: 4.171 ± 0.702
6.096PheLeu: 6.096 ± 1.068
0.0PheMet: 0.0 ± 0.0
3.208PheAsn: 3.208 ± 1.13
3.85PhePro: 3.85 ± 1.446
2.246PheGln: 2.246 ± 0.497
2.567PheArg: 2.567 ± 0.881
5.775PheSer: 5.775 ± 0.406
2.246PheThr: 2.246 ± 0.717
1.604PheVal: 1.604 ± 0.409
0.321PheTrp: 0.321 ± 0.172
2.567PheTyr: 2.567 ± 0.881
0.0PheXaa: 0.0 ± 0.0
Gly
1.283GlyAla: 1.283 ± 0.287
0.962GlyCys: 0.962 ± 0.716
2.246GlyAsp: 2.246 ± 0.804
0.642GlyGlu: 0.642 ± 0.852
2.567GlyPhe: 2.567 ± 0.573
2.246GlyGly: 2.246 ± 0.497
1.604GlyHis: 1.604 ± 0.931
3.85GlyIle: 3.85 ± 1.115
1.925GlyLys: 1.925 ± 0.631
4.491GlyLeu: 4.491 ± 0.758
0.0GlyMet: 0.0 ± 0.368
1.925GlyAsn: 1.925 ± 1.29
2.567GlyPro: 2.567 ± 0.42
2.887GlyGln: 2.887 ± 1.1
1.604GlyArg: 1.604 ± 0.409
3.85GlySer: 3.85 ± 1.314
0.962GlyThr: 0.962 ± 0.237
2.887GlyVal: 2.887 ± 2.087
1.283GlyTrp: 1.283 ± 0.598
1.925GlyTyr: 1.925 ± 0.557
0.0GlyXaa: 0.0 ± 0.0
His
1.604HisAla: 1.604 ± 0.571
0.321HisCys: 0.321 ± 0.426
1.604HisAsp: 1.604 ± 0.859
1.283HisGlu: 1.283 ± 0.687
1.283HisPhe: 1.283 ± 0.287
1.604HisGly: 1.604 ± 0.512
1.283HisHis: 1.283 ± 0.657
2.567HisIle: 2.567 ± 0.34
0.962HisLys: 0.962 ± 0.599
5.454HisLeu: 5.454 ± 0.423
0.0HisMet: 0.0 ± 0.0
2.887HisAsn: 2.887 ± 0.781
4.491HisPro: 4.491 ± 1.433
0.962HisGln: 0.962 ± 0.515
1.925HisArg: 1.925 ± 0.394
2.567HisSer: 2.567 ± 0.34
1.925HisThr: 1.925 ± 2.913
0.962HisVal: 0.962 ± 0.237
0.321HisTrp: 0.321 ± 0.172
3.208HisTyr: 3.208 ± 1.027
0.0HisXaa: 0.0 ± 0.0
Ile
2.567IleAla: 2.567 ± 0.573
0.962IleCys: 0.962 ± 0.515
1.604IleAsp: 1.604 ± 0.512
3.208IleGlu: 3.208 ± 0.558
5.133IlePhe: 5.133 ± 0.879
2.246IleGly: 2.246 ± 0.325
2.887IleHis: 2.887 ± 1.545
4.812IleIle: 4.812 ± 1.354
4.171IleLys: 4.171 ± 0.967
9.304IleLeu: 9.304 ± 1.418
1.604IleMet: 1.604 ± 1.27
6.096IleAsn: 6.096 ± 1.829
5.775IlePro: 5.775 ± 0.406
1.925IleGln: 1.925 ± 1.303
2.567IleArg: 2.567 ± 0.881
10.908IleSer: 10.908 ± 0.046
6.737IleThr: 6.737 ± 0.69
2.887IleVal: 2.887 ± 0.254
1.604IleTrp: 1.604 ± 0.513
4.491IleTyr: 4.491 ± 1.469
0.0IleXaa: 0.0 ± 0.0
Lys
1.604LysAla: 1.604 ± 0.513
1.604LysCys: 1.604 ± 0.409
2.567LysAsp: 2.567 ± 1.374
3.208LysGlu: 3.208 ± 1.755
3.208LysPhe: 3.208 ± 1.186
2.246LysGly: 2.246 ± 1.308
1.283LysHis: 1.283 ± 0.559
6.416LysIle: 6.416 ± 0.56
2.887LysLys: 2.887 ± 4.4
6.416LysLeu: 6.416 ± 0.667
0.962LysMet: 0.962 ± 0.403
4.171LysAsn: 4.171 ± 0.908
1.925LysPro: 1.925 ± 0.474
1.925LysGln: 1.925 ± 1.03
5.454LysArg: 5.454 ± 1.519
5.133LysSer: 5.133 ± 0.841
8.021LysThr: 8.021 ± 1.093
1.925LysVal: 1.925 ± 0.557
1.604LysTrp: 1.604 ± 0.409
1.604LysTyr: 1.604 ± 0.513
0.0LysXaa: 0.0 ± 0.0
Leu
4.491LeuAla: 4.491 ± 0.758
3.529LeuCys: 3.529 ± 1.384
2.567LeuAsp: 2.567 ± 1.313
6.416LeuGlu: 6.416 ± 1.799
5.454LeuPhe: 5.454 ± 0.399
3.85LeuGly: 3.85 ± 0.862
5.133LeuHis: 5.133 ± 0.879
8.662LeuIle: 8.662 ± 1.841
10.908LeuLys: 10.908 ± 2.403
11.229LeuLeu: 11.229 ± 0.851
4.171LeuMet: 4.171 ± 0.967
5.454LeuAsn: 5.454 ± 2.405
4.171LeuPro: 4.171 ± 0.908
2.246LeuGln: 2.246 ± 0.717
5.133LeuArg: 5.133 ± 1.369
12.512LeuSer: 12.512 ± 1.034
8.021LeuThr: 8.021 ± 0.74
5.454LeuVal: 5.454 ± 2.255
0.321LeuTrp: 0.321 ± 0.426
5.133LeuTyr: 5.133 ± 0.879
0.0LeuXaa: 0.0 ± 0.0
Met
0.321MetAla: 0.321 ± 0.792
0.321MetCys: 0.321 ± 0.426
1.925MetAsp: 1.925 ± 1.03
1.925MetGlu: 1.925 ± 0.631
1.604MetPhe: 1.604 ± 0.409
2.567MetGly: 2.567 ± 0.573
0.321MetHis: 0.321 ± 0.172
2.246MetIle: 2.246 ± 0.717
0.642MetLys: 0.642 ± 0.681
3.208MetLeu: 3.208 ± 0.725
0.0MetMet: 0.0 ± 0.0
0.962MetAsn: 0.962 ± 0.515
0.321MetPro: 0.321 ± 0.172
0.321MetGln: 0.321 ± 0.172
0.962MetArg: 0.962 ± 0.237
1.283MetSer: 1.283 ± 0.287
1.283MetThr: 1.283 ± 0.687
1.604MetVal: 1.604 ± 0.409
0.321MetTrp: 0.321 ± 0.792
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.604AsnAla: 1.604 ± 1.27
0.962AsnCys: 0.962 ± 0.515
1.604AsnAsp: 1.604 ± 0.571
3.85AsnGlu: 3.85 ± 0.86
1.925AsnPhe: 1.925 ± 1.03
1.925AsnGly: 1.925 ± 1.29
2.567AsnHis: 2.567 ± 1.72
4.171AsnIle: 4.171 ± 1.024
4.171AsnLys: 4.171 ± 1.672
8.021AsnLeu: 8.021 ± 1.987
0.962AsnMet: 0.962 ± 0.237
2.246AsnAsn: 2.246 ± 1.202
2.246AsnPro: 2.246 ± 0.804
1.604AsnGln: 1.604 ± 0.513
0.962AsnArg: 0.962 ± 0.237
6.416AsnSer: 6.416 ± 1.651
1.925AsnThr: 1.925 ± 0.557
2.567AsnVal: 2.567 ± 1.313
0.962AsnTrp: 0.962 ± 0.515
4.171AsnTyr: 4.171 ± 0.173
0.0AsnXaa: 0.0 ± 0.0
Pro
1.604ProAla: 1.604 ± 0.409
0.962ProCys: 0.962 ± 0.599
2.887ProAsp: 2.887 ± 1.047
2.567ProGlu: 2.567 ± 0.34
2.887ProPhe: 2.887 ± 1.545
1.604ProGly: 1.604 ± 1.564
2.246ProHis: 2.246 ± 0.325
2.887ProIle: 2.887 ± 0.712
2.567ProLys: 2.567 ± 0.95
5.775ProLeu: 5.775 ± 2.094
1.283ProMet: 1.283 ± 0.287
3.529ProAsn: 3.529 ± 1.026
4.171ProPro: 4.171 ± 0.967
2.567ProGln: 2.567 ± 0.34
1.604ProArg: 1.604 ± 0.512
6.737ProSer: 6.737 ± 0.863
2.887ProThr: 2.887 ± 0.685
3.208ProVal: 3.208 ± 1.949
0.642ProTrp: 0.642 ± 0.971
2.567ProTyr: 2.567 ± 0.42
0.0ProXaa: 0.0 ± 0.0
Gln
2.246GlnAla: 2.246 ± 0.59
0.962GlnCys: 0.962 ± 0.237
2.246GlnAsp: 2.246 ± 0.717
2.887GlnGlu: 2.887 ± 2.628
3.529GlnPhe: 3.529 ± 0.133
0.962GlnGly: 0.962 ± 0.237
1.283GlnHis: 1.283 ± 0.287
1.604GlnIle: 1.604 ± 0.859
2.246GlnLys: 2.246 ± 1.182
3.85GlnLeu: 3.85 ± 0.949
0.962GlnMet: 0.962 ± 0.237
1.925GlnAsn: 1.925 ± 0.76
2.246GlnPro: 2.246 ± 0.497
0.642GlnGln: 0.642 ± 0.343
2.246GlnArg: 2.246 ± 0.325
2.567GlnSer: 2.567 ± 0.573
2.246GlnThr: 2.246 ± 0.727
0.321GlnVal: 0.321 ± 0.172
0.642GlnTrp: 0.642 ± 0.681
0.642GlnTyr: 0.642 ± 0.343
0.0GlnXaa: 0.0 ± 0.0
Arg
1.604ArgAla: 1.604 ± 0.409
0.642ArgCys: 0.642 ± 0.343
1.283ArgAsp: 1.283 ± 0.287
1.925ArgGlu: 1.925 ± 0.557
3.85ArgPhe: 3.85 ± 1.115
0.642ArgGly: 0.642 ± 0.299
0.321ArgHis: 0.321 ± 0.172
3.85ArgIle: 3.85 ± 0.862
2.887ArgLys: 2.887 ± 1.047
4.171ArgLeu: 4.171 ± 1.598
0.962ArgMet: 0.962 ± 0.515
0.962ArgAsn: 0.962 ± 0.237
1.925ArgPro: 1.925 ± 0.474
1.604ArgGln: 1.604 ± 0.409
0.962ArgArg: 0.962 ± 0.599
5.454ArgSer: 5.454 ± 1.857
3.529ArgThr: 3.529 ± 1.816
0.962ArgVal: 0.962 ± 0.515
0.962ArgTrp: 0.962 ± 0.237
1.604ArgTyr: 1.604 ± 0.571
0.0ArgXaa: 0.0 ± 0.0
Ser
3.529SerAla: 3.529 ± 0.133
2.887SerCys: 2.887 ± 0.781
4.171SerAsp: 4.171 ± 1.273
5.133SerGlu: 5.133 ± 1.369
3.529SerPhe: 3.529 ± 1.397
4.491SerGly: 4.491 ± 3.775
4.491SerHis: 4.491 ± 0.758
6.737SerIle: 6.737 ± 1.261
4.812SerLys: 4.812 ± 1.186
9.304SerLeu: 9.304 ± 2.151
3.529SerMet: 3.529 ± 0.962
5.133SerAsn: 5.133 ± 1.3
4.812SerPro: 4.812 ± 0.47
5.454SerGln: 5.454 ± 2.295
1.925SerArg: 1.925 ± 0.474
9.945SerSer: 9.945 ± 4.001
3.85SerThr: 3.85 ± 0.287
5.133SerVal: 5.133 ± 0.679
2.246SerTrp: 2.246 ± 0.804
2.887SerTyr: 2.887 ± 0.685
0.0SerXaa: 0.0 ± 0.0
Thr
2.567ThrAla: 2.567 ± 0.881
0.321ThrCys: 0.321 ± 0.172
1.283ThrAsp: 1.283 ± 0.287
2.246ThrGlu: 2.246 ± 1.12
3.529ThrPhe: 3.529 ± 2.463
2.246ThrGly: 2.246 ± 1.308
1.283ThrHis: 1.283 ± 0.687
5.775ThrIle: 5.775 ± 0.406
3.85ThrLys: 3.85 ± 1.691
7.058ThrLeu: 7.058 ± 0.872
2.887ThrMet: 2.887 ± 0.653
2.567ThrAsn: 2.567 ± 0.848
3.85ThrPro: 3.85 ± 0.88
2.567ThrGln: 2.567 ± 0.848
1.283ThrArg: 1.283 ± 0.559
5.133ThrSer: 5.133 ± 0.879
4.171ThrThr: 4.171 ± 1.293
3.529ThrVal: 3.529 ± 0.133
0.962ThrTrp: 0.962 ± 0.515
1.925ThrTyr: 1.925 ± 0.474
0.0ThrXaa: 0.0 ± 0.0
Val
2.246ValAla: 2.246 ± 1.146
1.283ValCys: 1.283 ± 0.687
1.283ValAsp: 1.283 ± 0.687
1.283ValGlu: 1.283 ± 0.657
0.642ValPhe: 0.642 ± 0.343
2.246ValGly: 2.246 ± 0.727
1.925ValHis: 1.925 ± 0.474
3.85ValIle: 3.85 ± 1.691
3.208ValLys: 3.208 ± 0.108
4.491ValLeu: 4.491 ± 1.179
0.962ValMet: 0.962 ± 0.515
3.529ValAsn: 3.529 ± 0.706
1.283ValPro: 1.283 ± 0.687
1.925ValGln: 1.925 ± 0.557
1.604ValArg: 1.604 ± 0.931
3.208ValSer: 3.208 ± 1.186
1.925ValThr: 1.925 ± 0.394
2.246ValVal: 2.246 ± 0.325
0.642ValTrp: 0.642 ± 0.971
2.246ValTyr: 2.246 ± 0.717
0.0ValXaa: 0.0 ± 0.0
Trp
0.321TrpAla: 0.321 ± 0.172
0.321TrpCys: 0.321 ± 0.426
1.283TrpAsp: 1.283 ± 1.632
0.642TrpGlu: 0.642 ± 0.343
0.321TrpPhe: 0.321 ± 0.172
1.604TrpGly: 1.604 ± 0.512
0.0TrpHis: 0.0 ± 0.0
1.604TrpIle: 1.604 ± 0.409
1.283TrpLys: 1.283 ± 0.657
0.962TrpLeu: 0.962 ± 0.716
0.321TrpMet: 0.321 ± 0.792
0.962TrpAsn: 0.962 ± 0.599
0.321TrpPro: 0.321 ± 0.172
0.642TrpGln: 0.642 ± 0.343
1.604TrpArg: 1.604 ± 1.435
1.604TrpSer: 1.604 ± 1.011
1.283TrpThr: 1.283 ± 0.287
0.642TrpVal: 0.642 ± 0.343
0.321TrpTrp: 0.321 ± 0.172
0.642TrpTyr: 0.642 ± 0.299
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.283TyrAla: 1.283 ± 0.687
0.962TyrCys: 0.962 ± 0.515
1.283TyrAsp: 1.283 ± 1.575
1.925TyrGlu: 1.925 ± 0.76
2.567TyrPhe: 2.567 ± 0.573
1.925TyrGly: 1.925 ± 0.557
2.246TyrHis: 2.246 ± 1.948
4.171TyrIle: 4.171 ± 1.866
4.171TyrLys: 4.171 ± 0.173
4.491TyrLeu: 4.491 ± 1.893
1.283TyrMet: 1.283 ± 0.687
2.246TyrAsn: 2.246 ± 0.59
2.567TyrPro: 2.567 ± 0.34
0.321TyrGln: 0.321 ± 0.172
1.925TyrArg: 1.925 ± 1.03
1.925TyrSer: 1.925 ± 0.76
0.962TyrThr: 0.962 ± 0.237
2.246TyrVal: 2.246 ± 0.804
0.0TyrTrp: 0.0 ± 0.0
1.925TyrTyr: 1.925 ± 0.897
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3118 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski