Amino acid dipeptide frequency for Lassa virus (strain Mouse/Sierra Leone/Josiah/1976) (LASV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
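The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the toy sequences, and the choice to count overlapping pairs within each protein separately are assumptions made for the example. Residues outside the 20 standard amino acids are mapped to X (Xaa), and each frequency is compared with the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes; X stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation for 400 equally likely dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, counted within one protein only
        # (assumption: pairs do not span protein boundaries).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > UNIFORM_EXPECTATION:
        return "over"
    if freq < UNIFORM_EXPECTATION:
        return "under"
    return "as expected"


# Made-up toy sequences, not LASV proteins.
toy = ["MALLSLK", "LSXA"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["LS"], 1), classify(freqs["LS"]))  # 222.2 over
print(freqs["WW"], classify(freqs["WW"]))            # 0.0 under
```

With only nine dipeptide positions the toy frequencies are of course extreme; the point is only to show how the per-mille values and the over/under labels relate to the 2.5‰ baseline.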

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.555AlaAla: 3.555 ± 1.566
1.185AlaCys: 1.185 ± 0.737
2.962AlaAsp: 2.962 ± 0.619
2.073AlaGlu: 2.073 ± 0.548
0.592AlaPhe: 0.592 ± 0.333
2.073AlaGly: 2.073 ± 2.048
1.481AlaHis: 1.481 ± 0.476
2.666AlaIle: 2.666 ± 1.028
2.37AlaLys: 2.37 ± 1.258
7.405AlaLeu: 7.405 ± 2.041
1.481AlaMet: 1.481 ± 0.181
2.073AlaAsn: 2.073 ± 0.758
1.185AlaPro: 1.185 ± 2.499
1.481AlaGln: 1.481 ± 0.181
1.185AlaArg: 1.185 ± 0.44
4.443AlaSer: 4.443 ± 1.306
2.073AlaThr: 2.073 ± 0.736
5.628AlaVal: 5.628 ± 2.034
1.481AlaTrp: 1.481 ± 0.181
0.889AlaTyr: 0.889 ± 0.5
0.0AlaXaa: 0.0 ± 0.0
Cys
0.296CysAla: 0.296 ± 0.436
0.0CysCys: 0.0 ± 0.0
0.889CysAsp: 0.889 ± 0.954
2.073CysGlu: 2.073 ± 1.166
1.777CysPhe: 1.777 ± 0.592
2.073CysGly: 2.073 ± 1.81
0.889CysHis: 0.889 ± 0.37
1.777CysIle: 1.777 ± 0.14
1.185CysLys: 1.185 ± 1.573
2.962CysLeu: 2.962 ± 1.537
0.296CysMet: 0.296 ± 0.436
2.37CysAsn: 2.37 ± 1.673
1.185CysPro: 1.185 ± 0.987
1.185CysGln: 1.185 ± 0.319
0.296CysArg: 0.296 ± 0.167
1.481CysSer: 1.481 ± 0.181
1.481CysThr: 1.481 ± 0.794
1.185CysVal: 1.185 ± 0.666
1.185CysTrp: 1.185 ± 0.987
0.889CysTyr: 0.889 ± 0.37
0.0CysXaa: 0.0 ± 0.0
Asp
4.443AspAla: 4.443 ± 2.068
1.185AspCys: 1.185 ± 0.319
2.37AspAsp: 2.37 ± 0.865
2.073AspGlu: 2.073 ± 0.729
3.555AspPhe: 3.555 ± 0.663
3.851AspGly: 3.851 ± 1.801
1.777AspHis: 1.777 ± 0.758
4.147AspIle: 4.147 ± 1.098
1.481AspLys: 1.481 ± 0.833
8.886AspLeu: 8.886 ± 3.608
2.37AspMet: 2.37 ± 0.399
2.37AspAsn: 2.37 ± 0.399
2.073AspPro: 2.073 ± 0.795
3.258AspGln: 3.258 ± 0.605
1.481AspArg: 1.481 ± 0.552
2.666AspSer: 2.666 ± 2.053
2.962AspThr: 2.962 ± 0.49
3.851AspVal: 3.851 ± 1.519
0.592AspTrp: 0.592 ± 0.333
2.37AspTyr: 2.37 ± 1.333
0.0AspXaa: 0.0 ± 0.0
Glu
2.962GluAla: 2.962 ± 1.131
1.777GluCys: 1.777 ± 0.827
4.147GluAsp: 4.147 ± 1.382
5.924GluGlu: 5.924 ± 1.884
3.851GluPhe: 3.851 ± 1.666
3.258GluGly: 3.258 ± 0.886
1.185GluHis: 1.185 ± 0.666
3.258GluIle: 3.258 ± 1.231
3.258GluLys: 3.258 ± 0.605
6.517GluLeu: 6.517 ± 0.675
0.592GluMet: 0.592 ± 0.475
2.073GluAsn: 2.073 ± 1.221
2.962GluPro: 2.962 ± 1.303
1.481GluGln: 1.481 ± 0.476
4.147GluArg: 4.147 ± 0.92
3.851GluSer: 3.851 ± 1.056
2.666GluThr: 2.666 ± 0.969
3.851GluVal: 3.851 ± 0.797
0.0GluTrp: 0.0 ± 0.0
2.37GluTyr: 2.37 ± 0.875
0.0GluXaa: 0.0 ± 0.0
Phe
0.889PheAla: 0.889 ± 0.475
1.185PheCys: 1.185 ± 1.51
3.555PheAsp: 3.555 ± 1.051
2.37PheGlu: 2.37 ± 1.025
1.777PhePhe: 1.777 ± 0.14
1.185PheGly: 1.185 ± 0.319
0.592PheHis: 0.592 ± 0.431
1.481PheIle: 1.481 ± 0.552
3.851PheLys: 3.851 ± 1.788
5.036PheLeu: 5.036 ± 1.403
1.185PheMet: 1.185 ± 0.396
2.37PheAsn: 2.37 ± 1.168
1.185PhePro: 1.185 ± 0.666
1.481PheGln: 1.481 ± 0.476
1.777PheArg: 1.777 ± 0.74
3.258PheSer: 3.258 ± 0.812
2.37PheThr: 2.37 ± 0.399
3.555PheVal: 3.555 ± 1.504
0.0PheTrp: 0.0 ± 0.0
2.666PheTyr: 2.666 ± 1.028
0.0PheXaa: 0.0 ± 0.0
Gly
3.258GlyAla: 3.258 ± 1.006
0.889GlyCys: 0.889 ± 0.37
2.073GlyAsp: 2.073 ± 0.249
3.258GlyGlu: 3.258 ± 1.464
0.592GlyPhe: 0.592 ± 0.369
3.851GlyGly: 3.851 ± 1.245
0.889GlyHis: 0.889 ± 0.5
3.258GlyIle: 3.258 ± 0.278
3.851GlyLys: 3.851 ± 1.204
6.517GlyLeu: 6.517 ± 2.998
0.592GlyMet: 0.592 ± 0.873
4.147GlyAsn: 4.147 ± 1.219
2.962GlyPro: 2.962 ± 1.28
2.37GlyGln: 2.37 ± 0.399
4.147GlyArg: 4.147 ± 1.098
4.443GlySer: 4.443 ± 1.091
1.777GlyThr: 1.777 ± 0.951
3.851GlyVal: 3.851 ± 0.808
0.592GlyTrp: 0.592 ± 0.369
2.073GlyTyr: 2.073 ± 0.792
0.0GlyXaa: 0.0 ± 0.0
His
0.296HisAla: 0.296 ± 0.532
0.889HisCys: 0.889 ± 1.09
1.777HisAsp: 1.777 ± 0.592
1.185HisGlu: 1.185 ± 0.666
0.889HisPhe: 0.889 ± 0.475
0.889HisGly: 0.889 ± 0.791
0.889HisHis: 0.889 ± 0.379
1.481HisIle: 1.481 ± 0.476
1.777HisLys: 1.777 ± 0.758
2.37HisLeu: 2.37 ± 1.673
1.481HisMet: 1.481 ± 0.181
1.185HisAsn: 1.185 ± 0.666
0.889HisPro: 0.889 ± 0.37
0.592HisGln: 0.592 ± 0.333
1.185HisArg: 1.185 ± 0.862
1.185HisSer: 1.185 ± 0.396
0.592HisThr: 0.592 ± 0.637
1.185HisVal: 1.185 ± 0.396
0.0HisTrp: 0.0 ± 0.0
2.073HisTyr: 2.073 ± 0.855
0.0HisXaa: 0.0 ± 0.0
Ile
2.666IleAla: 2.666 ± 0.945
1.481IleCys: 1.481 ± 0.961
4.147IleAsp: 4.147 ± 1.59
5.332IleGlu: 5.332 ± 1.25
1.777IlePhe: 1.777 ± 0.592
2.073IleGly: 2.073 ± 0.249
1.185IleHis: 1.185 ± 0.44
2.073IleIle: 2.073 ± 1.22
3.851IleLys: 3.851 ± 1.527
7.701IleLeu: 7.701 ± 2.842
2.37IleMet: 2.37 ± 1.717
2.962IleAsn: 2.962 ± 1.131
2.666IlePro: 2.666 ± 1.588
2.37IleGln: 2.37 ± 1.168
2.37IleArg: 2.37 ± 0.779
5.332IleSer: 5.332 ± 0.871
2.666IleThr: 2.666 ± 0.792
2.962IleVal: 2.962 ± 1.131
0.0IleTrp: 0.0 ± 0.0
1.777IleTyr: 1.777 ± 0.14
0.0IleXaa: 0.0 ± 0.0
Lys
2.962LysAla: 2.962 ± 1.338
2.37LysCys: 2.37 ± 1.168
4.147LysAsp: 4.147 ± 1.047
4.147LysGlu: 4.147 ± 0.92
3.555LysPhe: 3.555 ± 1.504
3.851LysGly: 3.851 ± 1.192
1.481LysHis: 1.481 ± 0.476
2.666LysIle: 2.666 ± 0.435
3.258LysLys: 3.258 ± 1.154
7.998LysLeu: 7.998 ± 2.577
1.481LysMet: 1.481 ± 0.765
1.777LysAsn: 1.777 ± 1.315
0.889LysPro: 0.889 ± 0.475
2.962LysGln: 2.962 ± 1.338
3.555LysArg: 3.555 ± 0.74
7.701LysSer: 7.701 ± 1.939
3.258LysThr: 3.258 ± 0.605
3.555LysVal: 3.555 ± 1.319
1.481LysTrp: 1.481 ± 0.476
3.555LysTyr: 3.555 ± 0.77
0.0LysXaa: 0.0 ± 0.0
Leu
4.147LeuAla: 4.147 ± 1.039
3.555LeuCys: 3.555 ± 0.746
4.739LeuAsp: 4.739 ± 1.453
5.924LeuGlu: 5.924 ± 0.98
2.666LeuPhe: 2.666 ± 0.435
5.628LeuGly: 5.628 ± 0.51
2.073LeuHis: 2.073 ± 0.249
10.664LeuIle: 10.664 ± 1.763
7.701LeuLys: 7.701 ± 1.526
10.664LeuLeu: 10.664 ± 0.98
3.258LeuMet: 3.258 ± 0.985
9.479LeuAsn: 9.479 ± 0.994
2.073LeuPro: 2.073 ± 0.976
4.739LeuGln: 4.739 ± 0.448
7.998LeuArg: 7.998 ± 1.276
13.329LeuSer: 13.329 ± 1.442
7.701LeuThr: 7.701 ± 1.422
6.22LeuVal: 6.22 ± 1.74
1.185LeuTrp: 1.185 ± 0.44
3.851LeuTyr: 3.851 ± 1.204
0.0LeuXaa: 0.0 ± 0.0
Met
0.592MetAla: 0.592 ± 0.431
0.889MetCys: 0.889 ± 0.5
1.777MetAsp: 1.777 ± 1.106
1.777MetGlu: 1.777 ± 0.758
1.185MetPhe: 1.185 ± 0.737
3.258MetGly: 3.258 ± 1.336
0.592MetHis: 0.592 ± 0.369
1.777MetIle: 1.777 ± 0.538
1.777MetLys: 1.777 ± 0.592
2.666MetLeu: 2.666 ± 1.426
1.481MetMet: 1.481 ± 0.833
1.185MetAsn: 1.185 ± 0.396
0.889MetPro: 0.889 ± 1.236
0.296MetGln: 0.296 ± 0.167
1.185MetArg: 1.185 ± 0.319
2.666MetSer: 2.666 ± 1.089
1.481MetThr: 1.481 ± 0.795
2.666MetVal: 2.666 ± 0.411
0.296MetTrp: 0.296 ± 0.167
0.296MetTyr: 0.296 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
2.962AsnAla: 2.962 ± 0.748
1.185AsnCys: 1.185 ± 0.735
1.777AsnAsp: 1.777 ± 0.538
3.258AsnGlu: 3.258 ± 1.148
2.073AsnPhe: 2.073 ± 0.548
2.666AsnGly: 2.666 ± 1.137
2.962AsnHis: 2.962 ± 2.665
2.073AsnIle: 2.073 ± 0.963
4.443AsnLys: 4.443 ± 1.719
5.332AsnLeu: 5.332 ± 0.754
2.073AsnMet: 2.073 ± 2.186
2.962AsnAsn: 2.962 ± 0.991
1.185AsnPro: 1.185 ± 0.44
2.962AsnGln: 2.962 ± 0.748
2.37AsnArg: 2.37 ± 1.025
2.962AsnSer: 2.962 ± 0.722
2.962AsnThr: 2.962 ± 1.267
2.073AsnVal: 2.073 ± 0.834
0.592AsnTrp: 0.592 ± 0.431
1.185AsnTyr: 1.185 ± 0.396
0.0AsnXaa: 0.0 ± 0.0
Pro
1.481ProAla: 1.481 ± 0.833
0.889ProCys: 0.889 ± 0.5
1.777ProAsp: 1.777 ± 0.827
1.185ProGlu: 1.185 ± 0.755
1.777ProPhe: 1.777 ± 0.999
2.073ProGly: 2.073 ± 0.792
1.481ProHis: 1.481 ± 0.795
1.777ProIle: 1.777 ± 0.84
2.073ProLys: 2.073 ± 0.758
3.851ProLeu: 3.851 ± 0.775
1.185ProMet: 1.185 ± 0.661
1.777ProAsn: 1.777 ± 0.538
1.481ProPro: 1.481 ± 2.425
1.185ProGln: 1.185 ± 1.095
1.777ProArg: 1.777 ± 1.322
3.258ProSer: 3.258 ± 0.551
2.962ProThr: 2.962 ± 2.371
1.777ProVal: 1.777 ± 0.14
0.0ProTrp: 0.0 ± 0.0
1.481ProTyr: 1.481 ± 1.269
0.0ProXaa: 0.0 ± 0.0
Gln
2.962GlnAla: 2.962 ± 1.338
1.185GlnCys: 1.185 ± 0.666
1.481GlnAsp: 1.481 ± 0.552
1.185GlnGlu: 1.185 ± 0.396
2.073GlnPhe: 2.073 ± 0.976
2.073GlnGly: 2.073 ± 1.504
0.0GlnHis: 0.0 ± 0.0
2.37GlnIle: 2.37 ± 0.875
3.851GlnLys: 3.851 ± 0.886
5.628GlnLeu: 5.628 ± 0.957
0.296GlnMet: 0.296 ± 0.532
1.777GlnAsn: 1.777 ± 0.592
1.185GlnPro: 1.185 ± 0.804
1.481GlnGln: 1.481 ± 0.664
1.777GlnArg: 1.777 ± 0.951
3.258GlnSer: 3.258 ± 1.231
2.073GlnThr: 2.073 ± 0.548
2.073GlnVal: 2.073 ± 0.795
0.296GlnTrp: 0.296 ± 0.167
2.073GlnTyr: 2.073 ± 1.188
0.0GlnXaa: 0.0 ± 0.0
Arg
2.073ArgAla: 2.073 ± 1.705
0.592ArgCys: 0.592 ± 0.786
3.258ArgAsp: 3.258 ± 1.197
2.962ArgGlu: 2.962 ± 1.105
2.37ArgPhe: 2.37 ± 1.333
2.37ArgGly: 2.37 ± 0.399
0.296ArgHis: 0.296 ± 0.532
2.073ArgIle: 2.073 ± 1.166
2.962ArgLys: 2.962 ± 1.439
7.998ArgLeu: 7.998 ± 1.254
1.777ArgMet: 1.777 ± 0.716
1.777ArgAsn: 1.777 ± 0.74
2.666ArgPro: 2.666 ± 1.644
1.777ArgGln: 1.777 ± 0.592
1.777ArgArg: 1.777 ± 1.077
2.666ArgSer: 2.666 ± 1.028
4.739ArgThr: 4.739 ± 0.571
2.073ArgVal: 2.073 ± 1.082
0.296ArgTrp: 0.296 ± 0.532
1.481ArgTyr: 1.481 ± 0.833
0.0ArgXaa: 0.0 ± 0.0
Ser
3.258SerAla: 3.258 ± 1.371
3.555SerCys: 3.555 ± 2.471
6.517SerAsp: 6.517 ± 0.842
3.555SerGlu: 3.555 ± 1.051
3.555SerPhe: 3.555 ± 1.051
6.517SerGly: 6.517 ± 1.921
2.073SerHis: 2.073 ± 0.792
4.739SerIle: 4.739 ± 2.153
6.22SerLys: 6.22 ± 1.154
10.367SerLeu: 10.367 ± 2.236
1.777SerMet: 1.777 ± 0.999
3.851SerAsn: 3.851 ± 0.706
2.962SerPro: 2.962 ± 1.444
3.555SerGln: 3.555 ± 0.77
4.147SerArg: 4.147 ± 0.497
5.628SerSer: 5.628 ± 2.086
4.147SerThr: 4.147 ± 0.497
4.147SerVal: 4.147 ± 0.803
0.592SerTrp: 0.592 ± 0.333
3.851SerTyr: 3.851 ± 1.506
0.0SerXaa: 0.0 ± 0.0
Thr
3.555ThrAla: 3.555 ± 0.529
0.889ThrCys: 0.889 ± 0.475
2.666ThrAsp: 2.666 ± 1.447
3.555ThrGlu: 3.555 ± 1.051
2.37ThrPhe: 2.37 ± 2.334
2.962ThrGly: 2.962 ± 0.997
1.481ThrHis: 1.481 ± 1.401
3.258ThrIle: 3.258 ± 1.464
3.555ThrLys: 3.555 ± 0.529
5.036ThrLeu: 5.036 ± 1.367
1.185ThrMet: 1.185 ± 0.804
2.073ThrAsn: 2.073 ± 0.729
2.962ThrPro: 2.962 ± 0.804
1.481ThrGln: 1.481 ± 1.657
2.37ThrArg: 2.37 ± 0.793
6.22ThrSer: 6.22 ± 1.864
3.258ThrThr: 3.258 ± 1.544
2.37ThrVal: 2.37 ± 0.399
1.185ThrTrp: 1.185 ± 0.939
0.592ThrTyr: 0.592 ± 0.369
0.0ThrXaa: 0.0 ± 0.0
Val
2.666ValAla: 2.666 ± 0.86
0.296ValCys: 0.296 ± 0.167
5.036ValAsp: 5.036 ± 1.513
5.036ValGlu: 5.036 ± 1.4
1.777ValPhe: 1.777 ± 0.14
3.555ValGly: 3.555 ± 0.804
0.889ValHis: 0.889 ± 0.5
2.666ValIle: 2.666 ± 0.792
5.628ValLys: 5.628 ± 0.889
6.22ValLeu: 6.22 ± 0.632
1.481ValMet: 1.481 ± 0.476
2.073ValAsn: 2.073 ± 0.249
2.37ValPro: 2.37 ± 0.793
2.073ValGln: 2.073 ± 0.44
2.962ValArg: 2.962 ± 1.303
6.517ValSer: 6.517 ± 1.429
1.777ValThr: 1.777 ± 0.662
5.036ValVal: 5.036 ± 2.187
0.889ValTrp: 0.889 ± 0.791
1.185ValTyr: 1.185 ± 0.319
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.185TrpAsp: 1.185 ± 0.804
0.889TrpGlu: 0.889 ± 0.475
0.592TrpPhe: 0.592 ± 0.786
1.185TrpGly: 1.185 ± 0.396
0.296TrpHis: 0.296 ± 0.167
0.592TrpIle: 0.592 ± 0.333
0.592TrpLys: 0.592 ± 0.431
0.889TrpLeu: 0.889 ± 0.379
0.592TrpMet: 0.592 ± 0.637
0.0TrpAsn: 0.0 ± 0.0
0.296TrpPro: 0.296 ± 0.436
0.296TrpGln: 0.296 ± 0.167
0.592TrpArg: 0.592 ± 0.333
0.592TrpSer: 0.592 ± 0.333
0.592TrpThr: 0.592 ± 0.637
1.185TrpVal: 1.185 ± 0.666
0.0TrpTrp: 0.0 ± 0.0
0.889TrpTyr: 0.889 ± 0.379
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.258TyrAla: 3.258 ± 1.344
1.481TyrCys: 1.481 ± 0.795
1.481TyrAsp: 1.481 ± 0.476
2.37TyrGlu: 2.37 ± 0.638
2.962TyrPhe: 2.962 ± 1.666
0.0TyrGly: 0.0 ± 0.0
0.296TyrHis: 0.296 ± 0.167
2.962TyrIle: 2.962 ± 0.946
2.962TyrLys: 2.962 ± 0.362
3.851TyrLeu: 3.851 ± 2.386
1.481TyrMet: 1.481 ± 0.181
1.777TyrAsn: 1.777 ± 0.662
1.185TyrPro: 1.185 ± 0.44
2.073TyrGln: 2.073 ± 0.249
0.889TyrArg: 0.889 ± 0.5
3.555TyrSer: 3.555 ± 0.529
1.481TyrThr: 1.481 ± 0.552
0.889TyrVal: 0.889 ± 0.5
0.592TyrTrp: 0.592 ± 0.431
0.296TyrTyr: 0.296 ± 0.167
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3377 amino acids)
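For anyone working with the table in this flattened text form, the following Python sketch shows one way to read a cell line and convert its per-mille value back into an approximate count. The regular expression and function names are illustrative assumptions. The default of 3376 dipeptide positions (3377 - 1) is also an assumption: it is inferred from the observation that the tabulated values look like whole multiples of 1/3376 (for example 0.296‰ ≈ 1/3376 and 13.329‰ ≈ 45/3376) and is not stated on this page.

```python
import re

# A flattened cell looks like "13.329LeuSer: 13.329 ± 1.442":
# the displayed value, the dipeptide label, then value ± error.
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)

# Assumption inferred from the numbers, not stated in the source.
ASSUMED_PAIRS = 3376


def parse_cell(line):
    """Return (first, second, value, error) or None for non-cell lines."""
    m = CELL.match(line.strip())
    if m is None:
        return None  # e.g. a row label such as "Leu"
    return m["first"], m["second"], float(m["value"]), float(m["error"])


def approx_count(per_mille, pairs=ASSUMED_PAIRS):
    """Convert a per-mille frequency into an approximate occurrence count."""
    return round(per_mille / 1000.0 * pairs)


first, second, value, error = parse_cell("13.329LeuSer: 13.329 ± 1.442")
print(first, second, value, error)   # Leu Ser 13.329 1.442
print(approx_count(value))           # 45
```

Under that assumption, LeuSer, the most frequent dipeptide in the table, corresponds to roughly 45 occurrences, and the smallest non-zero value (0.296‰) to a single occurrence.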

Note: The error was estimated by bootstrapping (×100) at the protein level.
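Bootstrapping at the protein level can be sketched as follows. The exact procedure used for the table is not described here, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the toy sequences are assumptions made for the example.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over resamples.

    Assumes n_rounds >= 2, since a standard deviation needs two estimates.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Made-up toy proteome, not LASV proteins.
toy = ["MKLSLLSA", "LSGGLS", "MAAAKK", "GLSKLL"]
print(bootstrap_error(toy, "LS"))
```

With only four proteins, as in this proteome, each resample contains many repeats of the same protein, which is one reason the reported errors can be large relative to the values themselves.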

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski