kuku@worker1:~$ python
Python 2.7.12 (default, Jul 1 2016, 15:12:24)
[GCC 5.4.0 20160609] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>>
As shown, Ubuntu 16.04 ships with Python 2.7 preinstalled.
Next, check whether lxml is available as well:
>>> import lxml
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ImportError: No module named lxml
>>>
>>> import OpenSSL
>>>
lxml is missing but OpenSSL is present, so install lxml first:
kuku@worker1:~$ sudo apt-get install python-lxml
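The import checks done by hand in the interpreter can also be scripted. The following is a minimal sketch, not part of the original walkthrough: the helper name `check_modules` is hypothetical, and the only library call it relies on is `importlib.import_module`, which exists in both Python 2.7 and Python 3.

```python
import importlib


def check_modules(names):
    """Return a dict mapping each module name to True if it imports cleanly."""
    results = {}
    for name in names:
        try:
            importlib.import_module(name)
            results[name] = True
        except ImportError:
            # Raised when the module is not installed for this interpreter.
            results[name] = False
    return results


if __name__ == "__main__":
    # lxml and OpenSSL are the two modules checked by hand in this post.
    for name, ok in sorted(check_modules(["lxml", "OpenSSL"]).items()):
        print("%-8s %s" % (name, "OK" if ok else "MISSING"))
```

Running it with the same `python` binary used above reports each module as OK or MISSING, which is the same information the manual `import` statements give.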
Start the interpreter again:
kuku@worker1:~$ python
Python 2.7.12 (default, Jul 1 2016, 15:12:24)
[GCC 5.4.0 20160609] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import lxml
>>>
No error this time. To make sure the remaining steps install correctly, run the following commands in order:
kuku@worker1:~$ sudo apt-get install python-dev
kuku@worker1:~$ sudo apt-get install libevent-dev
kuku@worker1:~$ sudo apt-get install python-pip
kuku@worker1:~$ sudo pip install --upgrade pip
Finally, run:
kuku@worker1:~$ pip install Scrapy
That completes the Scrapy installation. Run scrapy with no arguments to list its available commands:
kuku@worker1:~$ scrapy
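If the shell answers `scrapy` with "command not found", one possible cause is that pip placed the launcher script in a directory that is not on `PATH` (for example, a per-user install typically puts scripts under `~/.local/bin`). The sketch below is an illustration under that assumption: `find_on_path` is a hypothetical helper, written with only `os` calls so it runs on Python 2.7 and Python 3 alike.

```python
import os


def find_on_path(program, path=None):
    """Return the first executable file named `program` on PATH, or None."""
    if path is None:
        path = os.environ.get("PATH", "")
    for directory in path.split(os.pathsep):
        if not directory:
            # Skip empty PATH entries rather than searching the current directory.
            continue
        candidate = os.path.join(directory, program)
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


if __name__ == "__main__":
    location = find_on_path("scrapy")
    print(location if location else "scrapy not found on PATH")
```

If the script reports that nothing was found even though the install succeeded, checking where pip wrote the `scrapy` script is a reasonable next step.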
You can then create a new project:
kuku@worker1:~$ scrapy startproject newproject
New Scrapy project 'newproject', using template directory '/usr/local/lib/python2.7/dist-packages/scrapy/templates/project', created in:
    /home/kuku/newproject
You can start your first spider with:
    cd newproject
    scrapy genspider example example.com
kuku@worker1:~$
kuku@worker1:~$ sudo apt install tree
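The `tree` package is presumably installed to inspect the layout of the generated project. For readers who would rather not install an extra package, the following sketch lists a directory in a similar indented form using only the standard library. The function name `render_tree` and the four-space indent are choices made for this illustration, and the output format does not try to match `tree` exactly.

```python
import os


def render_tree(root, indent=""):
    """Return indented lines for everything under `root`, sorted by name."""
    lines = []
    for name in sorted(os.listdir(root)):
        path = os.path.join(root, name)
        # Recurse into real directories only; symlinks are listed but not followed.
        is_dir = os.path.isdir(path) and not os.path.islink(path)
        lines.append(indent + name + ("/" if is_dir else ""))
        if is_dir:
            lines.extend(render_tree(path, indent + "    "))
    return lines


if __name__ == "__main__":
    project = "newproject"  # the project created by `scrapy startproject` above
    if os.path.isdir(project):
        print(project + "/")
        print("\n".join(render_tree(project, "    ")))
    else:
        print("directory %r not found; run this from the parent directory" % project)
```

Run it from the directory that contains `newproject` (here, the home directory) to print the project's files and subdirectories.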